CN110046713A - Robustness sequence learning method and its application based on multi-objective particle swarm optimization - Google Patents

Robustness sequence learning method and its application based on multi-objective particle swarm optimization Download PDF

Info

Publication number
CN110046713A
CN110046713A CN201910318891.8A CN201910318891A CN110046713A CN 110046713 A CN110046713 A CN 110046713A CN 201910318891 A CN201910318891 A CN 201910318891A CN 110046713 A CN110046713 A CN 110046713A
Authority
CN
China
Prior art keywords
model
ranking
sequencing
particle
robustness
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910318891.8A
Other languages
Chinese (zh)
Other versions
CN110046713B (en
Inventor
李金忠
夏洁武
曾劲涛
彭蕾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jinggangshan University
Original Assignee
Jinggangshan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jinggangshan University filed Critical Jinggangshan University
Priority to CN201910318891.8A priority Critical patent/CN110046713B/en
Publication of CN110046713A publication Critical patent/CN110046713A/en
Application granted granted Critical
Publication of CN110046713B publication Critical patent/CN110046713B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention relates to a kind of robustness sequence learning method and its application based on multi-objective particle swarm optimization, the following steps are included: step 1, based on deviation-variance isostatic theory, design the validity departure function and robustness variance function of order models, two optimality criterions of building sequence study;Step 2 is based on multi-objective particle frame on sequence learning data set, and the validity departure functions of iteration optimization order models and robustness variance function the two targets are to train order models, to generate order models filing disaggregation;Step 3, based on the thought of the preference ranking organization methods for enrichment evaluations PROMETHEE II in Multiple Attribute Decision Making Theory, the order models filing solution caused by the previous step concentrates selection one order models for having the Pareto of maximum " net flow " ranking value optimal in this, as the final order models trained.Compared with prior art, the present invention has many advantages, such as to improve whole user satisfaction, enhances user experience.

Description

Robustness sequencing learning method based on multi-objective particle swarm optimization and application thereof
Technical Field
The invention relates to the field of information retrieval and machine learning, in particular to a robustness sequencing learning method based on multi-objective particle swarm optimization and application thereof.
Background
The sequencing learning is to use the machine learning technology to automatically train out a sequencing model to solve the sequencing problem. The method is a hotspot problem researched in the field of information retrieval and machine learning, and has wide application prospects in the aspects of information retrieval, search engines, recommendation systems, question-answering systems and the like.
Due to the dynamics of the Web and the diversity of the user information requirements, the performance of some Web search queries may vary greatly and may suffer from significant loss under different ranking models, thereby reducing the user experience. A robust retrieval system should ensure that the user experience is not compromised by the presence of poorly performing queries. Therefore, in order to improve the overall user experience as much as possible, in addition to the traditional relevance and importance criteria, how to ensure the robustness of the ranking model, i.e. relative to a simple benchmark, although an average gain is obtained overall, the new ranking model usually suffers from the performance loss of many queries, which is an important issue faced by the ranking learning research in recent years. Therefore, it is essential to develop a robust ranking learning method to train a robust perceptual ranking model to improve the overall satisfaction of all users as much as possible.
Currently, rank learning researchers compare and evaluate multiple ranking models, mainly by designing advanced ranking features and/or by developing advanced rank learning methods, selecting a best-available ranking model based on some effectiveness metrics, such as Normalized Discounted Cumulative Gain (NDCG) and Expected Reciprocal Ranking (ERR), with the goal of focusing on improving the average effectiveness of the ranking model, which often neglects the robustness of the ranking model. The ranking model with poor robustness can cause instability of a ranking system, namely, the performance of some queries is good, while the performance of other queries is poor, so that the ranking result presented to a user is unstable, and the information requirements of different users are difficult to meet as much as possible, thereby being difficult to bring satisfactory experience to the user. Therefore, in the training process of the ranking model, it is necessary to establish an optimization target meeting the actual requirement, and the effectiveness and the robustness of the ranking model are considered to be optimized simultaneously.
Disclosure of Invention
The invention aims to overcome the defects in the prior art and provides a robustness ranking learning method based on multi-objective particle swarm optimization and application thereof.
The purpose of the invention can be realized by the following technical scheme:
a robustness sequencing learning method based on multi-objective particle swarm optimization comprises the following steps:
designing an effective deviation function and a robust variance function of a ranking model based on a deviation-variance balance theory, and constructing two optimization performance indexes of ranking learning;
secondly, training a ranking model by using two targets of an effectiveness deviation function and a robustness variance function of an iterative optimization ranking model on a ranking learning data set based on a multi-target particle swarm optimization algorithm framework, so as to generate a ranking model filing solution set;
and thirdly, based on the idea of a preference order structure evaluation method PROMETHEE II in the multi-attribute decision theory, selecting a Pareto optimal sorting model with the maximum 'net flow' sorting value from the sorting model filing solution set generated in the last step to serve as a trained final sorting model.
Preferably, the two optimization performance indexes for constructing the ranking learning are specifically:
the effectiveness deviation function and the robustness variance function of the query and the query set under the ranking model are respectively defined as follows:
definitions 1 query qiOf the validity deviation function BiasR(qi) Is defined as:
wherein ,representing a query qiAll documents D underiThe best effectiveness achieved under the ideal ranking model I, i.e. all documents correctly ranked,representing a query qiAll documents D underiActual effectiveness, Bias, obtained under the ranking model RR(qi) Representing a query qiDeviation of actual effectiveness under the ranking model R from the optimal effectiveness of the ideal ranking model I;
definition 2. validity deviation function Bias of query set QR(Q) is defined as:
among them, BiasR(Q) represents all queries Q in a set Q under a ranking model RiIs the average of the validity deviations, | Q | represents the query Q in the query set QiThe total number of (2);
definitions 3 query qiRobust Variance function Variance of (1)R(qi) Is defined as:
VarianceR(qi)=[BiasR(qi)-BiasR(Q)]2…(3)
wherein, VarianceR(qi) Representing queries q under a ranking model RiIs of validity BiasR(qi) Validity deviation Bias from query set QRA degree of dispersion of (Q);
definition 4. robust Variance function Variance of query set QR(Q) is defined as:
wherein, VarianceR(Q) represents all queries Q in a set Q under a ranking model RiThe mean of the robust variance of (a);
converting the robustness ranking learning problem into a multi-objective optimization problem considering both effectiveness and robustness, and formally describing the robustness ranking learning problem as follows according to the definition of the effectiveness deviation function and the robustness variance function:
Utility(Q)={min BiasR(Q),min VarianceR(Q)}…(5)
namely, in the process of sequencing learning, the effectiveness deviation function Bias is minimized at the same timeR(Q) and the robust Variance function VarianceR(Q) to train a ranking model, for which purpose the optimization performance index Bias based on the above-constructed ranking modelR(Q) and VarianceR(Q) a multi-objective intelligent optimization algorithm, such as a multi-objective particle swarm optimization algorithm, can be used while minimizing the BiasR(Q) and VarianceRThe value of (Q) is used for achieving the purpose of balancing the effectiveness and the robustness of the optimized sequencing model.
Preferably, in the multi-objective particle swarm optimization algorithm-based framework, the two objectives of the effectiveness deviation function and the robustness variance function of the iterative optimization ranking model are used to train the ranking model, so that the generation of the archival solution set of the ranking model specifically comprises:
step 1, initializing relevant parameters of a particle swarm;
step 2, based on the given sequencing learning data set, under the ideal sequencing model I, calculating each query qiBest effectiveness of
Step 3, initializing the relevant information of each particle to generate an initial sequencing model set P;
step 4, initializing an iteration counter t to be 0;
step 5, establishing an initial sequencing model filing solution set Archive, selecting a non-dominant sequencing model from the initial sequencing model P, and storing the non-dominant sequencing model in the sequencing model filing solution set Archive;
step 6, calculating the congestion distance of each non-dominant solution in the archiving solution set Archive of the sequencing model;
step 7, arranging the non-dominant solutions in Archive in a descending order according to the magnitude of the congestion distance;
step 8, performing operation on each particle to update the position and speed information of the particle;
step 9, updating the archiving solution set Archive of the sequencing model;
step 10, updating individual extreme values Pbest [ i ] of each particle in the particle swarm P;
and 11, if t is t +1, if t is less than MAXT, turning to the step 6, otherwise, outputting each sequencing model in the sequencing model Archive solution set Archive, namely generating a final sequencing model set.
Preferably, the relevant parameters of step 1 include population size Pop, acceleration factors c1 and c2, and initial inertia weight ω0Final inertial weight ω1Maximum iteration times MAXT, the number N of objective functions and the variation probability Mu.
Preferably, the initializing the relevant information of each particle in step 3 to generate the initial ordering model set P specifically includes:
31) randomly initializing the position P [ i ] of each particle;
real number coding is adopted, and in a feasible ordering model domain of an ordering learning problem, an initial position P [ i ] of each particle is randomly generated, namely a weight corresponding to each ordering characteristic, wherein i is more than or equal to 1 and is less than or equal to Pop;
32) initializing the speed V [ i ] of each particle as 0;
33) calculating an effectiveness deviation function and a robustness variance function value of the sequencing model;
according to the position P [ i ] of each particle]And a linear ranking scoring functionComputing queries qiEach document d ofijWherein f isijm(qi,dij) Representing query-document pairs (q)i,dij) According to different Score (q)i,dij) Value-from-big to-little for each query qiLower documents dijCarrying out top-n rapid sequencing, and marking Y according to the sequencing position and the relevance of the documentiCombining the ideal ordering model I, respectively calculating the query q according to the formula (1) to the formula (4)iAnd the validity deviation function Bias of the query set QR(qi) and BiasR(Q) and a robust Variance function VarianceR(qi) And VarianceR(Q) to obtain target values for an effectiveness bias function and a robustness variance function of the ranking model;
34) initializing an individual extreme value Pbest [ i ] ═ P [ i ] of the particle;
35) and determining the global extreme value Gbest of the initial population according to the Pbest [ i ] of each particle.
Preferably, the step 8 of performing an operation on each particle to update the position and velocity information of the particle is specifically as follows:
81) randomly selecting a certain particle i from a non-inferior solution set at the front end with larger crowding distance in the ordered sequencing model filing solution set Archive, and setting the position of the certain particle i as a global extremum Gbest;
82) updating the velocity V [ i ] of particle i according to equation (6):
Vim(t+1)=ωt*Vim(t)+c1*rand()*[Xim(t)-Pim(t)]+c2*rand()*[XGm(t)-Pim(t)]…(6)
wherein the inertia weight at the t-th iterationc1 and c2 are acceleration constantsNumber factor, rand () is [0, 1 ]]Random number of cells, Xim(t) and XGm(t) respectively representing individual extreme values Pbest [ i ] of the particle i at the t-th iteration]The m-th dimension component of Gbest and the m-th dimension component of the global extreme value Gbest, wherein the integer i is the particle number and takes the values of 1,2, … … and Pop;
83) updating the position P [ i ] of the particle i according to equation (7):
Pim(t+1)=Pim(t)+Vim(t+1)…(7)
84) checking whether P [ i ] is within the limits given by the variables, and if the position range of P [ i ] is exceeded, setting the corresponding dimensional variable in P [ i ] as the corresponding boundary value and setting the speed as the reverse direction, namely-V [ i ];
85) if t < MAXT Mu, then executing variation operation on the position P [ i ] of the particle by taking Mu as variation rate;
86) calculating an objective function value of the particle;
according to the position P [ i ] of the particle]And a linear ranking scoring functionComputing queries qiEach document d ofijWherein f isijm(qi,dij) Representing query-document pairs (q)i,dij) According to different Score (q)i,dij) Value-from-big to-little for each query qiLower documents dijCarrying out top-n rapid sequencing, and marking Y according to the sequencing position and the relevance of the documentiCombining the ideal ordering model I, respectively calculating the query q according to the formula (1) to the formula (4)iAnd the validity deviation function Bias of the query set QR(qi) and BiasR(Q) and a robust Variance function VarianceR(qi) And VarianceR(Q) to obtain target values for a validity deviation function and a robustness variance function of the ranking model.
Preferably, the updating of the Archive solution set Archive of the ranking model in step 9 specifically includes:
if the particles in the population P are not dominated by any particle in the order model Archive solution set Archive, inserting all new non-dominated particles in the order model Archive solution set Archive and deleting all particles dominated by the new particles in the order model Archive solution set Archive, if the order model Archive solution set Archive is full, replacing the particles by the following steps:
step 1, calculating the congestion distance of each non-dominant solution in an Archive solution set Archive of a sequencing model, and arranging the congestion distances in a descending order according to the size of the congestion distance;
and 2, randomly selecting one particle in the non-dominant solution at the front end with smaller crowding distance from the bottom end of the sequencing model Archive solution set Archive, and replacing the particle with a new particle.
Preferably, the step 10 of updating the individual extremum Pbest [ i ] of each particle in the particle group P is specifically:
and comparing the new position of the particle P [ i ] with the advantages and disadvantages of the Pbest [ i ] according to the dominance relation, and updating the individual optimum when the P [ i ] is dominant, namely Pbest [ i ] ═ P [ i ].
Preferably, the third step selects, based on the concept of a preference order structure assessment method prometeei in the multi-attribute decision theory, one Pareto optimal ranking model with the largest "net flow" ranking value from the ranking model filing solution set generated in the previous step, and the selected ranking model is specifically:
step 1, calculating an outflow function of each sequencing model;
respectively calculating the 'outflow' function Out (R) of each sequencing model for the sequencing model filing solution set Archive finally generated in the step twoi) Value, will Out (R)i) Is defined as:
wherein, | A | represents the total number of Pareto optimized solutions in the ranking model Archive solution set Archive;
step 2, calculating an inflow function of each sequencing model;
respectively calculating the 'inflow' function In (R) of each sequencing model for the sequencing model filing solution set Archive finally generated In the step twoi) Value In (R)i) Is defined as:
step 3, calculating a net flow function of each sequencing model;
respectively calculating the Net flow function Net (R) of each sequencing model for the sequencing model filing solution set Archive finally generated in the step twoi) Value, will Net (R)i) Is defined as:
Net(Ri)=Out(Ri)-In(Ri)
step 4. according to the Net flow function Net (R)i) Obtaining each sequencing model R in the sequencing model filing solution set ArchiveiThe "net flow" ranking value of (a) and a Pareto optimization solution with the largest "net flow" ranking value is selected as the final ranking model R.
The application of the robustness ranking learning method based on multi-objective particle swarm optimization is characterized in that the robustness ranking learning method based on multi-objective particle swarm optimization is applied to a search engine, wherein the search engine comprises Baidu Baigougle, Google must Bing, Sogou and Yahoo, a ranking model trained by the method is embedded into a ranking system of the search engine, and a webpage ranking result of query words required to be searched by a user is predicted by the ranking model, so that the satisfaction degree of the whole user can be improved, the experience of the user is enhanced, and the specific application process is as follows:
step 1, integrating a robustness sequencing learning method based on multi-objective particle swarm optimization into a search engine;
firstly, performing data preprocessing on partial webpages in a search engine webpage index database, and extracting and labeling ranking features of the webpages to construct a search engine ranking learning data set;
secondly, iteratively training a ranking model to generate a robustness-perceived ranking model by using a robustness ranking learning method based on multi-objective particle swarm optimization on the constructed ranking learning data set;
finally, embedding the generated robustness sensing sequencing model into a sequencing system of a search engine;
step 2, executing web page search and presenting a sequencing result;
in a search engine integrated with a robustness ranking learning method based on multi-objective particle swarm optimization, a user can circularly execute webpage search for multiple times;
firstly, a user inputs a query word to be searched in a search box of a search engine and clicks for searching;
secondly, a search engine webpage index database is called by a search engine ranking system, all webpages containing the query word are found out from the database, and the ranking results of webpage search are predicted by calculating which webpages should be ranked in front and which should be ranked behind;
and finally, returning the webpage search sorting result to a search page according to a set mode for presenting to a search user.
Compared with the prior art, the invention has the following advantages:
1) the multi-objective optimization model for sequencing learning is novel.
A multi-target sequencing learning model considering effectiveness and robustness at the same time is built, a deviation-variance equilibrium theory is introduced into a sequencing learning task, a corresponding effectiveness deviation function and a robustness variance function are given to measure the effectiveness and the robustness of the sequencing model respectively, the effectiveness deviation function and the robustness variance function are designed independently, and the weight assignment of a plurality of targets is not considered.
2) The method for sequencing learning is new.
Based on a multi-objective particle swarm optimization framework and the concept of a preference order structure assessment method PROMETHEE II in a multi-attribute decision theory, a robustness ranking learning method based on multi-objective particle swarm optimization is designed to optimize the effectiveness and robustness of a ranking model at the same time, so that the trained ranking model has better balance between effectiveness and robustness, and the overall user experience is improved.
Drawings
FIG. 1 is a flow chart of a robustness ranking learning method based on multi-objective particle swarm optimization according to the present invention;
FIG. 2 is a flow chart of the construction of the order model optimization performance indicators of the present invention;
FIG. 3 is a flowchart illustrating an embodiment of training a ranking model according to the present invention;
FIG. 4 is a flow diagram of a preferred embodiment of the ranking model of the present invention;
FIG. 5 is a schematic diagram of a first embodiment of the present invention;
fig. 6 is a schematic diagram of a second embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, shall fall within the scope of protection of the present invention.
The method aims at the problem of robust sequencing learning, considers the effectiveness and robustness of a sequencing model at the same time, optimizes the sequencing model respectively, models the robust sequencing learning problem into a multi-objective optimization problem, gives a corresponding effectiveness deviation function and a robustness variance function to measure the effectiveness and robustness of the sequencing model respectively based on a deviation-variance equilibrium theory, and designs a robust sequencing learning method based on multi-objective particle swarm optimization algorithm framework and a PROMETHEE II idea of a preference order structure evaluation method in a multi-attribute decision theory.
The existing sequencing learning method mainly models the sequencing learning problem into the problems of classification, regression, ordinal regression, convex optimization and the like, and the adopted technologies mainly comprise a support vector machine, a neural network, a multiple addition regression tree, an extreme learning machine, Boosting, genetic programming and the like, and the technologies are difficult to simultaneously optimize a plurality of performance indexes. Compared with the existing sequencing learning method, the robust sequencing learning method based on the multi-target particle swarm optimization is designed aiming at the robust sequencing learning. The sequencing learning method models a sequencing learning problem into a multi-objective optimization problem, and the adopted technology is a multi-objective particle swarm optimization technology. The method designs a plurality of objective functions independently without assigning weights of a plurality of objectives, and can simultaneously optimize the effectiveness and robustness of the ranking model in the training process of the ranking model, so that the trained ranking model has better balance between the effectiveness and the robustness, thereby improving the satisfaction experience of the whole user.
The robustness ranking learning method based on multi-objective particle swarm optimization simultaneously considers the effectiveness and robustness of a ranking model and constructs a ranking learning task as a multi-objective optimization problem; based on a deviation-variance equilibrium theory, designing a corresponding effectiveness deviation function and a robustness variance function to respectively measure the effectiveness and the robustness of the ranking model, wherein the effectiveness deviation function is used for measuring the effectiveness of the ranking model, and the robustness variance function is used for measuring the robustness of the ranking model; designed based on the multi-objective particle swarm optimization algorithm framework and the PROMETHEE II idea of the preference order structure evaluation method in the multi-attribute decision theory.
The flow framework of the robustness ranking learning method based on multi-objective particle swarm optimization is shown in FIG. 1, and mainly comprises three stages of construction of optimization performance indexes of a ranking model, training of the ranking model and optimization of the ranking model. Firstly, designing an effective deviation function and a robust variance function of a ranking model based on a deviation-variance balance theory, and constructing two optimization performance indexes of ranking learning; secondly, on the basis of a multi-objective particle swarm optimization algorithm framework, training a ranking model by iteratively optimizing two targets, namely an effectiveness deviation function and a robustness variance function of the ranking model on the ranking learning data set, so as to generate a ranking model filing solution set; and finally, based on the idea of a preference order structure assessment method PROMETHEE II in the multi-attribute decision theory, selecting a Pareto optimal sorting model with the maximum 'net flow' sorting value from the sorting model filing solution set generated in the last stage to serve as a trained final sorting model.
The robustness ranking learning method based on multi-target particle swarm optimization mainly comprises three stages of construction of the optimization performance index of a ranking model, training of the ranking model and optimization of the ranking model, wherein the specific steps of each stage are as follows:
stage one: and constructing an optimized performance index of the sequencing model.
Based on a deviation-variance equilibrium theory, designing an effectiveness deviation function and a robustness variance function of a ranking model to construct two optimization performance indexes of ranking learning: and the effectiveness and the robustness of the sequencing model are represented.
The steps of constructing the optimized performance index of the ranking model are shown in fig. 2, and the effectiveness deviation function and the robustness variance function of the query and the query set under the ranking model are respectively defined in each step and are set forth as follows:
step 1, constructing an effectiveness deviation function of query.
Definitions 1 query qiOf the validity deviation function BiasR(qi) Is defined as:
wherein ,representing a query qiAll documents D underiThe best effectiveness achieved under the ideal ranking model I, i.e. all documents correctly ranked,representing a query qiAll documents D underiActual effectiveness, Bias, obtained under the ranking model RR(qi) Representing a query qiDeviation of actual effectiveness under ranking model R from the optimal effectiveness of ideal ranking model I. Here, the validity may refer to some commonly used performance indicators in the field of information retrieval, such as Normalized Discounted Cumulative Gain (NDCG), Expected Reciprocal Rank (ERR), and the like, and the calculation formula of each validity indicator is omitted here.
And 2, constructing an effectiveness deviation function of the query set.
Definition 2. validity deviation function Bias of query set QR(Q) is defined as:
among them, BiasR(Q) represents all queries Q in a set Q under a ranking model RiIs the average of the validity deviations, | Q | represents the query Q in the query set QiThe total number of (c).
And 3, constructing a robustness variance function of the query.
Definitions 3 query qiRobust Variance function Variance of (1)R(qi) Is defined as:
VarianceR(qi)=[BiasR(qi)-BiasR(Q)]2…(3)
wherein, VarianceR(qi) Representing queries q under a ranking model RiIs of validity BiasR(qi) Validity deviation Bias from query set QRDegree of dispersion of (Q).
And 4, constructing a robust variance function of the query set.
Definition 4. robust Variance function Variance of query set QR(Q) is defined as:
wherein, VarianceR(Q) represents all queries Q in a set Q under a ranking model RiIs the average of the robust variance of (a).
Converting the robustness ranking learning problem into a multi-objective optimization problem considering both effectiveness and robustness, and formally describing the robustness ranking learning problem as follows according to the definition of the effectiveness deviation function and the robustness variance function:
Utility(Q)={min BiasR(Q),min VarianceR(Q)}…(5)
namely, in the process of sequencing learning, the effectiveness deviation function Bias is minimized at the same timeR(Q) and the robust Variance function VarianceR(Q) to train the ranking model. Therefore, the optimization performance index Bias based on the sequencing model constructed in the above wayR(Q) and VarianceR(Q) a multi-objective intelligent optimization algorithm, such as a multi-objective particle swarm optimization algorithm, can be used to simultaneously minimize the BiasR(Q) and VarianceRThe value of (Q) is used for achieving the purpose of balancing the effectiveness and the robustness of the optimized sequencing model.
And a second stage: and training a sequencing model.
Based on a multi-objective particle swarm optimization algorithm framework, two objective functions of the effectiveness deviation and the robustness variance of the ordering model designed in the previous stage are optimized on the ordering learning data set to iteratively train the ordering model, so that an archiving solution set of the ordering model is generated.
In a specific implementation, for the ordering learning problem, real number encoding is adopted to represent particles, one particle represents an ordering model, and the search space of the particles is M-dimensional (i.e., each particle is composed of M-dimensions, and M represents the total dimension of the ordering features to be considered in the ordering model). The position P and the speed V of the particle can be represented by an M-dimensional vector, the position P of the particle represents the weight corresponding to each sorting feature in the sorting model, and the position P [ i ] of the particle i]Is composed of the weights corresponding to the M-dimensional sorting features in the ith sorting model, i.e. using the vector P [ i]=(Pi1,Pi2,Pi3,…,Pim,…,PiM)TIs shown in which P isimRepresenting the weight corresponding to the M-dimension sorting feature in the ith sorting model, wherein the integer i is 1,2, … and Pop, the integer M is 1,2, … and M, the integer Pop represents the total size of the particle population, and the integer M represents the total dimension of the sorting feature in the sorting model; the velocity V of the particle is used to change the position P of the particle, i.e. to adjust the weight of each ordering feature in the ordering model, and similarly to the definition of the position of the particle, the velocity V [ i ] of the particle i]Using the vector V [ i ]]=(vi1,vi2,vi3,…,vim,…,viM)TTo indicate.
In the robustness ranking learning method based on multi-objective particle swarm optimization, the specific implementation flow of training of the ranking model is shown in fig. 3, and the detailed steps are as follows:
step 1, initializing relevant parameters of particle swarm.
The relevant parameters comprise population size Pop, acceleration factors c1 and c2, and initial inertia weight omega0Final inertial weight ω1Parameters such as maximum iteration times MAXT, the number N of objective functions, the variation probability Mu and the like.
Step 2, based on the given sequencing learning data set, under the ideal sequencing model I, calculating each query qiBest effectiveness of
And 3, initializing the relevant information of each particle to generate an initial sequencing model set P.
① randomly initialize the position P [ i ] of each particle.
Real number coding is adopted, and in a feasible ordering model domain of an ordering learning problem, an initial position P [ i ] of each particle is randomly generated, namely a weight corresponding to each ordering characteristic, wherein i is more than or equal to 1 and less than or equal to Pop.
② initializes the velocity V [ i ] of each particle to 0.
③ calculate the effectiveness deviation function and the robustness variance function value of the ranking model.
According to the position P [ i ] of each particle]And a linear ranking scoring functionComputing queries qiEach document d ofijWherein f isijm(qi,dij) Representing query-document pairs (q)i,dij) The mth dimension of (1) ordering the features. According to different Score (q)i,dij) Value-from-big to-little for each query qiLower documents dijCarrying out top-n rapid sequencing, and marking Y according to the sequencing position and the relevance of the documentiCombining the ideal ordering model I, respectively calculating the query q according to the formula (1) to the formula (4)iAnd the validity deviation function Bias of the query set QR(qi) and BiasR(Q) and a robust Variance function VarianceR(qi) And VarianceR(Q) to obtain target values for a validity deviation function and a robustness variance function of the ranking model.
④ initialize the individual extreme value Pbest [ i ] ═ P [ i ] of the particle.
⑤, determining the global extreme value Gbest of the initial population according to the Pbest [ i ] of each particle.
And 4, initializing an iteration counter t to be 0.
And 5, creating an initial sequencing model filing solution set Archive.
In the initial ranking model P, a non-dominant ranking model is selected and stored in a ranking model Archive solution set Archive.
And 6, calculating the congestion distance of each non-dominant solution in the Archive solution set Archive of the sequencing model.
And 7, arranging the non-dominant solutions in Archive in descending order according to the congestion distance.
And 8, performing the following operation on each particle to update information such as the position and the speed of the particle.
①, randomly selecting a certain particle i from the front non-inferior solution set with large crowding distance in the ordered sequencing model Archive solution set Archive, and setting the position of the certain particle i as a global extremum Gbest.
② update the velocity V [ i ] of particle i according to equation (6):
Vim(t+1)=ωt*Vim(t)+c1*rand()*[Xim(t)-Pim(t)]+c2*rand()*[XGm(t)-Pim(t)]… (6) wherein the inertial weight at the t-th iterationc1 and c2 are acceleration constant factors, and rand () is [0, 1 ]]Random number of cells, Xim(t) and XGm(t) respectively representing individual extreme values Pbest [ i ] of the particle i at the t-th iteration]And the m-th dimension component of the global extremum Gbest, wherein the integer i is the particle number and takes the values of 1,2, … … and Pop.
③ updating the position P [ i ] of particle i according to equation (7):
Pim(t+1)=Pim(t)+Vim(t+1)…(7)
④ checks whether P [ i ] is within the bounds given by the variable and if it is out of its position range, sets the corresponding dimensional variable in P [ i ] to the corresponding bound value with the velocity set to the inverse, i.e., -V [ i ].
⑤ if t < MAXT Mu, then performing mutation operation on the position P [ i ] of the particle with Mu as the mutation rate.
⑥ calculating the objective function value of the particle.
According to the position P [ i ] of the particle]And a linear ranking scoring functionComputing queries qiEach document d ofijWherein f isijm(qi,dij) Representing query-document pairs (q)i,dij) The mth dimension of (1) ordering the features. According to different Score (q)i,dij) Value-from-big to-little for each query qiLower documents dijCarrying out top-n rapid sequencing, and marking Y according to the sequencing position and the relevance of the documentiCombining the ideal ordering model I, respectively calculating the query q according to the formula (1) to the formula (4)iAnd the validity deviation function Bias of the query set QR(qi) and BiasR(Q) and a robust Variance function VarianceR(qi) And VarianceR(Q) to obtain target values for a validity deviation function and a robustness variance function of the ranking model.
And 9, updating the archiving solution set Archive of the sequencing model.
If the particles in the population P are not dominated by any particle in the order model Archive solution set Archive, then insert all new non-dominated particles in the order model Archive solution set Archive and delete all particles in the order model Archive solution set Archive that are dominated by the new particle. If the Archive solution set of the ordering model is full, replacing the particles according to the following steps:
① calculating the congestion distance of each non-dominant solution in the Archive solution set Archive and arranging in descending order according to the size of the congestion distance.
② randomly selects one particle in the non-dominant solution at the front end with smaller crowding distance from the bottom end of the sequencing model Archive solution set Archive to replace it with a new particle.
And step 10, updating the individual extreme value Pbest [ i ] of each particle in the particle swarm P.
And comparing the new position of the particle P [ i ] with the advantages and disadvantages of the Pbest [ i ] according to the dominance relation, and updating the individual optimum when the P [ i ] is dominant, namely Pbest [ i ] ═ P [ i ].
And 11, if t is t +1, if t is less than MAXT, turning to the step 6, otherwise, outputting each sequencing model in the sequencing model Archive solution set Archive, namely generating a final sequencing model set.
And a third stage: preference of ranking model.
By the idea of a preference order structure assessment method PROMETHEE II based on a multi-attribute decision theory, the effectiveness deviation function Bias of each sequencing model is combinedR(Q) and the robust Variance function VarianceRAnd (Q), selecting a Pareto optimal solution with the maximum net flow sorting value from the sorting model filing solution set generated in the last stage, namely, an optimal sorting model, and taking the Pareto optimal solution as a trained final sorting model. SortingThe preferred implementation flow of the model is shown in fig. 4, and the implementation steps are as follows:
step 1, calculating an outflow function of each sequencing model.
For the ranking model filing solution set Archive finally generated in the second stage, respectively calculating the 'outflow' function Out (R) of each ranking modeli) Value, will Out (R)i) Is defined as:
wherein | A | represents the total number of Pareto optimized solutions in the ranking model Archive solution set Archive.
And 2, calculating an inflow function of each sequencing model.
For the Archive solution set Archive of the ranking models finally generated In the second stage, the 'inflow' function In (R) of each ranking model is calculated respectivelyi) Value In (R)i) Is defined as:
and 3, calculating a 'net flow' function of each sequencing model.
For the Archive solution set Archive of the ranking models finally generated in the second stage, respectively calculating the Net flow function Net (R) of each ranking modeli) Value, will Net (R)i) Is defined as:
Net(Ri)=Out(Ri)-In(Ri)
step 4. according to the Net flow function Net (R)i) Obtaining each sequencing model R in the sequencing model filing solution set ArchiveiThe "net flow" ranking value of (a) and a Pareto optimization solution with the largest "net flow" ranking value is selected as the final ranking model R.
The specific embodiment is as follows:
as shown in fig. 5, assume that there exists a standard rank learning data set L2Rdataset, which is expressed as a set: l2Rdataset { (< q)i,dij>,yij)|qi∈Q,dij∈Di,yij∈Yi,1≤i≤|Q|,1≤j≤|Di|}, wherein qiRepresenting the ith query, Q representing a finite set of queries in the sorted learning data set, | Q | representing the total number of queries in the set of queries, dijRepresenting a collection of documents DiJ document of (1), DiRepresenting associations to queries qiDocument collection, | DiI represents a document set DiTotal number of documents in, yijRepresenting relevance annotations, reflecting the query qiAnd document dijThe degree of correlation between the two groups can be some grade values, such as {1, 2, 3, 4, 5}, Yi={yi1,yi2,…,yi|Di|Denotes the set of tags associated with the query. Query-document pair < qi,dijIs described by an M-dimensional ordering feature f, which can be expressed as < qi,dij>={qi,fij1,fij1,…,fijm,…,fijM}. Through the construction of the "ranking model optimization performance index" of the first stage, four performance indexes are generated: biasR(qi)、BiasR(Q)、VarianceR(qi) And VarianceR(Q), wherein BiasR(Q) and VarianceR(Q) is an objective function to be optimized by the multi-objective particle swarm optimization algorithm in the second stage. In the second stage "training of ranking model", for a given ranked learning data set, L2Rdataset, the query-document pairs < q in L2Rdataset are readi,dijCharacteristic value of qi,fij1,fij1,…,fijm,…,fijMAnd relevance notation yijWaiting for data, and calculating the Bias under an initial ranking model RR(qi)、BiasR(Q)、VarianceR(qi) And VarianceR(Q) value, effectiveness deviation function Bias of iterative optimization sequencing model based on multi-objective particle swarm optimization algorithm frameworkR(Q) and the robust Variance function VarianceR(Q) to continuously update the ranking model, ultimately producing an Archive solution set Archive of the ranking model. In the third stage, "optimization of the ranking model", for the ranking model filing solution set Archive generated in the previous stage, the "outflow" function value, "inflow" function value and "net flow" function value of each ranking model are calculated according to the corresponding method respectively based on the idea of preference order structure assessment method promete II in the multi-attribute decision theory, and then a Pareto optimization solution with the maximum net flow ranking value, i.e., the optimal ranking model, is selected from the Archive solution set to serve as the trained final ranking model R, the ranking model is a robustness-aware ranking model, and the ranking model can be used to predict the document ranking of new queries. When a new query q needs to retrieve the documents, screening out a corresponding document set in a document database D through a sorting system, scoring the fed-back document set based on a generated sorting model R, then performing descending order on the document set according to the scoring size, generating a sorting result of the predicted documents of the new query q, and outputting and displaying the sorting result.
The second embodiment is as follows:
as shown in fig. 6, the robustness ranking learning method based on multi-objective particle swarm optimization can be applied to the application scenarios of actual demand ranking such as information retrieval, search engines, recommendation systems, question answering systems, and the like. Here, the robust ranking learning method based on multi-objective particle swarm optimization is applied to search engines such as Baidu (Baidu), Google (Google), Bing (Bing), Saogu (Sogou), Yahoo (Yahoo), etc. as application examples. The ranking model trained by the method is embedded into a ranking system of a search engine, and the ranking model is used for predicting the webpage ranking result of the query word required to be searched by the user, so that the satisfaction degree of the whole user can be improved, and the experience of the user is enhanced.
The operation implementation flow of the robustness ranking learning method based on multi-objective particle swarm optimization applied to the search engine is shown in fig. 6, and the implementation steps are as follows:
step 1, integrating a robustness sequencing learning method based on multi-objective particle swarm optimization into a search engine.
Firstly, data preprocessing is carried out on partial webpages in a search engine webpage index database, and extraction and labeling of ranking features are carried out on the webpages to construct a search engine ranking learning data set.
Secondly, on the constructed sequencing learning data set, a robustness sequencing learning method based on multi-objective particle swarm optimization is used for iteratively training a sequencing model to generate a robustness-perceived sequencing model.
Finally, the generated robustness-aware ranking model is embedded in a ranking system of a search engine.
And 2, executing webpage search and presenting a sequencing result.
In a search engine integrated with a robustness ranking learning method based on multi-objective particle swarm optimization, a user can execute webpage search repeatedly in a circulating mode.
First, a user inputs a query word desired to be searched in a search box of a search engine, and clicks a search.
Secondly, the ranking system of the search engine calls a search engine web page index database to find out all web pages containing the query word, and calculates which web pages should be ranked in front and which web pages should be ranked behind to predict the ranking result of the web page search.
Finally, the web page search ranking results are returned to the "search" page in a manner for presentation to the searching user.
While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications and substitutions can be easily made by those skilled in the art within the technical scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A robustness sequencing learning method based on multi-objective particle swarm optimization is characterized by comprising the following steps:
designing an effective deviation function and a robust variance function of a ranking model based on a deviation-variance balance theory, and constructing two optimization performance indexes of ranking learning;
secondly, training a ranking model by using two targets of an effectiveness deviation function and a robustness variance function of an iterative optimization ranking model on a ranking learning data set based on a multi-target particle swarm optimization algorithm framework, so as to generate a ranking model filing solution set;
and thirdly, based on the idea of a preference order structure evaluation method PROMETHEE II in the multi-attribute decision theory, selecting a Pareto optimal sorting model with the maximum 'net flow' sorting value from the sorting model filing solution set generated in the last step to serve as a trained final sorting model.
2. The method for robustness ranking learning based on multi-objective particle swarm optimization according to claim 1, wherein the effectiveness deviation function and the robustness variance function of the design ranking model are used to construct two optimization performance indexes of ranking learning, specifically:
the effectiveness deviation function and the robustness variance function of the query and the query set under the ranking model are respectively defined as follows:
definitions 1 query qiOf the validity deviation function BiasR(qi) Is defined as:
wherein ,representing a query qiAll documents D underiThe best effectiveness achieved under the ideal ranking model I, i.e. all documents correctly ranked,representing all documents D under a query qiiActual effectiveness, Bias, obtained under the ranking model RR(qi) Representing a query qiDeviation of actual effectiveness under the ranking model R from the optimal effectiveness of the ideal ranking model I;
definition 2. validity deviation function Bias of query set QR(Q) is defined as:
among them, BiasR(Q) represents all queries Q in a set Q under a ranking model RiIs the average of the validity deviations, | Q | represents the query Q in the query set QiThe total number of (2);
definitions 3 query qiRobust Variance function Variance of (1)R(qi) Is defined as:
VarianceR(qi)=[BiasR(qi)-BiasR(Q)]2…(3)
wherein, VarianceR(qi) Representing queries q under a ranking model RiIs of validity BiasR(qi) Validity deviation Bias from query set QRA degree of dispersion of (Q);
definition 4. robust Variance function Variance of query set QR(Q) is defined as:
wherein, VarianceR(Q) represents all queries Q in a set Q under a ranking model RiThe mean of the robust variance of (a);
converting the robustness ranking learning problem into a multi-objective optimization problem considering both effectiveness and robustness, and formally describing the robustness ranking learning problem as follows according to the definition of the effectiveness deviation function and the robustness variance function:
Utility(Q)={min BiasR(Q),min VarianceR(Q)}…(5)
namely, in the process of sequencing learning, the effectiveness deviation function Bias is minimized at the same timeR(Q) and the robust Variance function VarianceR(Q) to train a ranking model, for which purpose the optimization performance index Bias based on the above-constructed ranking modelR(Q) and VarianceR(Q) a multi-objective intelligent optimization algorithm, such as a multi-objective particle swarm optimization algorithm, can be used while minimizing the BiasR(Q) and VarianceRThe value of (Q) is used for achieving the purpose of balancing the effectiveness and the robustness of the optimized sequencing model.
3. The method for learning robust ranking based on multi-objective particle swarm optimization according to claim 2, wherein the framework based on multi-objective particle swarm optimization algorithm trains the ranking model according to two objectives of an effectiveness deviation function and a robust variance function of an iterative optimization ranking model, so as to generate a ranking model filing solution set, specifically:
step 1, initializing relevant parameters of a particle swarm;
step 2, based on the given sequencing learning data set, under the ideal sequencing model I, calculating each query qiBest effectiveness of
Step 3, initializing the relevant information of each particle to generate an initial sequencing model set P;
step 4, initializing an iteration counter t to be 0;
step 5, establishing an initial sequencing model filing solution set Archive, selecting a non-dominant sequencing model from the initial sequencing model P, and storing the non-dominant sequencing model in the sequencing model filing solution set Archive;
step 6, calculating the congestion distance of each non-dominant solution in the archiving solution set Archive of the sequencing model;
step 7, arranging the non-dominant solutions in Archive in a descending order according to the magnitude of the congestion distance;
step 8, performing operation on each particle to update the position and speed information of the particle;
step 9, updating the archiving solution set Archive of the sequencing model;
step 10, updating individual extreme values Pbest [ i ] of each particle in the particle swarm P;
and 11, if t is t +1, if t is less than MAXT, turning to the step 6, otherwise, outputting each sequencing model in the sequencing model Archive solution set Archive, namely generating a final sequencing model set.
4. The method as claimed in claim 3, wherein the relevant parameters of step 1 include population size Pop, acceleration factors c1 and c2, and initial inertial weight ω0Final inertial weight ω1Maximum iteration times MAXT, the number N of objective functions and the variation probability Mu.
5. The method according to claim 3, wherein the initializing the relevant information of each particle to generate the initial ranking model set P in step 3 is specifically:
31) randomly initializing the position P [ i ] of each particle;
real number coding is adopted, and in a feasible ordering model domain of an ordering learning problem, an initial position P [ i ] of each particle is randomly generated, namely a weight corresponding to each ordering characteristic, wherein i is more than or equal to 1 and is less than or equal to Pop;
32) initializing the speed V [ i ] of each particle as 0;
33) calculating an effectiveness deviation function and a robustness variance function value of the sequencing model;
according to the position P [ i ] of each particle]And a linear ranking scoring functionComputing queries qiEach document d ofijWherein f isijm(qi,dij) Representing query-document pairs (q)i,dij) According to different Score (q)i,dij) Value-from-big to-little for each query qiLower documents dijCarrying out top-n rapid sequencing, and marking Y according to the sequencing position and the relevance of the documentiCombining the ideal ordering model I, respectively calculating the query q according to the formula (1) to the formula (4)iAnd the validity deviation function Bias of the query set QR(qi) and BiasR(Q) and a robust Variance function VarianceR(qi) And VarianceR(Q) to obtain target values for an effectiveness bias function and a robustness variance function of the ranking model;
34) initializing an individual extreme value Pbest [ i ] ═ P [ i ] of the particle;
35) and determining the global extreme value Gbest of the initial population according to the Pbest [ i ] of each particle.
6. The method for robustness ranking learning based on multi-objective particle swarm optimization according to claim 3, wherein the step 8 of performing an operation on each particle to update the position and velocity information of the particle is specifically as follows:
81) randomly selecting a certain particle i from a non-dominant solution set at the front end with a larger crowding distance in the ordered sequencing model filing solution set Archive, and setting the position of the certain particle i as a global extremum Gbest;
82) updating the velocity V [ i ] of particle i according to equation (6):
Vim(t+1)=ωt*Vim(t)+c1*rand()*[Xim(t)-Pim(t)]+c2*rand()*[XGm(t)-Pim(t)]…(6)
wherein the inertia weight at the t-th iterationc1 and c2 are acceleration constant factors, and rand () is [0, 1 ]]Random number of cells, Xim(t) and XGm(t) respectively representing individual extreme values Pbest [ i ] of the particle i at the t-th iteration]The m-th dimension component of Gbest and the m-th dimension component of the global extreme value Gbest, wherein the integer i is the particle number and takes the values of 1,2, … … and Pop;
83) updating the position P [ i ] of the particle i according to equation (7):
Pim(t+1)=Pim(t)+Vim(t+1)…(7)
84) checking whether P [ i ] is within the limits given by the variables, and if the position range of P [ i ] is exceeded, setting the corresponding dimensional variable in P [ i ] as the corresponding boundary value and setting the speed as the reverse direction, namely-V [ i ];
85) if t < MAXT Mu, then executing variation operation on the position P [ i ] of the particle by taking Mu as variation rate;
86) calculating an objective function value of the particle;
according to the position P [ i ] of the particle]And a linear ranking scoring functionComputing queries qiEach document d ofijWherein f isijm(qi,dij) Representing query-document pairs (q)i,dij) According to different Score (q)i,dij) Value-from-big to-little for each query qiLower documents dijCarrying out top-n rapid sequencing, and marking Y according to the sequencing position and the relevance of the documentiCombining the ideal ordering model I, respectively calculating the query q according to the formula (1) to the formula (4)iAnd the validity deviation function Bias of the query set QR(qi) and BiasR(Q) and a robust Variance function VarianceR(qi) And VarianceR(Q) to obtain target values for a validity deviation function and a robustness variance function of the ranking model.
7. The method for robustness ranking learning based on multi-objective particle swarm optimization according to claim 3, wherein the step 9 of updating the ranking model Archive solution set Archive specifically comprises:
if the particles in the population P are not dominated by any particle in the order model Archive solution set Archive, inserting all new non-dominated particles in the order model Archive solution set Archive and deleting all particles dominated by the new particles in the order model Archive solution set Archive, if the order model Archive solution set Archive is full, replacing the particles by the following steps:
step 1, calculating the congestion distance of each non-dominant solution in an Archive solution set Archive of a sequencing model, and arranging the congestion distances in a descending order according to the size of the congestion distance;
and 2, randomly selecting one particle in the non-dominant solution at the front end with smaller crowding distance from the bottom end of the sequencing model Archive solution set Archive, and replacing the particle with a new particle.
8. The method for robustness ranking learning based on multi-objective particle swarm optimization according to claim 3, wherein the step 10 of updating the individual extremum Pbest [ i ] of each particle in the particle swarm P is specifically as follows:
and comparing the new position of the particle P [ i ] with the advantages and disadvantages of the Pbest [ i ] according to the dominance relation, and updating the individual optimum when the P [ i ] is dominant, namely Pbest [ i ] ═ P [ i ].
9. The method for learning robustness ranking based on multi-objective particle swarm optimization according to claim 1, wherein in the third step, based on an idea of a preference order structure assessment method promethe II in a multi-attribute decision theory, a Pareto optimal ranking model with a maximum "net flow" ranking value is selected from the ranking model filing solution set generated in the previous step, and the selected ranking model is specifically:
step 1, calculating an outflow function of each sequencing model;
respectively calculating the 'outflow' function Out (R) of each sequencing model for the sequencing model filing solution set Archive finally generated in the step twoi) Value, will Out (R)i) Is defined as:
wherein, | A | represents the total number of Pareto optimized solutions in the ranking model Archive solution set Archive;
step 2, calculating an inflow function of each sequencing model;
respectively calculating the 'inflow' function In (R) of each sequencing model for the sequencing model filing solution set Archive finally generated In the step twoi) Value In (R)i) Is defined as:
step 3, calculating a net flow function of each sequencing model;
respectively calculating the Net flow function Net (R) of each sequencing model for the sequencing model filing solution set Archive finally generated in the step twoi) Value, will Net (R)i) Is defined as:
Net(Ri)=Out(Ri)-In(Ri)
step 4. according to the Net flow function Net (R)i) Obtaining each sequencing model R in the sequencing model filing solution set ArchiveiThe "net flow" ranking value of (a) and a Pareto optimization solution with the largest "net flow" ranking value is selected as the final ranking model R.
10. The application of the robustness ranking learning method based on multi-objective particle swarm optimization is characterized in that the robustness ranking learning method based on multi-objective particle swarm optimization is applied to a search engine, wherein the search engine comprises Baidu, Google, Bing, Saogouu and Yahoo, a ranking model trained by the method is embedded into a ranking system of the search engine, and the ranking model is used for predicting a webpage ranking result of query words required to be searched by a user, so that the satisfaction degree of the whole user can be improved, and the experience of the user is enhanced, and the specific application process is as follows:
step 1, integrating a robustness sequencing learning method based on multi-objective particle swarm optimization into a search engine;
firstly, performing data preprocessing on partial webpages in a search engine webpage index database, and extracting and labeling ranking features of the webpages to construct a search engine ranking learning data set;
secondly, iteratively training a ranking model to generate a robustness-perceived ranking model by using a robustness ranking learning method based on multi-objective particle swarm optimization on the constructed ranking learning data set;
finally, embedding the generated robustness sensing sequencing model into a sequencing system of a search engine;
step 2, executing web page search and presenting a sequencing result;
in a search engine integrated with a robustness ranking learning method based on multi-objective particle swarm optimization, a user can circularly execute webpage search for multiple times;
firstly, a user inputs a query word to be searched in a search box of a search engine and clicks for searching;
secondly, a search engine webpage index database is called by a search engine ranking system, all webpages containing the query word are found out from the database, and the ranking results of webpage search are predicted by calculating which webpages should be ranked in front and which should be ranked behind;
and finally, returning the webpage search sorting result to a search page according to a set mode for presenting to a search user.
CN201910318891.8A 2019-04-19 2019-04-19 Robustness ordering learning method based on multi-target particle swarm optimization and application thereof Active CN110046713B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910318891.8A CN110046713B (en) 2019-04-19 2019-04-19 Robustness ordering learning method based on multi-target particle swarm optimization and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910318891.8A CN110046713B (en) 2019-04-19 2019-04-19 Robustness ordering learning method based on multi-target particle swarm optimization and application thereof

Publications (2)

Publication Number Publication Date
CN110046713A true CN110046713A (en) 2019-07-23
CN110046713B CN110046713B (en) 2023-05-12

Family

ID=67278070

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910318891.8A Active CN110046713B (en) 2019-04-19 2019-04-19 Robustness ordering learning method based on multi-target particle swarm optimization and application thereof

Country Status (1)

Country Link
CN (1) CN110046713B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111581546A (en) * 2020-05-13 2020-08-25 北京达佳互联信息技术有限公司 Method, device, server and medium for determining multimedia resource sequencing model
CN111783209A (en) * 2020-07-03 2020-10-16 吉林大学 Self-adaptive structure reliability analysis method combining learning function and kriging model
CN111881359A (en) * 2020-08-04 2020-11-03 携程计算机技术(上海)有限公司 Sorting method, system, equipment and storage medium in internet information retrieval
CN112100529A (en) * 2020-11-17 2020-12-18 北京三快在线科技有限公司 Search content ordering method and device, storage medium and electronic equipment
CN112508202A (en) * 2021-02-07 2021-03-16 北京淇瑀信息科技有限公司 Method and device for adjusting model stability and electronic equipment
CN113642632A (en) * 2021-08-11 2021-11-12 国网冀北电力有限公司计量中心 Power system customer classification method and device based on adaptive competition and balance optimization

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120226654A1 (en) * 2009-11-05 2012-09-06 Bae Systems Plc Generating a set of solutions to a multi-objective problem
CN108229657A (en) * 2017-12-25 2018-06-29 杭州健培科技有限公司 A kind of deep neural network training and optimization algorithm based on evolution algorithmic
CN108469983A (en) * 2018-04-02 2018-08-31 西南交通大学 A kind of virtual machine deployment method based on particle cluster algorithm under cloud environment
CN109063242A (en) * 2018-06-20 2018-12-21 中国人民解放军国防科技大学 Guidance tool error identification method based on particle swarm optimization

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120226654A1 (en) * 2009-11-05 2012-09-06 Bae Systems Plc Generating a set of solutions to a multi-objective problem
CN108229657A (en) * 2017-12-25 2018-06-29 杭州健培科技有限公司 A kind of deep neural network training and optimization algorithm based on evolution algorithmic
CN108469983A (en) * 2018-04-02 2018-08-31 西南交通大学 A kind of virtual machine deployment method based on particle cluster algorithm under cloud environment
CN109063242A (en) * 2018-06-20 2018-12-21 中国人民解放军国防科技大学 Guidance tool error identification method based on particle swarm optimization

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GARY G. YEN 等: ""Dynamic Multiple Swarms in Multiobjective Particle Swarm Optimization"", 《IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS - PART A: SYSTEMS AND HUMANS》 *
张建华 等: ""利用粒子群算法计算AHP判断矩阵排序权重"", 《太原科技大学学报》 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111581546A (en) * 2020-05-13 2020-08-25 北京达佳互联信息技术有限公司 Method, device, server and medium for determining multimedia resource sequencing model
CN111581546B (en) * 2020-05-13 2023-10-03 北京达佳互联信息技术有限公司 Method, device, server and medium for determining multimedia resource ordering model
CN111783209A (en) * 2020-07-03 2020-10-16 吉林大学 Self-adaptive structure reliability analysis method combining learning function and kriging model
CN111783209B (en) * 2020-07-03 2022-09-27 吉林大学 Self-adaptive structure reliability analysis method combining learning function and kriging model
CN111881359A (en) * 2020-08-04 2020-11-03 携程计算机技术(上海)有限公司 Sorting method, system, equipment and storage medium in internet information retrieval
CN111881359B (en) * 2020-08-04 2023-11-17 携程计算机技术(上海)有限公司 Ordering method, ordering system, ordering equipment and ordering storage medium in internet information retrieval
CN112100529A (en) * 2020-11-17 2020-12-18 北京三快在线科技有限公司 Search content ordering method and device, storage medium and electronic equipment
CN112508202A (en) * 2021-02-07 2021-03-16 北京淇瑀信息科技有限公司 Method and device for adjusting model stability and electronic equipment
CN113642632A (en) * 2021-08-11 2021-11-12 国网冀北电力有限公司计量中心 Power system customer classification method and device based on adaptive competition and balance optimization
CN113642632B (en) * 2021-08-11 2023-10-27 国网冀北电力有限公司计量中心 Power system customer classification method and device based on self-adaptive competition and equalization optimization

Also Published As

Publication number Publication date
CN110046713B (en) 2023-05-12

Similar Documents

Publication Publication Date Title
CN110046713A (en) Robustness sequence learning method and its application based on multi-objective particle swarm optimization
CN109408600B (en) Book recommendation method based on data mining
CN108021658B (en) Intelligent big data searching method and system based on whale optimization algorithm
CN110110225B (en) Online education recommendation model based on user behavior data analysis and construction method
JP2010165348A (en) Method for performing annotation and computer program therefor
CN108897791B (en) Image retrieval method based on depth convolution characteristics and semantic similarity measurement
CN103593425A (en) Preference-based intelligent retrieval method and system
CN104573130A (en) Entity resolution method based on group calculation and entity resolution device based on group calculation
CN107145519B (en) Image retrieval and annotation method based on hypergraph
JP2020512651A (en) Search method, device, and non-transitory computer-readable storage medium
CN102968419A (en) Disambiguation method for interactive Internet entity name
CN111078835A (en) Resume evaluation method and device, computer equipment and storage medium
Manne et al. Text categorization with K-nearest neighbor approach
Zhang et al. Visual search reranking via adaptive particle swarm optimization
CN113435101B (en) Particle swarm optimization-based power failure prediction method for support vector machine
CN116662521B (en) Electronic document screening and inquiring method and system
JP5432936B2 (en) Document search apparatus having ranking model selection function, document search method having ranking model selection function, and document search program having ranking model selection function
CN112182026A (en) Power grid section data retrieval method considering manifold sorting algorithm
CN111444414A (en) Information retrieval model for modeling various relevant characteristics in ad-hoc retrieval task
CN115829683A (en) Power integration commodity recommendation method and system based on inverse reward learning optimization
CN113312523B (en) Dictionary generation and search keyword recommendation method and device and server
Kim et al. Evolutionary learning of web-document structure for information retrieval
CN115391479A (en) Ranking method, device, electronic medium and storage medium for document search
KR101592670B1 (en) Apparatus for searching data using index and method for using the apparatus
CN106649537A (en) Search engine keyword optimization technology based on improved swarm intelligence algorithm

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant