CN110060097A

CN110060097A - User behavior sequence of recommendation method based on attention mechanism and convolutional neural networks

Info

Publication number: CN110060097A
Application number: CN201910257110.9A
Authority: CN
Inventors: 鲜学丰; 赵朋朋; 刘建
Original assignee: Suzhou Vocational University
Current assignee: Suzhou Vocational University
Priority date: 2019-04-01
Filing date: 2019-04-01
Publication date: 2019-07-26

Abstract

The user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks that the invention discloses a kind of, wherein the following steps are included: acquiring the passing behavioral data of user as long-term preference interaction sequence data；The weight of each behavioral data in the long-term preference interaction sequence data is calculated using attention mechanism algorithm, and goes out the long-term preference of user according to the weight calculation；The behavioral data in a period of time is chosen as short-term preference interaction sequence data；According to the short-term preference interaction sequence data, the short-term preference of user is calculated using convolutional neural networks algorithm；The long-term preference of user is spliced with short-term preference, to generate user behavior sequence of recommendation；The present invention can recommend compared to the method for the interaction between traditional fitting user-project or project-project, the long-term preference and short-term preference of enough combination users quickly and effectively to user.

Description

User behavior sequence of recommendation method based on attention mechanism and convolutional neural networks

Technical field

The present invention relates to user behavior sequence of recommendation methods, are based on attention mechanism and volume more specifically to one kind The user behavior sequence of recommendation method of product neural network.

Background technique

As development of Mobile Internet technology continues to develop, the information content in network is extended and is increased, net rapidly with exponential law Information overload and information puzzle problem on network are got worse.In order to provide the user with satisfied information and service, recommender system It comes into being, becomes the research field of numerous researcher's concerns.Happiness of the recommender system by prediction user to information resources Good degree carries out information filtering.

With the fast development of platform economy, Amazon, many companies such as Taobao and excellent step are creating self ecosystem System, by keeping user here with interacting for products & services.User can access these by mobile device in daily life Platform can thus generate a large amount of user behaviors log.For example, in June, 2017, excellent step has accumulated 6800 general-purpose families travelling number According to；Registering more than 11,000,000,000 has been produced more than 5000 general-purpose families on Foursquare.Effective recommender system is constructed, Key factor is accurately to characterize and understand the interest and taste of user, and these are changed and development continuous always.In order to Realize the target, sequence of recommendation comes into being, with the item that may be interacted based on user's history active sequences come recommended user Mesh.

It is different from traditional recommender system, in sequence of recommendation, transaction or session of more and more data from series. These transaction or session form the sequence pattern of user, and the project of the next possible access of user is more likely to depend on user most The nearly project for generating interaction.User is caused next to buy the probability of flour higher than independent for example, buying milk and butter together Buy the probability of milk or butter.However, traditional recommended method, such as collaborative filtering, matrix decomposition and Top-N recommend, it is uncomfortable Capture sequence pattern is closed, because they will not model the sequence of movement.In order to solve this problem, early stage based on horse The sequence method of Markov's chain generallys use individual model to characterize the long-term preference of user and short-term preference, then by them It integrates.However, the partial order behavior between these Method Modeling every two adjacencies based on Markov chain, but It is difficult to model higher order relationship.In recent years, deep neural network has obtained in-depth study in related fields, generates to sequence of recommendation Very big influence.Most popular neural network for modeling user's sequence pattern is Recognition with Recurrent Neural Network (RNN).It is based on The method of RNN is more powerful than traditional sequential grammar.However, RNN assumes that time dependence is monotonically changed, it means that it is current or Hidden state is more important than previous.In sequence of recommendation, and not all adjacent movement all has dependence.In order to solve this A problem has scholar to propose and carries out convolution to the insertion vector of project using the convolution filter of convolutional neural networks (CNN) Operate and then obtain the sequence signature of part.However, this method is only using user's embeded matrix as the long-term preference of user, This is not enough to learn the long-term preference of user.

Therefore, we propose a kind of sequence of recommendation side based on attention mechanism and convolutional neural networks in the present invention Method.Attention mechanism is used to model the long-term preference of user, and convolutional neural networks are used to excavate the short-term sequence preference of user, most The two is recommended in conjunction with to generate afterwards.

Summary of the invention

It is excellent it is an object of the invention to solve at least the above problems and/or defect, and provide at least to will be described later Point.

The present invention provides the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks, energy The long-term preference and short-term preference of enough combination users quickly and effectively is recommended to user.

In order to realize these purposes according to the present invention and other advantages, provide a kind of based on attention mechanism and convolution The user behavior sequence of recommendation method of neural network, wherein the following steps are included: acquiring the passing behavioral data of user as length Phase preference interaction sequence data；

The weight of each behavioral data in the long-term preference interaction sequence data, and root are calculated using attention mechanism algorithm Go out the long-term preference of user according to the weight calculation；

The behavioral data in a period of time is chosen as short-term preference interaction sequence data；

According to the short-term preference interaction sequence data, the short-term inclined of user is calculated using convolutional neural networks algorithm It is good；

The long-term preference of user is spliced with short-term preference, to generate user behavior sequence of recommendation.

Further, the passing behavioral data is all passing behavioral datas of the user on a fixed platform, or For the passing behavioral data for calculating certain time to the past from current date.

Further, the behavioral data in described selection a period of time is to be selected from current date to rise to the past calculating centainly The passing behavioral data of time.

Further, the method for the short-term preference that user is calculated using convolutional neural networks algorithm includes following step It is rapid:

Using the short-term preference interaction sequence data of horizontal convolution operation processing, vector o is obtained；

Short-term preference interaction sequence data are handled using vertical convolution operation, obtain vector

Link vector o and vectorThe short-term preference of user is calculated with this.

Further, the splicing is input in a fully-connected network, to generate user behavior sequence of recommendation.

Further, described to calculate each behavior number in the long-term preference interaction sequence data using attention mechanism algorithm According to weight the following steps are included:

Calculate the weight α of each behavioral data in the long-term preference interaction sequence data_j, comprising:

It calculates to hide first and indicates h_j

Wherein,It is the activation primitive of line rectification function (RELU), W₁And b₁For empirical parameter, v_jLong-term preference is indicated to hand over J-th object vector is represented in mutual sequence data；

Then h is indicated according to hiding_jCalculate weight α_j

Wherein, u is the vector of the current user to be calculated of representative, u^TIndicate the transposition of the vector, L^uRepresent the long-term of the user Preference interaction sequence data.

Further, it is described according to the weight calculation go out user long-term preference the following steps are included:

The project for considering weight is embedded in the long-term preference of the summation characterization user of long-term preference interaction sequence data u^long:

Further, the short-term preference interaction sequence data be in t moment, by the insertion vector of the l project of the past, I.e. from v_t-lTo v_t-1, the matrix E of composition^{(u, t)}It is considered as image, carries out horizontal and vertical convolution algorithm respectively；

It is described utilize the short-term preference interaction sequence data of horizontal convolution operation processing, obtain vector o the following steps are included:

Utilize horizontal filterIt is slipped over simultaneously from project matrix in different moments Sequence signature is captured, n indicates number of filter, and h indicates the height of horizontal filter, and k indicates the dimension of vector, for each Moment t, with each F^xFrom E^{(u, t)}Top slide to bottom, a value of the i-th ∈ [1, l-h+1] of the result of feature interaction by with Lower formula calculates,

Wherein,It is the activation primitive of line rectification function (RULU), indicates inner product operation,Representing matrix E^{(u, t)}The the i-th to the i-th+h-1 row, so, final convolution results are expressed as vector c^x

Then, to the output result c of single filter^xMaximum pond (max-pooling) is carried out to operate to obtain maximum Value, therefore, for the last output result vector of n filterIt indicates,

O=[max (c¹), max (c²) ... max (eⁿ)]

Wherein,

It is described to handle short-term preference interaction sequence data using vertical convolution operation, obtain vectorThe following steps are included:

WithA filterFilterWith E^{(u, t)}Each column interact, convolution As a result it is I.e.

P is E^(u,t)Pth row polymerize the insertion vector of front project using vertical filter,The output of a filter As a result it is

Further, the link vector o and vectorMethod are as follows:

Connection result

For parameter,For parameter,It is activation primitive.

Further, the method that the long-term preference of user is spliced with short-term preference includes:

Splicing result y is obtained by following formula^(u,^t):

For parameter,For parameter, N indicates the quantity of project, numerical valueTable Show that, in the probability possibility size that moment t user u is interacted with project s, v is the passing project of the user；

Wherein parameter W₂, b₁, b₂, W₁, W₃, b₃For empirical value, or obtained according to following methods:

Using gradient ascent algorithm and back-propagation method come Optimal Parameters:

It is assumed that user is more partial to the project of interaction of lower a period of time relative to unobservable project, then define Such a ordering ruleFor project p and q, then have:P is user u in t The project of interaction is carved, q is that user u in t moment does not generate interactive project also, for { (u, the L observed every time^u, p) }, it produces Raw a pair of Preference order D={ (u, L^u, p, q) },

Then, by maximizing following objective function come training pattern,

For Θ representative model to all parameters, λ is regularization weight, and δ is logistic function,

By optimization process above, the optimal solution of model parameter Θ is acquired, all parameters are released according to optimal solution.

The method that the present invention compares the interaction between traditional fitting user-project or project-project, it is enough fast and effective Combination user long-term preference and short-term preference to user recommend.

Further advantage, target and feature of the invention will be partially reflected by the following instructions, and part will also be by this The research and practice of invention and be understood by the person skilled in the art.

Detailed description of the invention

Fig. 1 is the mould of the user behavior sequence of recommendation method of the present invention based on attention mechanism and convolutional neural networks Type figure.

Specific embodiment

In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of invention for those of ordinary skill in the art without creative efforts, can be with It obtains other drawings based on these drawings.

It should be appreciated that such as " having ", "comprising" and " comprising " term used herein do not allot one or more The presence or addition of a other elements or combinations thereof.

As shown in Figure 1, the user behavior sequence of recommendation side of the present invention based on attention mechanism and convolutional neural networks Method, wherein the following steps are included: acquiring the passing behavioral data of user as long-term preference interaction sequence data；

It calculates to hide first and indicates h_j

Then h is indicated according to hiding_jCalculate weight α_j

。

Further, the short-term preference interaction sequence data be in t moment, by the insertion vector of the l project of the past, I.e. from v_t-lTo v_t-1, the matrix E of composition^(u,t)It is considered as image, carries out horizontal and vertical convolution algorithm respectively；

Utilize horizontal filterIt is slipped over simultaneously from project matrix in different moments Sequence signature is captured, n indicates number of filter, and h indicates the height of horizontal filter, and k indicates the dimension of vector, for each Moment t, with each F^xFrom E^(u,t)Top slide to bottom, a value of the i-th ∈ [1, l-h+1] of the result of feature interaction by with Lower formula calculates,

Wherein,It is the activation primitive of line rectification function (RULU), indicates inner product operation,Representing matrix E^(u,t)The the i-th to the i-th+h-1 row, so, final convolution results are expressed as vector c^x

O=[max (c¹),max(c²),...max(cⁿ)]

Wherein,

It is described to handle short-term preference interaction sequence data using vertical convolution operation, obtain vectorThe following steps are included: withA filterFilterWithEach column interact, convolution results are I.e.

Further, the connection link vector o and vectorMethod are as follows:

Connection result

For parameter,For parameter,It is activation primitive.

Splicing result y is obtained by following formula^(u,t):

Then, by maximizing following objective function come training pattern,

Embodiment

Firstly, we use U={ u₁,u₂,……,u_MIndicate M user set, V={ v₁, v₂... ..., v_NIndicate source The N number of project set in domain.Each user generates interaction sequence with a series of projectsThis sequence arranges sequentially in time, and wherein t indicates opposite time index Rather than the absolute time, | L^u| indicate the interaction sequence length of user.There is the above symbol to indicate, our sequence of recommendation is appointed Business is defined as follows, we are absorbed in from implicit feedback and user's interaction sequence feedback data (for example, user's continuously registers and purchase Buy transaction record) in extract information.As the history interaction sequence L of given a user u and he^u, our purpose is from interaction In record excavate user long-term preference and short-term preference come recommended user u next a period of time in there may be interactive items Mesh set.

Correlation technique

(1) gradient ascent algorithm

Gradient rise method is more commonly used optimization algorithm.Gradient rise method is based on the idea that be: finding certain function most Big value, the best way is sought along the gradient direction of the function.If gradient is denoted asThen the gradient of function f (x, y) can It indicates are as follows:

This gradient means to move along the direction of xIt moves in direction along yWherein, function f (x, y) It must be defined on point to be calculated and can be micro-.In each iteration, gradient ascent algorithm all can move one along gradient direction Step.Wherein, gradient operator always points at the fastest-rising direction of functional value.Mentioned here is moving direction, without referring to Amount of movement size.The magnitude is known as step-length, is denoted as α.The iterative formula that gradient rises can indicate are as follows:

The formula will be iterated always execution, until reaching some stop condition.For example the number of iterations reaches some Designated value or algorithm reach some error range that can permit.

(2) back-propagation algorithm

Back-propagation algorithm, i.e. BP algorithm are to be suitable for a kind of study of multilayer neural networks in the case where there is tutor's guidance Algorithm, it is established on the basis of gradient descent method.The input/output relation of BP network is substantially a kind of mapping relations: one The function completed of BP neural network of input m output be tieed up from n Euclidean space into m dimension Euclidean space one finite field it is continuous Mapping, this mapping have nonlinearity.Its information processing capability is multiple compound from simple non-linear functions, because This reappears ability with very strong function.This is the basis that BP algorithm is able to apply.Back-propagation algorithm is mainly by two links (excitation is propagated, weight updates) iterative cycles iteration, until the response to input of network reaches scheduled target zone.

The learning process of BP algorithm is made of forward-propagating process and back-propagation process.It is defeated during forward-propagating Enter information by input layer through hidden layer, successively handles and be transmitted to output layer.If cannot get desired output valve in output layer, The quadratic sum of output and desired error is then taken to be transferred to backpropagation as objective function, successively find out objective function to each mind Partial derivative through first weight constitutes objective function and measures to the ladder of weight vector, and as the foundation of modification weight, the study of network exists It is completed during weight modification.When error reaches desired value, e-learning terminates.

As shown in Figure 1, proposing a kind of completely new sequence of recommendation method based on attention mechanism and convolutional neural networks. It is proposed that model be broadly divided into 4 layers, first layer is embeding layer (Embedding Layer), and user and item are mapped to Continuous lower dimensional space；The second layer is the long-term preference that attention layer (Attention Layer) is used to model user；Third layer It is convolutional layer (Convolutional Layer), for excavating the short-term sequence preference of user；4th layer is full articulamentum (Fully-connected Layer) is fitted the long-term preference of user and short-term preference and recommends to generate.The experimental results showed that I The method that proposes better than state-of-the-art method.

1、Embedding Layer

It is indicated similar to the discrete vocabulary symbol in natural language processing, the expression ability of the ID of original user and item It is limited.Therefore, user and item are mapped in two low-dimensional continuous spaces by our model first step, are denoted as respectivelyMatrix U_eAnd matrix V_eRespectively indicate user_embedding and item_ that mapping obtains Embedding, K indicate the dimension of lower dimensional space, M and N not Biao Shi user and project quantity.

2、Attention Layer

Since the long-term project collection of user usually changes over time, the long-term preference of static state for learning each user is indicated The dynamic of the long-term preference of user cannot be expressed completely.Furthermore, it is believed that identical project may produce different users Raw different influence, and different projects can generate different influences to that will access or buy down several projects.Example Such as, user a buys article m because of the interest of oneself, and user b purchase article m is to give friend.In such case Under, predicting what when they may buy in the near future, project m there should be different weights to user a and b.

In order to meet above-mentioned requirements, we use attention mechanism, which has been successfully applied to many tasks, such as answer automatically It answers, document classification and recommendation.Given user u, we calculate his/her historical series L first^uIn each project importance, Then the insertion vector that we polymerize these projects is indicated with the long-term preference for forming user.In form, attention net definitions It is as follows:

It is parameter, v_jThe insertion vector of expression project j,It is RELU activation primitive, is used to The non-thread sexuality of lift scheme.Firstly, we are by the history interaction sequence L of user u^uEach of project lower dimensional space Vector v_jIt is input in a fully-connected network and obtains its hiding expression h_j；Then, its corresponding attention is acquired according to formula (1) Power weight α_j.It is that each input uses identical context vector in traditional attention model, we regard the insertion vector of user u For context vector, and it will pay attention to score α_jAs h_jNormalization similarity between u measures together with softmax function, Project j is characterized to the importance u of user.Finally, the summation characterization for the project insertion vector that we are weighted with attention score The long-term preference of user, is denoted as u^long:

3, convolutional layer

Convolutional layer utilizes the local sequence signature for going capture user of the convolution filter (filter) of CNN, for given A user u and its interaction sequence L^u, we are in t moment, by the insertion vector of l project of front (i.e. from v_t-lTo v_t-1) Composition matrix E (^u,^t) it is considered as " image ", horizontal and vertical convolution algorithm is carried out respectively.Horizontal convolutional layer

In horizontal convolution operation, we utilize horizontal filterIn difference Moment slips over from project matrix and captures sequence signature, and n indicates number of filter, and h indicates the height of horizontal filter.For Each moment t, we are with each F^xFrom E^(u,t)Top slide to bottom.I-th ∈ [1, l-h+ of the result of feature interaction 1] a value is calculated by formula (2),

It is RULU activation primitive, indicates inner product operation,Representing matrix E^(u,t)The the i-th to the i-th+h-1 row. So final convolution results are expressed as vector c^x

Then, we are to the output result c to single filter^xMaximum pond (max-pooling) is carried out to operate to obtain Obtain maximum value.Therefore, for the last output result of n filter, we use vectorIt indicates,

O=[max (c¹),max(c²) ... max (cⁿ)]

Vertical convolutional layer

In vertical convolutional layer, we are with (~) come label symbol.

It is similar with horizontal convolutional layer, Wo MenyongA filterFilterWith E^(u ^{, t)}Each column interact, convolution results are

I.e.

P is E^{(u, t)}Pth row.Using vertical filter, we can polymerize the insertion vector of front project, therefore, this The local sequence that a little vertical filters can capture user u with the weighted sum of the insertion vector of l project before t moment is special Sign.The output result of a filter is

It is not necessary to be operated to vertical convolution results using pondization, because it is desirable that keeping the poly- of each potential dimension It closes.Therefore, the output of this layer is

Finally, we connect the output of two convolutional layers, then they are input in a fully-connected network, to obtain More advanced and abstract sequence signature indicates

It is activation primitive.

4、Fully-connected Layer

In order to obtain the overall preference of user, we will also be noted that the output of power layer and the output of convolutional layer are connected to one It rises, and they is input in a full connection output layer, be expressed as follows,

N indicates the quantity of project, numerical valueIt indicates to use in moment t The probability possibility size that family u is interacted with project s.Finally, according to y^{(u, t)}Carry out the project that recommended user next may be interactive.

Network model parameter optimization

For Optimized model, optimal model parameter is obtained, we are using based on above-mentioned gradient ascent algorithm and reversely Transmission method carrys out Optimized model.Meanwhile we use BRP Optimality Criteria objective function.We assume that relative to unobservable Project, user are more partial to the project of purchase of lower a period of time, and then we define such a ordering ruleIt is right In project p and q, then have:P is project of the user u in t moment interaction, and q is that user u exists T moment does not generate interactive project also.For { (u, the L observed every time^u, p) }, we generate a pair of of Preference order D= {(u,L^u,p,q)}。

Then, we train our model by the following objective function of maximization,

For Θ representative model to all parameters, λ is regularization weight, and δ is logistic function.

By optimization process above, we can be in the hope of the optimal solution of model parameter Θ.

Although the embodiments of the present invention have been disclosed as above, but it is not limited in listed fortune in specification and embodiments With.It can be applied to various suitable the field of the invention completely.It for those skilled in the art, can be easily real Now other modification.Therefore without departing from the general concept defined in the claims and the equivalent scope, the present invention is not limited to Specific details and legend shown and described herein.

Claims

1. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks, which is characterized in that including following Step: the passing behavioral data of acquisition user is as long-term preference interaction sequence data；

The weight of each behavioral data in the long-term preference interaction sequence data is calculated using attention mechanism algorithm, and according to institute State the long-term preference that weight calculation goes out user；

According to the short-term preference interaction sequence data, the short-term preference of user is calculated using convolutional neural networks algorithm；

2. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as described in claim 1, It is characterized in that, the passing behavioral data is all passing behavioral datas of the user on a fixed platform, or for from currently The passing behavioral data of certain time is calculated from date to the past.

3. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as described in claim 1, It is characterized in that, the behavioral data chosen in a period of time is to be selected from current date to play the mistake for calculating certain time to the past Toward behavioral data.

4. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as described in claim 1, Be characterized in that, the method for the short-term preference that user is calculated using convolutional neural networks algorithm the following steps are included:

5. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as described in claim 1, It is characterized in that, the splicing is input in a fully-connected network, to generate user behavior sequence of recommendation.

6. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as described in claim 1, It is characterized in that, the weight that each behavioral data in the long-term preference interaction sequence data is calculated using attention mechanism algorithm The following steps are included:

It calculates to hide first and indicates h_j

Wherein,It is the activation primitive of line rectification function (RELU), W₁And b₁For empirical parameter, v_jIndicate long-term preference interaction sequence J-th object vector is represented in column data；

Then h is indicated according to hiding_jCalculate weight α_j

Wherein, u is the vector of the current user to be calculated of representative, u^TIndicate the transposition of the vector, L^uRepresent the long-term preference of the user Interaction sequence data.

7. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as claimed in claim 6,

It is characterized in that, it is described according to the weight calculation go out user long-term preference the following steps are included:

The project for considering weight is embedded in the long-term preference u of the summation characterization user of long-term preference interaction sequence data^long:

8. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as claimed in claim 4, It is characterized in that, the short-term preference interaction sequence data are in t moment, by the insertion vector of the l project of the past, i.e., from v_t-l To v_t-1, the matrix E of composition^(u,t)It is considered as image, carries out horizontal and vertical convolution algorithm respectively；

Utilize horizontal filterIt slips over and captures from project matrix in different moments Sequence signature, n indicate number of filter, and h indicates the height of horizontal filter, and k indicates the dimension of vector, for each moment T, with each F^xFrom E^(u,t)Top slide to bottom, a value of the i-th ∈ [1, l-h+1] of the result of feature interaction is by following public affairs Formula calculates,

Wherein,It is the activation primitive of line rectification function (RULU), indicates inner product operation,Representing matrix E^(u,t) The the i-th to the i-th+h-1 row, so, final convolution results are expressed as vector c^x

Then, to the output result c of single filter^xMaximum pond (max-pooling) operation is carried out to obtain maximum value, because This, for the last output result vector of n filterIt indicates,

O=[max (c¹),max(c²),…max(cⁿ)]

Wherein,

WithA filterFilterWith E^(u,t)Each column interact, convolution results are I.e.

P is E^(u,t)Pth row polymerize the insertion vector of front project using vertical filter,The output result of a filter For

9. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as claimed in claim 8, It is characterized in that, the link vector o and vectorMethod are as follows:

Connection result

For parameter,For parameter,It is activation primitive.

10. the user behavior sequence of recommendation method based on attention mechanism and convolutional neural networks as described in claim 1, It is characterized in that, the method that the long-term preference of user is spliced with short-term preference includes:

Splicing result y is obtained by following formula^(u,t):

For parameter,For parameter, N indicates the quantity of project, numerical valueIt indicates The probability possibility size that moment t user u is interacted with project s, v are the passing project of the user；

It is assumed that user is more partial to the project of interaction of lower a period of time relative to unobservable project, then define in this way One ordering ruleFor project p and q, then have:P is that user u is handed in t moment Mutual project, q is that user u in t moment does not generate interactive project also, for { (u, the L observed every time^u, p) }, generate one To Preference order D={ (u, L^u, p, q) },

Then, by maximizing following objective function come training pattern,