CN108648049A

CN108648049A - A kind of sequence of recommendation method based on user behavior difference modeling

Info

Publication number: CN108648049A
Application number: CN201810414330.3A
Authority: CN
Inventors: 陈恩红; 刘淇; 李徵; 赵洪科; 张凯
Original assignee: University of Science and Technology of China USTC
Current assignee: Chen Enhong; Huang Zhenya; Liu Qi; University of Science and Technology of China USTC
Priority date: 2018-05-03
Filing date: 2018-05-03
Publication date: 2018-10-12
Anticipated expiration: 2038-05-03
Also published as: CN108648049B

Abstract

The invention discloses a kind of sequence of recommendation methods based on user behavior difference modeling, including：Obtain the historical behavior information of user；Product features vector is calculated according to the historical behavior information of acquisition；Series Modeling is carried out in conjunction with product features vector usage behavior difference modeling method, the current demand and history preference of user are obtained by two different neural network frameworks；According to the current purchasing demand of user and history preference, it is predicted by combination learning interested commodity next to user, and matched in commodity vector space, most similar multiple commodity in commodity vector space are found with prediction result, commercial product recommending sequence is generated.This method can provide accurately sequence of recommendation service to the user by the difference modeling to user's sequential behavior, the intelligent current demand understood in user's purchase decision and long-term preference.

Description

A kind of sequence of recommendation method based on user behavior difference modeling

Technical field

The present invention relates to machine learning and technical field of electronic commerce, more particularly to a kind of distinguished based on user behavior to model Sequence of recommendation method.

Background technology

With the continuous development of online shopping platform, commending system has become irreplaceable important in e-commerce Component part.Commending system can learn the preference information hidden in user's history behavior, to user's Shopping Behaviors into The further prediction of row promotes the income of e-commerce platform to be promoted while helping customer to select to be satisfied with commodity.Therefore, such as What efficient, accurate commercial product recommending service for providing personalization to the user is all the weight of educational circles and industrial quarters research all the time Want problem.

Currently, mainly there is following two major classes research about commending system：

1) commending system based on user's static preferences

Algorithm based on content, collaborative filtering or mixed type is all to belong to such, these methods are by by user's commodity Information sees static feature as, and the commodity phase Sihe hidden in these features is excavated by the methods of cluster, matrix decomposition The information such as user individual preference, to carry out Similar content or the recommendation of similar preference to user.Under this class model, Yong Huli History behavioral data is counted as the static nature of user, and the preference of user is considered steady in a long-term and can be to user's future Decision process has an impact, and on this basis, this kind of commending system only needs to learn user's history preference, and is pushed away for its preference Recommend similar commodity.

2) the sequence of recommendation method based on short session

In partial line upper mounting plate, especially small-sized retail platform and multimedia contents service quotient lack enough use Family historical behavior, but its backstage has added up the short session content of a large amount of user.In view of to lack user long-term for this scene Preference profiles have scholar to propose user's sequence of recommendation method based on short session.Such methods are typically based on the short-term of user Operation behavior builds deep neural network to model the dynamic change of user's behavior in a short time, to be used for predicting under user The interested merchandise items of one step simultaneously recommend user.

It is above in line platform recommendation method, the commending system based on user's static preferences can be good at study and understand The stabilization preference of user, and its favorite commodity of user or service can be accurately recommended, but this method is static , only the preference of user to be regarded as and is remained unchanged for a long period of time, the preference for not accounting for user is also the process of a dynamic change, Meanwhile this method does not account for the current demand of user, is strictly what user liked it is therefore possible to recommend user But it is not that user is required.Sequence of recommendation method based on short session is to record the interactive process of user in a short time Come, the sequence signature during analyzing user's current decision in these acts and efforts for expediency, to judge that user is next interested Commodity or service.The sequence of recommendation method of this short session can model user behavior in a short time by deep neural network Dynamic change, but this method has ignored user itself preference, this makes recommendation results often meet user demand but not It is the favorite type of user.Meanwhile two methods fail to the dynamic change in the entire decision process by user and carry out deeply Enter modeling, does not make a concrete analysis of the difference preference's degree gone out expressed by user's difference behavior.Therefore, using existing recommendation method It is difficult to complete decision process when accurate modeling user selects commodity or service, user demand and preference cannot be combined, Recommendation is caused to fail to reach user's expectation.

Invention content

The object of the present invention is to provide a kind of sequence of recommendation methods based on user behavior difference modeling, when by user The difference of sequence behavior models, the intelligent current demand understood in user's purchase decision and long-term preference, and can be carried for user For accurately sequence of recommendation service.

The purpose of the present invention is what is be achieved through the following technical solutions：

A kind of sequence of recommendation method based on user behavior difference modeling, including：

Obtain the historical behavior information of user；

Product features vector is calculated according to the historical behavior information of user；

Series Modeling is carried out in conjunction with product features vector usage behavior difference modeling method, passes through two different nerve nets Network framework obtains the current demand and history preference of user；

According to the current purchasing demand of user and history preference, pass through combination learning interested commodity next to user It is predicted, and is matched in commodity vector space, found most similar more in commodity vector space with prediction result A commodity generate commercial product recommending sequence.

As seen from the above technical solution provided by the invention, for the deep enough spy of the difference modeling energy of user behavior Suffered influence factor is begged in user's decision process, the different need of the user reflected for the different types of behavior of user Ask with preference carried out it is more deep consider, and then the operation of user's next step is effectively predicted, to recommend use The more satisfied commodity in family, and can sequence of recommendation dynamically be carried out by being interacted with user, it compensates for existing method and lacks Weary dynamic and personalized certain drawback.

Description of the drawings

In order to illustrate the technical solution of the embodiments of the present invention more clearly, required use in being described below to embodiment Attached drawing be briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for this For the those of ordinary skill in field, without creative efforts, other are can also be obtained according to these attached drawings Attached drawing.

Fig. 1 is a kind of flow of sequence of recommendation method based on user behavior difference modeling provided in an embodiment of the present invention Figure.

Specific implementation mode

With reference to the attached drawing in the embodiment of the present invention, technical solution in the embodiment of the present invention carries out clear, complete Ground describes, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.Based on this The embodiment of invention, every other implementation obtained by those of ordinary skill in the art without making creative efforts Example, belongs to protection scope of the present invention.

The embodiment of the present invention provides a kind of sequence of recommendation method based on user behavior difference modeling, as shown in Figure 1, it is led Include the following steps：

Step 1, the historical behavior information for obtaining user.

Each user can leave a series of log recording from the background when browsing is in line platform, these records have The commodity relevant operations such as specific sequential relationship, including user's browsing, click, addition shopping cart, collection and purchase.These data It can be collected directly from online shopping platform or online service provider.

In the embodiment of the present invention, the historical behavior information of acquired user is the data of interaction commodity sequence form, is used The interaction commodity sequence of family u is expressed as：X therein indicates commodity, under be designated as commodity Serial number, b indicate user's operation behavior, be one-hot vector, vectorial length is the quantity of type of interaction.

Step 2 calculates product features vector according to the historical behavior information of user.

In the embodiment of the present invention, by modeling the commodity sequence relation in user behavior, the Skip- based on negative sampling is built Gram models generate the feature vector of commodity；Main process is as follows：

According to the interaction commodity sequence of user uProduct features vector is set up, It is intended to maximize following target：

Wherein, N is interaction commodity sequenceLength, what p was indicated is softmax functional forms, definition be and x_j、x_i Relevant probability, p (x_j|x_i) this form is referred to as softmax functions in professional domain, form is as follows：

Wherein, w_iWith v_iIt is and commodity x_iContext indicate corresponding latent variable and object vector；w_jIt is and commodity x_jContext indicate corresponding latent variable；w_k'It is and commodity x_k'Context indicate corresponding latent variable；K''s Value is 1 to N；

In order to mitigate the computation complexity of gradient, the following process of above formula is replaced：

Wherein, σ (r)=1/ (1+exp (- r)) is sigmoid functions, and E is the number of the negative sample to be drawn of each positive sample Amount, here positive sample refer to and x_iContext-sensitive commodity, the incoherent commodity that negative sample refers to just, the size of E can To be set according to actual conditions or experience by user；

It is different to consider different commodity occurrence numbers, certain noise can be brought to above-mentioned negative sampling process, based on pair The mode that commodity individual occurrence number is weighted, is again defined as above formula：

Wherein, Θ (x_i) it is commodity x_iThe frequency statistics occurred in interaction commodity sequence, the then target that commodity insertion characterizes As maximize loss function：

Later, product features vector P is obtained by way of gradient decline_u={ v₁,v₂,...,v_N, wherein v_jIndicate quotient Product x_jD dimensional feature vectors.

Step 3 carries out Series Modeling in conjunction with product features vector usage behavior difference modeling method, by two different Neural network framework obtains the current demand and history preference of user.

After obtaining product features vector, difference behavior modeling can be using Continuous behavior as priori, to recommend For the purpose of the project that target user most possibly accesses when accessing next time.The decision process of user is mainly by two factor shadows It rings：Current motivation and history preference.More specifically, the current consumer motivation of user is dynamic in a short time, nearest Fluctuation is also critically important for reflection Short-term characteristic.In view of all recent behaviors (such as clicking, collect, shopping cart, purchase) It might mean that the current short-term motivation of user, it is dynamic that current consumption is presented using all types of recent behaviors in the present invention Machine.On the other hand, for the history preference using user, not all types of behaviors can describe the preference of user.In order to The long-term preference of user is modeled, the present invention only retains the potential preference that user is explicitly described from interactive history i.e. buying behavior Behavior.In fact, the interactive process of user is a series of implicit feedbacks over time.Therefore, it is visited with from static mode The conventional recommendation systems of rope user items interaction are different, and the next item down suggestion is handled by sequence modeling.Specifically, Wo Menshe Two distinguishing behavior modeling processes are counted：It is current discriminatively to learn user for session behavior modeling and preference behavior modeling Consumer motivation and preference steady in a long-term.In addition, on this basis, we have invented two kinds, and the deep-cycle based on LSTM is refreshing Through network, to learn the arrangement of both motivations and preference behavior jointly.

First, conversate behavior modeling, product features vector P_u={ v₁,v₂,...,v_NCorresponding interactive commodity sequence ForIndicator function is defined as follows to determine commodity x_iWhether current sessions behavior is met Range：

D_SBL(x_i,x_N)=Φ ((N-i)≤Ts)；

Wherein, Φ (a) is a Boolean type function, when a is true, functional value 1, it is on the contrary then be 0；Ts indicates session The control time of behavior walks, for controlling the length of session behavior；x_NIt is current interaction commodity sequence S_u ^bIn the last one commodity；

After definition initializes LSTM matrixes, in t-th of iteration step, the hiding layer state h of each_tUpdate, With the hiding layer state h of a upper time step_t-1And the product features vector v currently inputted_tWith behavior vector b_tIt is related；Wherein Steps are as follows for update：

h_t=o_ttanh(c_t)

Wherein, i_t、f_t、o_tInput gate, forgetting door, out gate in respectively t-th of iteration step；c_tIt is network element Memory module；b_tFor the user's operation behavior of t-th of the commodity of correspondence inputted in t-th of iteration step；W_vi、W_hi、W_ci、W_biIt is corresponding For input gate i_tMiddle v_t、h_t-1、c_t-1、b_tWeight；W_vf、W_hf、W_cf、W_bf, correspond to forget door f_tMiddle v_t、h_t-1、c_t-1、b_tPower Weight；W_vc、W_hc、W_bcCorrespond to v in memory module_t、h_t-1、b_tWeight；W_vo、W_ho、W_co、W_boCorrespond to out gate o_tMiddle v_t、 h_t-1、c_t-1、b_tWeight；Input gate i is corresponded to respectively_t, forget door f_t, out gate o_t, memory module c_t Deviation；h_tFor the output of current state；Tanh is hyperbolic tangent function.

Then the current purchasing demand of user is expressed as：

Ψ_SBL=h_N；

During aforesaid operations, iterations are identical as commodity amount in interaction commodity sequence, i.e. t=1,2 ..., N, h_N The last one commodity of sequence x_NOutput after input, i.e. n-th iteration step export.

Secondly, the history preference modeling for carrying out user, for each user's operation commodity-behavior to (v_i,b_i)∈S_u ^b； Its indicator function is expressed as：

D_PBL(v_i,b_i)=Φ (b_i∈P)；

Wherein, P is the set of preference behavior, and main includes purchase, collection, addition shopping cart operation behavior；

The preference expression for learning user using two-way LSTM networks, in each time step of history preference modeling There are two hidden layers to export, and for s-th of time step, wherein forward direction exportsIt is to be exported by its previous time stepWith work as Preceding commodity-behavior is to (v_s,b_s) determined；Backward outputIt is to be exported by its latter time stepWith current quotient Product-behavior is to (v_s,b_s) determined；Corresponding formula is as follows：

h_s=o_stanh(c_s)

Wherein, i_s、f_s、o_sThe input gate of respectively s-th time step forgets door, out gate；c_sIt is the note of network element Recall module；b_sFor the user's operation behavior of s-th of the commodity of correspondence inputted in s-th of iteration step；W_vi'、W_hi'、W_ci'、W_bi' right It should be input gate i_sMiddle v_s、h_s-1、c_s-1、b_sWeight；W_vf'、W_hf'、W_cf'、W_bf', correspond to forget door f_sMiddle v_s、h_s-1、c_s-1、 b_sWeight；W_vc'、W_hc'、W_bc' correspond to v in memory module_s、h_s-1、b_sWeight；W_vo'、W_ho'、W_co'、W_bo' correspond to it is defeated Go out o_sMiddle v_s、h_s-1、c_s-1、b_sWeight； Input gate i is corresponded to respectively_s, forget door f_s, out gate o_s, memory module c_sDeviation；h_sFor the output of current state；If it is forward process, the output h of current state_sAsIf it is backward process, the output h of current state_sAs

By two-way LSTM networks, the preference that active user can be accessed to each time step characterizes vector：

Wherein,ForVector splicing, the average pond process that the history preference of user is expressed as：

Step 4, the current purchasing demand according to user and history preference, it is next to user interested by combination learning Commodity predicted, and matched in commodity vector space, find with prediction result the most phase in commodity vector space Close multiple commodity generate commercial product recommending sequence.

In the embodiment of the present invention, the current purchasing demand Ψ of user is combined by a full linking layer_SBLWith history preference Ψ_PBL, to which the predicted vector of the next interested commodity of user be calculated：

Wherein,WithCorresponding is the weight of current purchasing demand and history preference；Bias indicates model bias.

During model training, it is assumed that really the next interested commodity vector of user is：v_T+1=(y₁, y₂,...,y_d)；The loss function of model may be defined as：

Wherein, d is the dimension of vector.

Said program of the embodiment of the present invention records for the historical behavior of user, division difference is carried out according to time series The sequence information of different user is embodied on the structure product features vector and user behavior difference modeling method of the present invention, Product features vector is generated with commodity insertion characterizing method, and difference sequence is carried out to the different behaviors of commodity to user and is built Mould, learns the current demand and history preference of user respectively, and interested commodity next to user are predicted.This method The history preference of user is combined with current demand, the difference preference of commodity is built for expressed by user's difference behavior Mould is dynamically learnt user's decision process by recurrent neural network, and then generates personalized sequence of recommendation to user, is made up Existing method lacks dynamic and personalized certain drawback.

Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment can By software realization, the mode of necessary general hardware platform can also be added to realize by software.Based on this understanding, The technical solution of above-described embodiment can be expressed in the form of software products, the software product can be stored in one it is non-easily In the property lost storage medium (can be CD-ROM, USB flash disk, mobile hard disk etc.), including some instructions are with so that a computer is set Standby (can be personal computer, server or the network equipment etc.) executes the method described in each embodiment of the present invention.

The foregoing is only a preferred embodiment of the present invention, but scope of protection of the present invention is not limited thereto, Any one skilled in the art is in the technical scope of present disclosure, the change or replacement that can be readily occurred in, It should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with the protection model of claims Subject to enclosing.

Claims

1. a kind of sequence of recommendation method based on user behavior difference modeling, which is characterized in that including：

Obtain the historical behavior information of user；

Series Modeling is carried out in conjunction with product features vector usage behavior difference modeling method, passes through two different neural network framves Structure obtains the current demand and history preference of user；

According to the current purchasing demand of user and history preference, carried out by combination learning interested commodity next to user Prediction, and is matched in commodity vector space, finds with prediction result most similar multiple quotient in commodity vector space Product generate commercial product recommending sequence.

2. a kind of sequence of recommendation method based on user behavior difference modeling according to claim 1, which is characterized in that institute The historical behavior information of the user of acquisition is the data of interaction commodity sequence form, and the interaction commodity sequence of user u is expressed as：X therein indicates commodity, under be designated as the serial numbers of commodity, b indicates user's operation row For.

3. a kind of sequence of recommendation method based on user behavior difference modeling according to claim 2, which is characterized in that root According to user historical behavior information calculate product features vector step include：

According to the interaction commodity sequence of user uSet up product features vector, it is intended to Maximize following target：

Wherein, N is interaction commodity sequence S_uLength, and p (x_j|x_i) softmax functions are defined as, form is as follows：

Wherein, w_iWith v_iIt is and commodity x_iContext indicate corresponding latent variable and object vector；w_jIt is and commodity x_j's Context indicates corresponding latent variable；w_k'It is and commodity x_k'Context indicate corresponding latent variable；The value of k' N is arrived for 1；

The following process of above formula is replaced：

Wherein, σ (r)=1/ (1+exp (- r)) is sigmoid functions, and E is the quantity of the negative sample to be drawn of each positive sample；

Based on the mode being weighted to commodity individual occurrence number, above formula is defined as again：

Wherein, Θ (x_i) it is commodity x_iThe frequency statistics occurred in interaction commodity sequence, the then target that commodity insertion characterizes are Maximize loss function：

Later, product features vector P is obtained by way of gradient decline_u={ v₁,v₂,...,v_N, wherein v_jIndicate commodity x_j D dimensional feature vectors.

4. a kind of sequence of recommendation method based on user behavior difference modeling according to claim 3, which is characterized in that institute It states and modeling method progress Series Modeling is distinguished according to the historical behavior information usage behavior of acquisition, calculating the current purchase of user needs Summation history preference process include：

First, conversate behavior modeling, product features vector P_u={ v₁,v₂,...,v_TCorresponding interactive commodity sequence isIndicator function is defined as follows to determine commodity x_iWhether current sessions behavior is met Range：

D_SBL(x_i,x_T)=Φ ((T-i)≤Ts)；

Wherein, Φ (a) is a Boolean type function, when a is true, functional value 1, it is on the contrary then be 0；Ts indicates session behavior Control time step, for controlling the length of session behavior；x_TIt is interactive commodity sequenceIn the last one commodity；

After definition initializes LSTM matrixes, in t-th of iteration step, the hiding layer state h of each_tUpdate and upper one The hiding layer state h of a time step_t-1And the product features vector v currently inputted_tAnd behavior vector b_tIt is related；Wherein update step It is rapid as follows：

Wherein, i_t、f_t、o_tInput gate, forgetting door, out gate in respectively t-th of iteration step；c_tIt is the memory mould of network element Block；b_tFor the user's operation behavior of t-th of the commodity of correspondence inputted in t-th of iteration step；W_vi、W_hi、W_ci、W_biIt corresponds to input Door i_tMiddle v_t、h_t-1、c_t-1、b_tWeight；W_vf、W_hf、W_cf、W_bf, correspond to forget door f_tMiddle v_t、h_t-1、c_t-1、b_tWeight；W_vc、 W_hc、W_bcCorrespond to v in memory module_t、h_t-1、b_tWeight；W_vo、W_ho、W_co、W_boCorrespond to out gate o_tMiddle v_t、h_t-1、c_t-1、b_t Weight；Input gate i is corresponded to respectively_t, forget door f_t, out gate o_t, memory module c_tDeviation；h_t For the output of current state；Tanh is hyperbolic tangent function；

Then the current purchasing demand of user is expressed as：

Ψ_SBL=h_N；

Secondly, the history preference modeling for carrying out user, for each user's operation commodity-behavior pairIt is indicated Function representation is：

D_PBL(v_i,b_i)=Φ (b_i∈P)；

Wherein, P is the set of preference behavior；

The preference expression for learning user using two-way LSTM networks has two in each time step of history preference modeling A hidden layer output, for s-th of time step, wherein forward direction exportsIt is to be exported by its previous time stepWith it is current Commodity-behavior is to (v_s,b_s) determined；Backward outputIt is to be exported by its latter time stepWith current commodity-row For to (v_s,b_s) determined；Corresponding formula is as follows：

Wherein, i_s、f_s、o_sThe input gate of respectively s-th time step forgets door, out gate；c_sIt is the memory mould of network element Block；b_sFor the user's operation behavior of s-th of the commodity of correspondence inputted in s-th of iteration step；W_vi'、W_hi'、W_ci'、W_bi' correspond to Input gate i_sMiddle v_s、h_s-1、c_s-1、b_sWeight；W_vf'、W_hf'、W_cf'、W_bf', correspond to forget door f_sMiddle v_s、h_s-1、c_s-1、b_s's Weight；W_vc'、W_hc'、W_bc' correspond to v in memory module_s、h_s-1、b_sWeight；W_vo'、W_ho'、W_co'、W_bo' correspond to out gate o_sMiddle v_s、h_s-1、c_s-1、b_sWeight； Input gate i is corresponded to respectively_s, forget door f_s, out gate o_s, note Recall module c_sDeviation；h_sFor the output of current state；If it is forward process, the output h of current state_sAsSuch as Fruit be after to process, then the output h of current state_sAs

5. a kind of sequence of recommendation method based on user behavior difference modeling according to claim 1 or 3 or 4, feature It is, the current purchasing demand and history preference according to user, passes through combination learning interested quotient next to user Product carry out prediction：

The current purchasing demand Ψ of user is combined by a full linking layer_SBLWith history preference Ψ_PBL, to which use be calculated The predicted vector of the next interested commodity in family：

Wherein,WithCorresponding is the weight of current purchasing demand and history preference；Bias indicates model bias；

During model training, it is assumed that really the next interested commodity vector of user is：v_T+1=(y₁,y₂,..., y_d)；The loss function of model is defined as：

Wherein, d is the dimension of vector.