CN111369278A

CN111369278A - Click rate prediction method based on long-term interest modeling of user

Info

Publication number: CN111369278A
Application number: CN202010103305.0A
Authority: CN
Inventors: 汤景凡; 张秀杰; 张旻; 姜明; 黄涛; 吴鑫强
Original assignee: Hangzhou Dianzi University
Current assignee: Hangzhou Dianzi University
Priority date: 2020-02-19
Filing date: 2020-02-19
Publication date: 2020-07-03

Abstract

The invention discloses a click rate prediction method based on long-term interest modeling of a user. The invention comprises the following steps: step (1) dividing a user behavior sequence into different sessions, and then extracting short-term interest and hobbies of a user by using a self-attention mechanism module; step (2) extracting the long-term interest and interest deviation of the user through a multi-channel memory network; step (3), the interest activation unit associates the short-term interest and hobby of the user, the long-term interest and hobby deviation characteristics of the user with the target object; step (4) linking the three characteristics processed in the step (3) and inputting the three characteristics into a multiple sensing machine to predict the click probability; and (5) calculating a negative log-likelihood function for the output probability, wherein the smaller the negative log-likelihood function value is, the better the corresponding effect of the method is. The method solves the problems of low interpretability and low accuracy in click rate prediction, and can obtain better effect and interpretability.

Description

Click rate prediction method based on long-term interest modeling of user

Technical Field

The invention relates to the field of click rate prediction in a design recommendation system, in particular to a method for realizing click rate prediction by a multi-channel memory network and a self-attention mechanism

Background

Predicting the click rate of the user for purchasing the commodity, and the key for solving the problems is how to effectively process the serialized information. Currently, there are three types of methods for processing sequence information

First, the serialized information is processed by a simple LSTM model or a CNN model, and we know that the LSTM finally outputs a hidden vector, so that the interpretability is not strong. LSTM, on the other hand, does not distinguish the role of each feature well when processing a sequence of features. This is also why we use a memory network to accurately process serialized information. As for the CNN model, we can find that CNN cannot handle the long interval association problem of features in the sequence well. This may result in failure to extract the variation feature of the user's taste.

The second method is to extract the characteristic information in the sequence through a memory network. The method can well measure the action of each clicked item in the sequence and can provide good interpretability, but the problem of session of the user clicking the characteristic sequence is not well considered in the memory network. It may not be possible to effectively predict the click probability of other items in the same session.

The third method extracts the deviations of the long-term and interest interests of the user in the serialized information by using a multi-channel memory network. The memory network has good interpretability, and can measure different functions of each article through the Attention. The migration change characteristics of the interest and the hobbies of the user on the same type of goods are mined through multiple channels, and each channel represents the migration change of the interest of the user on a certain type of goods. The user historical behavior sequence is divided into different sessions according to time intervals, and then the user short-term interest is represented by the latest session sequence. Due to the consistency of the user behavior sequence, the users click on similar items in the same time interval. Therefore, the method of selecting the latest k behavior sequences as the latest interest expression of the user can cause the items in the sequences to belong to a plurality of different sessions, so that the short-term interest and hobbies of the user cannot be effectively expressed.

Disclosure of Invention

The invention aims to provide a method for predicting long-term and short-term click rate of a user by combining a self-attention mechanism and a multi-channel memory network aiming at the defects of the prior art so as to solve the problems of poor interpretability and low accuracy in click rate prediction.

The technical scheme adopted by the invention for solving the technical problem comprises the following steps:

step (1) dividing a user behavior sequence into different sessions, and then extracting short-term interest and hobbies of a user by using a self-attention mechanism module;

step (2) extracting the long-term interest and interest deviation of the user through a multi-channel memory network;

step (3), the interest activation unit associates the short-term interest and hobby of the user, the long-term interest and hobby deviation characteristics of the user with the target object;

step (4) linking the three characteristics processed in the step (3) and inputting the three characteristics into a multiple sensing machine to predict the click probability;

and (5) calculating a negative log-likelihood function for the output probability, wherein the smaller the negative log-likelihood function value is, the better the corresponding effect of the method is.

The session division in the step (1) is specifically realized by the following steps:

the 1-1 user click sequence has the characteristics of timeliness and continuity, and the user click sequences are divided into different sessions according to time intervals, such as clothes or food. Dividing a user click sequence into different sessions according to a time interval, specifically, if the click time interval of two adjacent articles in the sequence exceeds a threshold (30 minutes is set in an experiment), dividing the two adjacent articles into different session intervals, otherwise, dividing the two adjacent articles into the same session; the short-term interest of the user has the characteristic of timeliness, so that the short-term interest of the user can be represented only by using the click sequence in the last session interval; compared with methods for selecting the latest k point hit item sequences as short-term interests of the user, the session can more reasonably consider the characteristics of timeliness and continuity of the user hit sequences.

1-2, embedding and converting sparse characteristics of user, commodity and user behavior sequence into low-dimensional dense characteristic representation, and representing all user information into matrix

Wherein N is_uRepresenting the number of users; all article information can be expressed as

Wherein N is_iIndicating the number of the articles; the user behavior sequence is expressed as

N represents the number of commodities clicked by the user's historical behavior, b_iThe representation represents the ith behavior embedding vector in the corresponding session.

In step 1, a self-attention module is constructed to extract short-term interests and hobbies of the user in the latest session, and the method is specifically realized as follows:

dividing a user behavior sequence S into different sessions R according to time intervals, wherein the latest session sequence of a user u at the time t is expressed as

Representing the number of times of the user u click behavior in the latest session sequence at the time t, b_iRepresenting the i-th behavior embedding vector in the corresponding session, d_modelRepresenting item embedding dimensions in per click behavior；

First, a query is generated using non-linear variation of a shared parameter

Keys (keys) and

the value (value) maps to the same space;

wherein the content of the first and second substances,

is the corresponding mapping weight matrix; relu is used as an activation function that learns nonlinear interactions in features;

respectively representing a query (query), a key (key) and a value (value) matrix after mapping;

the correlation matrix is then calculated as follows:

wherein one is output

Attention moment diagram of

It represents

Similarity between individual items; wherein

For the attention value after the zoom point multiplication,

is a matrix representation of the user's short-term preferences.

And (3) extracting the long-term interest and hobby deviation conditions of the user by the multi-channel memory network in the step (2). We first describe the memory network used in the model and then the processing of the multiple channels.

Extracting the long-term interest and the interest deviation of the user through the multi-channel memory network in the step (2), which is specifically realized as follows:

define the memory matrix of user u as

Wherein

Expressing preference expression vectors of corresponding characteristics of the historical item records by the user u; the memory matrix is used for storing long-term preference of users, and the user memory matrix needs to be read and updated every time new data is added.

The operation of reading and updating the user memory matrix is as follows:

2-1. read operation in memory network

Wherein α is an enhancement parameter, | | | | represents a modulo operation, b_tThe representation represents the t-th behavior embedding vector in the corresponding session,

representing the weight values without regularization,

representing weight values after regularization;

then the weight sum of the vectors in the memory matrix is used as the final output

2-2. write operation of memory network

erase_t＝δ(E^Tb_t+b_e)

Where δ (.) denotes sigmoid activation function, ⊙ denotes dot-by-element multiplication operation, E and b_eIs the erasure parameter to learn;

an erasure vector representing the characteristic information; b_tRepresenting the t-th behavior embedding vector in the corresponding session;

wherein A and b_aIs a learnable parameter, tanh represents an activation function; such erasing, adding, or further erasingThe new strategy allows for forgetting and enhancing user interest during learning, and the model can automatically determine signals to be attenuated and enhanced using update and erase operations;

2-3, adding multiple channels to enhance the mining of the memory network on the characteristics of the bias of the user interests and hobbies; using additional interest offset matrices

To store the bias change of interest of the user u in the same type of item in the sequence t, firstly

The method comprises the following steps that m unit slots are included, wherein each unit slot is a channel which represents a characteristic channel for the migration of interest and hobbies of a user on a certain type of articles; at the time t of the sequence of user u, from the set

Wherein k feature indexes are selected, wherein

Is a weight vector in the read operation of the memory network, and updates the corresponding interest and hobby offset channel for the index number i selected arbitrarily as described in the following formula

Wherein

Is the ith cell slot of the user memory matrix, b_tA behavior embedding vector; multiple channel unit utilizes the behavior sequence information and the interest shift information of the same channel at the previous time

And memory momentsArray corresponding cell slot information

As input, then using GRU to further mine and update the change situation of user interest and hobbies under the corresponding channel; since the parameters of the GRU are shared for multiple channels, many training parameters are not added.

The interest activation unit in step (3) associates the short-term interest and hobby of the user, the long-term interest and hobby deviation characteristics of the user with the target item, and the specific implementation is as follows:

3-1, when the click rate is predicted, if the target object accords with the short-term interest of the user, the short-term interest of the user has great influence on the click event; otherwise, the short-term interest and hobbies of the user do not interfere with the click event of the user; therefore, it is stated that the short-term interest needs to be associated with the corresponding target object to effectively extract the final short-term interest expression vector of the user

Dynamically distributing the weight of each unit vector in the short-term preference matrix by using an interest activation unit; the specific formula for calculating the weight by using the interest activation unit is as follows:

wherein the content of the first and second substances,

a learnable interaction matrix representing the association target item and the user's short-term interest, d represents the final vector dimension of the item after passing through the embedding layer,

an embedded vector representation representing the item v,

indicating the short-term interest of user u at time tInterest and hobby matrix

The ith row vector of (1); presentation of results after user long-term preference associated with target item

The calculation is as follows:

wherein the content of the first and second substances,

a learnable interaction matrix representing long-term interests and interests of the associated user and the target item,

representing the long-term interest preference matrix of user u at time t

The ith row vector of (1); representation vector after user interest migration change feature is associated with target item

As described by the following equation:

wherein the content of the first and second substances,

a learnable parameter matrix representing associated target items and user interest migration changes,

an interest migration matrix representing user u at time t

The ith row vector of (1); user information, article information, U^s、U^lAnd U^dThe embedded vectors of (a) are concatenated, flattened and then input to a multi-layered perceptron to implement the score prediction.

Step (4) connecting the long-term hobby, short-term hobby and hobby deviation characteristics of the user, and then sending the connected characteristic vector into DNN for training to obtain a final click probability p (x); the corresponding loss function is as follows:

where x represents all the feature vector connections, D represents the training set, N represents the number of training sets, y ∈ {0,1} represents whether the user will click on the item, and p (.) is the final predicted result value of the network model, which represents the probability that the user will click on the item.

The invention has the following beneficial effects:

first, the user's short-term interests are time-efficient, while the user's click sequences are consistent. This means that the user clicks on items belonging to the same category, such as clothing or cake, in a sequence over a period of time. The traditional method directly selects the latest K serial items as the user's short-term interest and well expresses, and neglects the timeliness and coherence of the user's short-term interest.

Secondly, the inherent interests and hobbies of the user are relatively fixed, and meanwhile, the interests and hobbies of the user on the same type of articles can also be migrated and changed along with the change of time, and the long-term interests and hobbies deviation of the user are mined by utilizing a multi-channel memory network. The traditional long-term interest hobbies neglect the migration change characteristics of the user interests, so that the traditional long-term interest hobbies do not have good effect.

And finally, when the click rate is predicted, if the target object accords with the short-term hobbies of the user, the short-term hobbies of the user have great influence on the click event. Otherwise, the user's short-term hobbies have no effect on the user's click event. This means that the user interests are dynamic and related to the target item to be predicted. The activation unit is used for associating the interest and the hobbies of the user with the target commodity so as to generate a dynamic representation of the hobbies of the user.

The invention utilizes the self-attention mechanism to extract the short-term interest and preference of the user, and simultaneously uses the multi-channel memory network to extract the long-term interest and the interest deviation condition of the user.

Drawings

FIG. 1 is a diagram of a model of the present invention;

FIG. 2 is a schematic diagram of the short term preference extraction from attention module according to the present invention.

Detailed Description

The attached drawings disclose a flow chart of the preferred embodiment of the invention in a non-limiting way; the technical solution of the present invention will be described in detail below with reference to the accompanying drawings.

As shown in fig. 1, a click-through rate prediction method based on long-term interest modeling of a user includes the following basic steps:

step (4) linking the three characteristics processed in the step (3) and inputting the three characteristics into a multiple sensing machine to predict the click probability; short-term interest and hobbies excavated in the self-attention mechanism module, long-term interest and hobbies excavated in the multi-channel memory network are connected and offset characteristics are input into DNN to realize click rate prediction;

dividing the user behavior sequence S into different according to time intervalssession R, where the last session sequence of user u at time t is denoted as

Representing the number of times of the user u click behavior in the latest session sequence at the time t, b_iRepresenting the i-th behavior embedding vector in the corresponding session, d_modelRepresenting the item embedding dimension in each click behavior;

as shown in FIG. 2, first, a query is queried using a non-linear change in a shared parameter

Keys (keys) and

the value (value) maps to the same space;

wherein the content of the first and second substances,

is the corresponding mapping weight momentArraying; relu is used as an activation function that learns nonlinear interactions in features;

the correlation matrix is then calculated as follows:

wherein one is output

Attention moment diagram of

It represents

Similarity between individual items; wherein

For the attention value after the zoom point multiplication,

is a matrix representation of the user's short-term preferences.

define the memory matrix of user u as

Wherein

The operation of reading and updating the user memory matrix is as follows:

2-1. read operation in memory network

representing the weight values without regularization,

representing weight values after regularization;

2-2. write operation of memory network

erase_t＝δ(E^Tb_t+b_e)

wherein A and b_aIs a learnable parameter, tanh represents an activation function; this erase, add, update strategy allows for forgetting and enhancing user interest during learning, and the model can automatically determine signals to be attenuated and enhanced using update and erase operations;

Wherein k feature indexes are selected, wherein

Is a weight vector in the read operation of the memory network, and updates the index number i selected arbitrarily as described in the following formulaCorresponding interest bias channel

Wherein

Cell slot information corresponding to memory matrix

wherein the content of the first and second substances,

an embedded vector representation representing the item v,

representing a short-term interest preference matrix for user u at time t

The calculation is as follows:

wherein the content of the first and second substances,

representing the long-term interest preference matrix of user u at time t

As described by the following equation:

wherein the content of the first and second substances,

an interest migration matrix representing user u at time t

Claims

1. A click rate prediction method based on long-term interest modeling of a user is characterized by comprising the following steps:

2. The method for predicting click rate based on long and short term interests of users according to claim 1, wherein the session division in the step (1) is implemented as follows:

1-1 user click sequences have the characteristics of timeliness and continuity, and are divided into different sessions according to time intervals, specifically: if the click time interval of two adjacent articles in the user click sequence exceeds a set threshold, dividing the two adjacent articles into different session intervals, otherwise, dividing the two adjacent articles into the same session; the short-term interest of the user has the characteristic of timeliness, so that the short-term interest of the user can be represented only by using the click sequence in the last session interval;

3. The click rate prediction method based on the long-term interest modeling of the user as claimed in claim 1, wherein a self-attention module is constructed in step 1 to extract the short-term interest and hobbies of the user in the latest session, and the specific implementation is as follows:

Representing the number of times of the user u click behavior in the latest session sequence at the time t, b_iRepresenting the i-th behavior embedding vector in the corresponding session, d_mtoelRepresenting the item embedding dimension in each click behavior;

first, a query is generated using non-linear variation of a shared parameter

Keys (keys) and

the value (value) maps to the same space;

wherein the content of the first and second substances,

the correlation matrix is then calculated as follows:

wherein one is output

Attention moment diagram of

It represents

Similarity between individual items; wherein

For the attention value after the zoom point multiplication,

is a matrix representation of the user's short-term preferences.

4. The click rate prediction method based on user long-term interest modeling according to claim 3, wherein the step (2) of extracting the user long-term interest and interest bias through the multi-channel memory network is implemented as follows:

define the memory matrix of user u as

Wherein

5. The click rate prediction method based on user long-term interest modeling according to claim 4, wherein the operations of reading and updating the user memory matrix are as follows:

2-1. read operation in memory network

representing the weight values without regularization,

representing weight values after regularization;

2-2. write operation of memory network

erase_t=δ(E^Tb_t+b_e)

For storageThe user u is interested in the bias change situation of the same type of articles in the sequence t, firstly

Wherein k feature indexes are selected, wherein

Wherein

Cell slot information corresponding to memory matrix

6. The click rate prediction method based on user long-term interest modeling according to claim 1 or 5, wherein the interest activation unit in step (3) associates the characteristics of the user short-term interest, the user long-term interest and the interest bias with the target item, and the method is implemented as follows:

wherein the content of the first and second substances,

an embedded vector representation representing the item v,

representing a short-term interest preference matrix for user u at time t

The calculation is as follows:

wherein the content of the first and second substances,

representing the long-term interest preference matrix of user u at time t

As described by the following equation:

wherein the content of the first and second substances,

an interest migration matrix representing user u at time t

7. The method of claim 6, wherein the step (4) of linking the long-term interest, short-term interest and interest bias characteristics of the user, and then inputting the linked feature vectors into DNN to train and obtain a final click probability p (x); the corresponding loss function is as follows