CN112396099A

CN112396099A - Click rate estimation method based on deep learning and information fusion

Info

Publication number: CN112396099A
Application number: CN202011277167.4A
Authority: CN
Inventors: 李静梅; 黄海亮; 代昕
Original assignee: Harbin Engineering University
Current assignee: Harbin Engineering University
Priority date: 2020-11-16
Filing date: 2020-11-16
Publication date: 2021-02-23
Anticipated expiration: 2040-11-16
Also published as: CN112396099B

Abstract

The invention provides a click rate estimation method based on deep learning and information fusion, which is characterized by being divided into three modules: the system comprises a field decomposition machine FFM of a shallow extraction module, a convolutional neural network CNN of a deep extraction module and a deep confidence network DBN of a feature fusion module. The shallow layer module and the deep layer module adopt parallel structures and share fixed dense vectors converted from discrete features of users and commodities; the shallow layer module adopts a second-order combination of FFM automatic extraction features, the deep layer module adopts a CNN local perception domain to extract a high-order nonlinear feature combination, and the fusion module adopts a DBN to fuse the output of the shallow layer FFM and the deep layer CNN, so that the interaction of the shallow layer features and the deep layer features is realized. The method provided by the invention has the advantages that the internal relation between the features is excavated by combining the feature interaction depth of the shallow layer and the deep layer, the problems of gradient explosion and gradient disappearance are effectively solved, and the click prediction capability is improved.

Description

Click rate estimation method based on deep learning and information fusion

Technical Field

The invention relates to a click rate estimation method, in particular to a click rate estimation method based on deep learning and information fusion, and belongs to the field of recommendation systems.

Background

With the combination of deep learning and recommendation systems, the click rate estimation method also changes in a coverage area. From the combination of logistic regression and gradient boosting trees of initial artificial feature combination to a factorization machine FM of shallow feature automatic combination, Deep learning is proposed by the Hua Noah ark as a Deep FM, and the accuracy of click rate prediction is remarkably improved, but the existing click rate estimation method still has some problems, and the Deep DNN network has the problems of gradient explosion and gradient disappearance along with the increase of the number of layers, so that the training effect is difficult and the optimization is difficult.

In order to solve the problems of the current click rate estimation method, a more effective and accurate click rate estimation method needs to be researched.

Disclosure of Invention

Aiming at the problems in the background art, the click rate estimation method CNN-FFM based on deep learning and information fusion is provided by the method.

The purpose of the invention is realized as follows:

a click rate estimation method based on deep learning and information fusion is characterized by being divided into three modules: the system comprises a field decomposition machine FFM of a shallow extraction module, a convolutional neural network CNN of a deep extraction module and a deep confidence network DBN of a feature fusion module. The shallow layer module and the deep layer module adopt parallel structures and share fixed dense vectors converted from discrete features of users and commodities; the shallow layer module adopts a second-order combination of FFM automatic extraction features, the deep layer module adopts a CNN local perception domain to extract a high-order nonlinear feature combination, and the fusion module adopts a DBN to fuse the output of the shallow layer FFM and the deep layer CNN, so that the interaction of the shallow layer features and the deep layer features is realized.

The invention also includes such features:

the method is characterized in that the CNN-based high-order nonlinear feature extraction mode comprises the following steps:

the convolutional neural network comprises 5 convolutional layers, 5 pooling layers and 2 full-connection layers, wherein the convolutional layers extract high-order nonlinear features in a local sensing domain mode to complete deep feature combination, and the mode reduces the number of model parameters while retaining main features.

The DBN-based feature fusion mode comprises the following steps:

the output of the shallow FFM module and the deep CNN module is used as the input of the feature fusion module, the DBN is used as a fusion model, the DBN comprises a 3-layer hidden layer and a 1-layer Sigmoid layer, the DBN fusion model aims to capture the height nonlinear relation between the shallow feature and the deep feature, and the click pre-estimation judgment result is output to the interval (0,1) through a Sigmoid function.

Compared with the prior art, the invention has the beneficial effects that:

the method provided by the invention has the advantages that the internal relation between the features is excavated by combining the feature interaction depth of the shallow layer and the deep layer, the problems of gradient explosion and gradient disappearance are effectively solved, and the click prediction capability is improved.

Drawings

FIG. 1 is a schematic diagram of a convolutional neural network process of the method of the present invention;

FIG. 2 is a process diagram of a deep belief network of the method of the present invention;

FIG. 3 is a flow chart of click through rate estimation of the method of the present invention;

FIG. 4 is a schematic diagram of deep learning and information fusion based on the present invention.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings and specific embodiments.

1. The deep high-order nonlinear feature extraction model adopts a convolutional neural network CNN. And inputting the fixed dense vector converted by the embedded layer into the CNN, and extracting high-order nonlinear features by the convolution layer in a local perception domain mode to complete the deep feature combination problem.

2. The feature fusion module adopts DBN and a layer of Sigmoid function. The output of the shallow FFM module and the deep CNN module is used as the input of the feature fusion module, the DBN is used as a fusion model, the DBN fusion model aims to capture the height nonlinear relation between the shallow feature and the deep feature, and the click estimation judgment result is output to the interval (0,1) through a Sigmoid function.

The execution process of the method is divided into 8 steps:

1. embedding of user commodity data: in the data set of the user commodity, the data type features are more, so that the original data can be subjected to one-hot coding before data input to obtain sparse features; in order to simplify data of neural network parameters and reduce the calculation amount of a model, an embedding layer is added between the data and the model, and a sparse matrix is converted into a fixed dense vector through a mapping relation to be used as the input of the model. Where the vector i in the embedding layer is represented as:

wherein e_iIs represented as a feature vector that is,

the i-th dimension of the feature field, x, in which the feature is located_fieldSamples representing the ith dimension feature field. Then the output of the embedding layer

It can be expressed as:

where n represents the number of features and k represents the dimension of the embedding vector.

2. The shallow layer and the deep layer adopt a parallel structure, and the two modules share the output of the embedded layer as the input of the model.

3. The shallow layer adopts derived FFM of a traditional factorization machine, and the correlation of features in a field is more concerned; the shallow FFM mainly realizes automatic combination of shallow features, and for the feasibility of calculation, the FFM module considers the interaction between second-order features. The output of the FFM layer can be expressed as:

wherein with y_ffmRepresents the output of the FFM layer, w_iWeight, w, representing first order features₀The bias of the equation is expressed,

and representing the weight of the second-order feature interaction.

4. Extracting high-order nonlinear features by adopting a Convolutional Neural Network (CNN) in a deep layer; initialized sample weight w ═ w₁,w₂,...，w_n]And normalizing, wherein n is expressed as the number of samples; initializing the convolution layer number of the convolution neural network, the convolution kernel number of each layer, the full connection layer number, the weight w on each layer of the network and the bias b.

5. The input is a fixed dimension vector of the embedded layer output, the output is a reduced-size mapping, and each mapping is a convolution value combination of the input mapping of the upper layer; the input sample set is X ═ X₁,x₂,x₃,…x_n) Wherein n represents the size of the sample and x represents the sample; the CNN module trains a feature fusion network by adopting all feature data, minimizes the weight and bias of subsequent learning, and can be expressed as:

where n represents the size of the sample,

representing input data Y_iAnd reconstructing the data

Cross entropy loss function between.

6. The output of the shallow FFM module and the deep CNN module is used as the input of the feature fusion module, a deep confidence network (DBN) is selected as a fusion model, and the model comprises a 3-layer hidden layer and a 1-layer Sigmoid layer.

Wherein y is_ffmAnd y_cnnRepresents the output of the shallow FFM and the deep CNN, w represents the weight, b represents the equation bias,

representing the predicted value of the sample. y is_iThe true value of the sample.

7. In a DBN network, by minimizing the difference between the actual and predicted values, and back-broadcasting the parameter values, the overall loss function of the model can be defined as:

8. to prevent the model from overfitting, the model network is optimized at the hidden layer using Dropout; the DBN fusion model aims to capture the highly nonlinear relation between the shallow layer characteristic and the deep layer characteristic and output the click estimation judgment result to the interval (0,1) through a Sigmoid function.

Through the 8 steps, the click rate estimation method FFM-CNN based on deep learning and information fusion is formed. The method effectively fuses the correlation of the characteristics between the shallow layer and the deep layer, and improves the success rate of recommending click rate estimation.

The above embodiments are only examples, but the scope of the present invention is not limited thereto, and any person skilled in the art should be considered as the scope of the present invention in the light of the technical solutions disclosed in the present disclosure. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

In summary, the following steps: the invention provides a click rate estimation method CNN-FFM based on deep learning and information fusion. Most of the traditional click rate estimation methods adopt linear models, and the interaction between features depends on a large amount of labor cost; the automatic combination characteristics of the FM model of the factorization machine ignore the high-order nonlinear relation among the characteristics; and the common deep learning model can have the problems of gradient disappearance, gradient explosion and the like due to the deepening of a network layer. Aiming at the problems, the invention provides a method CNN-FFM combining a convolutional neural network CNN and a field factor decomposition machine FFM. Firstly, converting discrete characteristics of users and commodities into fixed dense vectors through one-hot coding and mapping to be used as input of a whole model; then, the shallow layer uses FFM to realize automatic combination between second-order features, and the deep layer uses a convolutional neural network CNN to extract high-order nonlinear features; and finally, the output of the shallow layer and the deep layer is used as the input of the feature fusion module to complete the fusion interaction of the shallow layer and the deep layer features. The method is combined with the internal relation between the characteristics of the shallow layer and the deep layer in the characteristic interaction depth, effectively solves the problems of gradient explosion and gradient disappearance, and improves the click estimation capability.

Claims

1. A click rate estimation method based on deep learning and information fusion is characterized by being divided into three modules: the system comprises a field decomposition machine FFM of a shallow extraction module, a convolutional neural network CNN of a deep extraction module and a deep confidence network DBN of a feature fusion module. The shallow layer module and the deep layer module adopt parallel structures and share fixed dense vectors converted from discrete features of users and commodities; the shallow layer module adopts a second-order combination of FFM automatic extraction features, the deep layer module adopts a CNN local perception domain to extract a high-order nonlinear feature combination, and the fusion module adopts a DBN to fuse the output of the shallow layer FFM and the deep layer CNN, so that the interaction of the shallow layer features and the deep layer features is realized.

2. The click rate estimation method based on deep learning and information fusion as claimed in claim 1, wherein the CNN-based high-order nonlinear feature extraction method:

3. The click rate estimation method based on deep learning and information fusion as claimed in claim 1, wherein the feature fusion mode based on DBN is as follows: