CN112801760A

CN112801760A - Sequencing optimization method and system of content personalized recommendation system

Info

Publication number: CN112801760A
Application number: CN202110338178.7A
Authority: CN
Inventors: 崔成龙
Original assignee: Nanjing Lanjingren Network Technology Co ltd
Current assignee: Nanjing Lanjingren Network Technology Co ltd
Priority date: 2021-03-30
Filing date: 2021-03-30
Publication date: 2021-05-14

Abstract

The invention discloses a ranking optimization method and system for a personalized content recommendation system. The method comprises the following steps: (I) acquiring a user click operation and, through recall, generating a preliminarily screened list of contents to be ranked; (II) scoring the preliminarily screened list with a ranking model to generate initial content-ranking score association vectors; and (III) performing secondary ranking on the initial content-ranking score association vectors based on an adaptive strategy to obtain the final ranking result. The method addresses the accuracy of user-content stickiness, and performs adaptive sampling and aggregation over the content lists pushed by the various upstream recall strategies to generate a push list that is more varied and more accurately personalized, thereby achieving both personalization and diversity of the recommended content. The invention improves the accuracy of the product recommendation system.

Description

Sequencing optimization method and system of content personalized recommendation system

Technical Field

The invention relates to a recommendation system sorting method and a recommendation system sorting system, in particular to a sorting optimization method and a sorting optimization system of a content personalized recommendation system.

Background

At present, in internet content community products, the personalized recommendation system is the technical core of the product. To improve the experience of existing users in the community, content that is likely to draw positive feedback from users needs to be pushed. This goal is achieved through the cooperation of key stages of the recommendation system such as recall, coarse ranking, fine ranking and re-ranking.

In the content ranking stage, a list of content that each user is genuinely interested in must be pushed even though the user has shown no explicit behavior. To achieve personalized and accurate pushing, three business requirements need to be met: first, the patterns in the user's historical click behavior need to be considered; second, pushing clickbait content or results from a single category needs to be avoided; third, the diversity of the recommended content must be ensured, so that pushed content feels both "familiar and unfamiliar" to the user.

Most current ranking methods have the following problems: 1. only user behavior data (such as reading duration, reading completion rate, likes and the like) is considered; 2. a relatively coarse user-content association vector is obtained using a single deep model; 3. the upstream recall strategy is too uniform, so that no diversity-oriented strategy adjustment can be made in the ranking stage.

Disclosure of Invention

Purpose of the invention: the invention provides a ranking optimization method for a recommendation system with high user-content stickiness accuracy. The invention also aims to provide a ranking optimization system based on the ranking optimization method.

The technical scheme is as follows: the sequencing optimization method of the content personalized recommendation system comprises the following steps:

(I) acquiring a user click operation and, through recall, generating a preliminarily screened list of contents to be sorted;

(II) scoring the preliminarily screened list of contents to be sorted according to a sorting model to generate an initial content-sorting score association vector;

(III) performing secondary sorting on the initial content-sorting score association vector based on a self-adaptive strategy to obtain a final sorting result.

Further, in step (I), the preliminarily screened list of contents to be sorted is a list of content ids related to the historical click data of the user.

Further, in step (II), the sorting model includes a two-tower model.

Preferably, step (II) includes:

(21) extracting user characteristic information and content characteristic information according to the list of the contents to be sorted which are preliminarily screened;

(22) evaluating the different sorting models on the metadata, or on the combined user characteristic information and content characteristic information, and selecting the sorting model with the highest score as the actual sorting model;

(23) in an off-line training stage of the recommendation system, the user characteristic information and the content characteristic information are respectively input into the actual sequencing model to obtain a user embedded vector and a content embedded vector with the same dimension;

(24) performing dot product calculation on the user embedded vector and the content embedded vector, performing cross entropy loss calculation on a dot product value and a sample label value clicked by the user, and performing backward propagation to optimize network parameters of an actual sequencing model;

(25) inputting the user characteristic information and the content characteristic information to be sorted into the optimized actual sorting model, and taking the dot product result of the model output vector as a sorting score to obtain an initial content-sorting score association vector.
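As a non-limiting illustration, steps (23) and (24) can be sketched as follows. The sketch assumes NumPy, replaces each deep tower with a single linear layer for brevity, and uses random toy data; the names `Tower` and `train_step`, the dimensions, and the learning rate are hypothetical and are not specified by the invention.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

class Tower:
    """A single linear layer standing in for a deep tower (simplifying assumption)."""

    def __init__(self, in_dim, emb_dim):
        self.W = rng.normal(scale=0.1, size=(in_dim, emb_dim))

    def forward(self, x):
        return x @ self.W

def train_step(user_tower, content_tower, x_user, x_content, labels, lr=0.05):
    """One gradient-descent step; returns the loss measured before the update."""
    u = user_tower.forward(x_user)          # user embeddings, (batch, emb_dim)
    v = content_tower.forward(x_content)    # content embeddings, same dimension
    logits = np.sum(u * v, axis=1)          # dot product per (user, content) pair
    p = sigmoid(logits)
    eps = 1e-12
    loss = -np.mean(labels * np.log(p + eps) + (1 - labels) * np.log(1 - p + eps))
    g = ((p - labels) / len(labels))[:, None]   # d(loss)/d(logit)
    grad_user = x_user.T @ (g * v)              # backpropagate into each tower
    grad_content = x_content.T @ (g * u)
    user_tower.W -= lr * grad_user
    content_tower.W -= lr * grad_content
    return loss

# Toy data: 64 (user, content) pairs with random features and click labels.
x_user = rng.normal(size=(64, 8))
x_content = rng.normal(size=(64, 6))
labels = rng.integers(0, 2, size=64).astype(float)

user_tower, content_tower = Tower(8, 4), Tower(6, 4)
losses = [train_step(user_tower, content_tower, x_user, x_content, labels)
          for _ in range(100)]
```

Both towers map their inputs to embeddings of the same dimension, so the dot product is well defined and can be used directly as the sorting score in step (25).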

Further, the user feature information includes: the content feature vector of the user click sequence, the feature vector of the user portrait indicators, and the content feature vector of the user like sequence.

Further, the content embedding vector is calculated by repeatedly calling the deep network on the content side of the actual sorting model, which outputs the embedding layer; the actual sorting model is updated and saved so that it can be queried with new content sequences during online prediction.

Further, when the content embedding vector is predicted on line, calculation is performed by calling a deep network on the user side of the actual sequencing model.

Further, step (III) includes:

(31) acquiring the initial content-sorting score association vector, counting all vector sources, and classifying each vector into a corresponding recall group;

(32) the adaptive sampling weight is calculated according to the following formula:

（1）

wherein,

represents the sampling coefficient of recall group i, n indicates the number of recall groups, and

indicates the click rate of the i-th recall group;

(33) generating a Top-K recommended content vector list, wherein the actual number of recommended content vectors of the i-th recall group is given by the following formula:

（2）

wherein,

indicating the number of recalls configured for the ith recall group,

the number of recalls after the weighted calculation of the ith recall group is represented, and the following conditions are met:

（3）

wherein m is the total number of recalled content ids;

(34) according to the deviation between the recall number of each recall group and the actual recall number, executing a sample balance processing strategy to even out the content samples;

(35) performing secondary sorting on the Top-K recommended content vector list in combination with the business logic to obtain a final sorting result.

Further, the sample balance processing strategy is: when the number of items in a certain recall group is insufficient, the recommendation system recalculates the sampling coefficients based on the missing recall number, and preferentially extracts a corresponding amount of content from the recall group having the larger value to supplement the content missing from the other recall groups.

The sequencing optimization system of the content personalized recommendation system comprises the following components:

the rough screening module is used for acquiring the clicking operation of the user, recalling and generating a list of contents to be sorted of the primary screening;

the first sequencing module is used for scoring the list of the contents to be sequenced by the sequencing model to generate an initial content-sequencing score association vector;

and the second sequencing module is used for carrying out secondary sequencing on the initial content-sequencing score association vector output by the first sequencing module based on a self-adaptive strategy to obtain a final sequencing result.

Beneficial effects: the invention has the following advantages:

1. the stickiness between each content item in the list and the user is calculated for the initial ranking, so that real-time personalized recommendation is accurately achieved for each user;

2. adaptive sampling is performed based on the diversity of the recall groups, so that the content recommended to a single user comes from different categories, making the recommendation system behave more intelligently;

3. a like sequence is added to the user behavior data to ensure the robustness of the user behavior vector features;

4. the introduction of user portrait information increases the richness and representativeness of features.

Drawings

FIG. 1 is a flow chart of a ranking optimization method of the content personalized recommendation system of the present invention;

FIG. 2 is a first ranking module framework diagram of the ranking optimization system of the content personalized recommendation system of the present invention;

FIG. 3 is a framework diagram of the second ranking module of the ranking optimization system of the content personalized recommendation system of the present invention.

Detailed Description

The technical solution of the present invention is further described below with reference to the accompanying drawings and examples.

Referring to fig. 1, a flowchart of a ranking optimization method of a content personalized recommendation system according to the present invention is shown, where the method includes:

(I) Acquiring a user click operation and, through upstream recall, generating a preliminarily screened list of contents to be sorted. Specifically:

Each time a single click operation by a user is captured, m pieces of content information data provided by the recall system are acquired, and the user id and the content id corresponding to each piece of acquired content information data are combined into a pair, recorded as (user id, content id).

(II) Scoring the preliminarily screened list of contents to be sorted according to a sorting model to generate an initial content-sorting score association vector.

Extracting user characteristic information and content characteristic information according to the list of the contents to be sorted which are preliminarily screened; wherein,

The user characteristic information comprises three types of data: 1. preprocessed feature vectors of the user click sequence; 2. basic attributes of the user, including multi-dimensional user portrait indicators such as gender, age and consumption level; 3. preprocessed feature vectors of the user like sequence, processed in the same way as the type 1 data. The preprocessing takes an average or a weighted average.

The content characteristic information comprises information such as primary classification, secondary classification and content author based on the content.

As for how to evaluate whether pushed content is accurate: in the feature engineering of other current recommendation systems, the user's degree of preference for content is generally measured comprehensively through data such as the user's clicks, likes, comments, forwards, favorites, and browsing and playing time on recommended content. Alternatively, the user's satisfaction with the app is measured through the number of times the user opens the app, the interval between the user's return visits, the length of a single stay and the like, which also reflects to some extent the user's satisfaction with the recommended content.

Therefore, in combination with the above industry experience, the user portrait is also taken into account. The user features comprise the click sequence of the user's most recent 50 content items, the like sequence of the most recent 50 content items, and fixed attributes of the user such as gender, age and consumption level. After the content feature vectors of the most recent 50 clicks are obtained, the resulting 50 x 32 matrix needs to be reduced in dimension: it is averaged and compressed into a 32-dimensional first group of user vectors. The most recent 50 likes are reduced in the same way to generate a 32-dimensional second group of user vectors. The basic characteristics of the user comprise multi-dimensional user portrait indicators such as gender, age and consumption level; these discrete features are one-hot encoded to generate a third group of user vectors. The three groups of user vectors are concatenated and used as the input of the deep neural network on the user side. Similarly, the content feature information comprises the first-level and second-level classification information of the content; after being combined into vectors, these are averaged, as with the user vectors, to generate a one-dimensional concatenated vector that is sent to the deep network on the content side.
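As a non-limiting illustration, the construction of the user-side input described above can be sketched as follows. The sketch assumes NumPy and plain (unweighted) averaging; the function names and the vocabulary sizes for gender, age bucket and consumption level are hypothetical values chosen only for the example.

```python
import numpy as np

def pool_sequence(seq_vectors):
    """Average a (seq_len, dim) matrix of content feature vectors into one (dim,) vector."""
    return np.asarray(seq_vectors, dtype=float).mean(axis=0)

def one_hot(index, size):
    """One-hot encode a discrete attribute value."""
    vec = np.zeros(size)
    vec[index] = 1.0
    return vec

def build_user_input(click_seq, like_seq, gender, age_bucket, spend_level,
                     n_gender=3, n_age=6, n_spend=4):
    """Concatenate the three groups of user vectors into one input vector."""
    click_vec = pool_sequence(click_seq)      # first group: 50 x 32 -> 32
    like_vec = pool_sequence(like_seq)        # second group: 50 x 32 -> 32
    profile_vec = np.concatenate([            # third group: one-hot portrait indicators
        one_hot(gender, n_gender),
        one_hot(age_bucket, n_age),
        one_hot(spend_level, n_spend),
    ])
    return np.concatenate([click_vec, like_vec, profile_vec])

# Toy usage: 50 clicked and 50 liked items, each a 32-dimensional feature vector.
click_seq = np.ones((50, 32))
like_seq = np.zeros((50, 32))
user_input = build_user_input(click_seq, like_seq, gender=1, age_bucket=2, spend_level=0)
```

With the assumed vocabulary sizes, the concatenated input has 32 + 32 + 13 = 77 dimensions.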

As shown in FIG. 2, model training of the recommendation system is performed in the offline stage, and the combined user features and combined content features are fed separately into the selected deep neural network model. The method of the invention tests classical ranking models such as the two-tower model, Google's Wide & Deep model, and Alibaba's DIN and DIEN models. Based on metrics such as accuracy and F1 on the validation set, the model with the highest accuracy and F1, namely the two-tower model, is selected as the baseline ranking model. The combined feature vectors of the user and of the content are passed through the model to compute a user embedding vector and a content embedding vector, which serve as low-dimensional semantic representations of the user and the content respectively.

The dot product of the two vectors is compared with the sample label through a cross-entropy loss, and backpropagation is performed to optimize the network parameters. In addition, the content embedding vectors are calculated by calling the deep tower network on the content side of the model, and the model is saved to the online environment so that new content feature information can be queried in order during online prediction.

Meanwhile, in the online prediction stage, the combined feature vector obtained from the user feature information and content feature information of a new user also needs to be calculated by calling the deep network on the user side of the model. After the user embedding vector is generated, a dot-product operation is performed between it and the content embedding vector of each content item stored in the model, and the resulting logit is taken as the score of the content-ranking score association vector. The output format of this process is (user id, content id, ranking score).
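As a non-limiting illustration, the online scoring described above can be sketched as follows. The sketch assumes NumPy, a user embedding already produced by the user-side network, and content embeddings already stored per content id; the function name `score_candidates` is hypothetical, and sorting the triples by score is a convenience added for the example.

```python
import numpy as np

def score_candidates(user_id, user_embedding, content_embeddings):
    """Return (user id, content id, ranking score) triples, highest score first.

    content_embeddings maps each content id to its stored embedding vector.
    """
    triples = [
        (user_id, content_id, float(np.dot(user_embedding, emb)))  # dot-product logit
        for content_id, emb in content_embeddings.items()
    ]
    triples.sort(key=lambda t: t[2], reverse=True)
    return triples

# Toy usage with 2-dimensional embeddings.
user_embedding = np.array([1.0, 0.0])
content_embeddings = {
    "a": np.array([0.5, 1.0]),
    "b": np.array([2.0, 0.0]),
    "c": np.array([-1.0, 3.0]),
}
triples = score_candidates("u1", user_embedding, content_embeddings)
```

Because the content embeddings are precomputed, only the user-side network has to be evaluated per request in this arrangement.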

(III) As shown in FIG. 3, the module obtains the set of (user id, content id, ranking score) triples predicted online by the two-tower ranking model. The content source of every triple is counted, that is, it is determined which recall strategy pushed the content; each triple is then classified according to the result and tagged with the identifier of the corresponding recall strategy group.

Here, there are a total of five recall groups, as shown in table 1 below.

TABLE 1

Recall group name	Grouping principle
i2i	Recall based on content-to-content similarity
u2i	Recall based on user-to-content preference
up	Recall based on the user portrait
hot	Recall based on hot content
u2u	Recall based on user-to-user similarity
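As a non-limiting illustration, classifying the scored triples into the five recall groups of Table 1 can be sketched as follows. The mapping `source_of` from content id to recall group is a hypothetical input assumed to be supplied by the upstream recall system.

```python
RECALL_GROUPS = ("i2i", "u2i", "up", "hot", "u2u")

def group_by_recall(triples, source_of):
    """Classify (user id, content id, score) triples by the recall group that pushed them."""
    groups = {name: [] for name in RECALL_GROUPS}
    for user_id, content_id, score in triples:
        groups[source_of[content_id]].append((user_id, content_id, score))
    for items in groups.values():
        items.sort(key=lambda t: t[2], reverse=True)   # best-scored content first
    return groups

# Toy usage.
triples = [("u1", "c1", 0.9), ("u1", "c2", 0.4), ("u1", "c3", 0.7)]
source_of = {"c1": "i2i", "c2": "hot", "c3": "i2i"}
groups = group_by_recall(triples, source_of)
```

Groups that pushed no content remain present as empty lists, which makes a shortfall in any group easy to detect later.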

(IV) The adaptive sampling weight is calculated from each group's click-rate index over the latest 30 days, giving the sampling coefficient corresponding to each recall group

：

（1）

Wherein,

represents the sampling coefficient of recall group i, n represents the number of recall groups, and

indicates the click rate of the i-th recall group.

The calculated sampling weight of each recall group

and the total number of content items pushed by that recall group

are substituted into the following formula to generate a Top-K recommended content vector list:

（2）

wherein,

indicating the number of recalls configured for the ith recall group,

represents the weighted recall number of the i-th recall group (per Table 1, n = 5), which satisfies the following condition:

（3）

where m is the total number of recalled content ids.
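Formulas (1) to (3) are not reproduced in this text, so the following is only a sketch under an assumed form and not the formulas of the invention: the sampling coefficient of each recall group is taken to be its click rate normalized over all groups, and the per-group quota is the coefficient times the list size, rounded by the largest-remainder method so that the quotas sum exactly to the list size. The function names and the click-rate values are hypothetical.

```python
def sampling_coefficients(click_rates):
    """Assumed form: coefficient of a group = its click rate / sum of all click rates."""
    total = sum(click_rates.values())
    if total <= 0:
        # No click signal at all: fall back to uniform coefficients.
        return {g: 1.0 / len(click_rates) for g in click_rates}
    return {g: rate / total for g, rate in click_rates.items()}

def allocate_quotas(coefficients, list_size):
    """Split list_size across groups in proportion to the coefficients."""
    raw = {g: w * list_size for g, w in coefficients.items()}
    quotas = {g: int(r) for g, r in raw.items()}           # round down first
    leftover = list_size - sum(quotas.values())
    by_remainder = sorted(raw, key=lambda g: raw[g] - quotas[g], reverse=True)
    for g in by_remainder[:leftover]:                      # hand out the remaining slots
        quotas[g] += 1
    return quotas

# Toy usage with illustrative 30-day click rates for the five groups of Table 1.
click_rates = {"i2i": 0.10, "u2i": 0.05, "up": 0.02, "hot": 0.02, "u2u": 0.01}
coefficients = sampling_coefficients(click_rates)
quotas = allocate_quotas(coefficients, list_size=20)
```

Under this assumed form, a group with a higher recent click rate contributes proportionally more items to the recommended list.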

Although, in the ideal case, the Top-K content sequence pushed downstream can be extracted accurately according to the above strategy, in practice the content list pushed by the upstream recall system is likely to be uneven across groups; for example, a certain recall group may not contain enough items. Recall balance therefore needs to be evaluated and a corresponding processing strategy applied. When a recall group contains too few items, the recommendation system recalculates the sampling coefficients based on the missing recall number and preferentially extracts a corresponding amount of content from the recall group having the larger value to supplement the content missing from the other recall groups. Finally, the business logic of the product is applied, and the Top-K recommended content list is sent to the downstream stage after secondary sorting.
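As a non-limiting illustration, the balancing strategy described above can be sketched as follows. This is a simplification: instead of recalculating the sampling coefficients, the sketch caps each group at what it actually recalled and hands the shortfall to the groups with the largest coefficients that still have spare content. The function name and all input values are hypothetical.

```python
def balance_quotas(quotas, available, coefficients):
    """Redistribute the shortfall of under-filled recall groups to better-stocked ones."""
    # A group cannot contribute more items than it actually recalled.
    final = {g: min(quotas[g], available[g]) for g in quotas}
    shortfall = sum(quotas.values()) - sum(final.values())
    # Fill the gap from groups with larger coefficients first.
    for g in sorted(quotas, key=lambda name: coefficients[name], reverse=True):
        if shortfall == 0:
            break
        spare = available[g] - final[g]
        take = min(spare, shortfall)
        final[g] += take
        shortfall -= take
    return final

# Toy usage: "u2i" and "hot" recalled fewer items than their quotas.
quotas = {"i2i": 10, "u2i": 5, "up": 2, "hot": 2, "u2u": 1}
available = {"i2i": 30, "u2i": 3, "up": 2, "hot": 0, "u2u": 5}
coefficients = {"i2i": 0.5, "u2i": 0.25, "up": 0.1, "hot": 0.1, "u2u": 0.05}
final = balance_quotas(quotas, available, coefficients)
```

In the toy data the four missing items are all supplied by the i2i group, which has both the largest coefficient and the most spare content.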

The preliminarily screened list of contents to be sorted is a list of content ids related to the historical click data of the user.

The first sequencing module further comprises:

the preprocessing subunit is used for extracting user characteristic information and content characteristic information according to the preliminarily screened list of contents to be sorted, evaluating the different sorting models on the metadata, or on the combined user characteristic information and content characteristic information, and selecting the sorting model with the highest score as the actual sorting model;

the calculation subunit is used for respectively inputting the user characteristic information and the content characteristic information into the actual sequencing model in an offline training stage of the recommendation system to obtain a user embedded vector and a content embedded vector with the same dimension; performing dot product calculation on the user embedded vector and the content embedded vector, performing cross entropy loss calculation on a dot product value and a sample label value clicked by the user, and performing backward propagation to optimize network parameters of an actual sequencing model; inputting the user characteristic information and the content characteristic information to be sorted into the optimized actual sorting model, and taking the dot product result of the model output vector as a sorting score to obtain an initial content-sorting score association vector.

Further, the user feature information includes: the content feature vector of the user click sequence, the user portrait index and the content feature vector of the user approval sequence.

The second sorting module is specifically configured to obtain the initial content-sorting score association vectors, count all vector sources, and classify each vector into a corresponding recall group; the adaptive sampling weight is calculated according to the following formula:

（1）

wherein,

indicates the click rate of the i-th recall group;

generating a Top-K recommended content vector list according to the following formula, wherein the number of the actual recommended content vector lists of the ith recall group is as follows:

（2）

wherein,

indicating the number of recalls configured for the ith recall group,

（3）

wherein m is the total number of recalled content ids;

executing a sample balance processing strategy, according to the deviation between the recall number of each recall group and the actual recall number, to even out the content samples; and performing secondary sorting on the Top-K recommended content vector list in combination with the business logic to obtain a final sorting result.

According to

Claims

1. A sequencing optimization method of a content personalized recommendation system is characterized by comprising the following steps:

2. The ranking optimization method of the content personalized recommendation system according to claim 1, wherein in step (one), the list of the content to be initially screened is a list of content ids related to historical click data of the user.

3. The ranking optimization method of the content personalized recommendation system according to claim 1, wherein in the step (two), the ranking model comprises a double tower model.

4. The ranking optimization method of the content personalized recommendation system according to claim 1, wherein the step (two) includes:

5. The ranking optimization method of the content personalized recommendation system according to claim 4, wherein in the step (21), the user feature information includes a feature vector of a user click sequence, a feature vector of a user portrait indicator, and a feature vector of a user favorite sequence.

6. The ranking optimization method of the content personalized recommendation system according to claim 4, wherein the content embedding vector is calculated by continuously calling a deep network on a content side of the actual ranking model, and the actual ranking model is updated and saved for use in online predicted new content sequence query.

7. The ranking optimization method of the content personalized recommendation system according to claim 4, wherein the content embedding vector is calculated by calling a deep network of a user side of the actual ranking model when predicted on line.

8. The ranking optimization method of the content personalized recommendation system according to claim 1, wherein the step (three) includes:

（1）

wherein,

indicating the click rate of the ith recall packet;

（2）

wherein,

indicating the number of recalls configured for the ith recall group,

（3）

wherein m is the total number of recalled content ids;

(34) number of recalls grouped according to recall

9. The ranking optimization method of the content personalized recommendation system according to claim 8, wherein the sample balancing processing policy is: when a certain number of recall packets is insufficient

According to

10. A ranking optimization system for a content personalized recommendation system, the system comprising: