CN112732936B

CN112732936B - Radio and television program recommendation method based on knowledge graph and user microscopic behaviors

Info

Publication number: CN112732936B
Application number: CN202110033041.0A
Authority: CN
Inventors: 詹会兰; 向超; 雷航; 杨茂林
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2021-01-11
Filing date: 2021-01-11
Publication date: 2022-03-29
Anticipated expiration: 2041-01-11
Also published as: CN112732936A

Abstract

The invention provides a radio and television program recommendation method based on a knowledge graph and user microscopic behaviors. The method combines item2vec with random walks: random walk sequences drawn from attribute subgraphs are trained together with user behavior sequences, yielding embedding vectors that fuse similarity at two levels, program content attributes and user interaction sessions. Then, given the program embeddings and the classified, fused microscopic behavior embeddings, each behavior embedding is concatenated with the corresponding program embedding according to the historical interaction records between users and programs, and a semantic space network maps each behavior-program pair to a semantic representation of the same dimensionality, forming the embedding sequence of the user's historical behaviors. Finally, based on a Transformer encoder-decoder mechanism, self-attention encoding is applied to the user's historical behavior sequence to obtain user semantic features that imply dynamic preference, and these features are decoded with attention from the target program to obtain the user's semantic preference.

Description

Radio and television program recommendation method based on knowledge graph and user microscopic behaviors

Technical Field

The invention belongs to the technical field of radio and television program recommendation, and particularly relates to a radio and television program recommendation method based on a knowledge graph and user microscopic behaviors.

Background

With the convergence of three network services of a telecommunication network, a broadcast network and a computer communication network, services provided by the broadcast network are richer and faster. Due to the increasing number of television channels and the advent of IPTV services and new media services, more and more television program content is available to users exposed to television terminals. However, such an excessive amount of television programming also places a burden on television viewers because searching for their favorite television programming content takes longer. The recommendation system can help users to efficiently screen out interesting information, and is beneficial to television audiences to conveniently and effectively obtain favorite program contents.

Research on recommendation algorithms in the radio and television field falls into several main categories. The first is general recommendation methods, including simple statistical algorithms and traditional collaborative filtering. From the perspective of user preference modeling, this work focuses on mining static correlations between users and items and ignores how user preferences change over time. Some studies cluster television viewers by the types of programs they watch and then recommend programs through collaborative filtering. Some set an empirically chosen program-type threshold to cluster television programs, then cluster users by indicators such as the types and durations of the programs they watch, and recommend through collaborative filtering. Some define an implicit scoring function that converts a user's viewing behavior into a rating of the program in order to provide a recommendation strategy, and implement it on the Hadoop distributed framework. Others assume that viewing duration is directly proportional to the degree of preference and recommend by ranking.

The second category takes the dynamic change of user interest into account and introduces a time decay function to characterize the drift of user interest over a time span. The time decay functions in these models are mostly constructed subjectively, their effects vary, and in a big data environment they face data sparsity and cold-start problems. Some studies introduce a time decay function into the underlying factorization model, making the factorization result time dependent. Others define a time period function and establish decay factors to simulate changes in user interest.

The third category recommends items of interest to a user based on sequential pattern mining. These sequence-based models equate the user-item interaction sequence with the user's behavior and ignore the user's various behavior types; they do not consider the differences in user interest implied by the user's different feedback behaviors toward items. Moreover, when mining dynamic changes in user preferences, these works obtain item embeddings only from the feature side and often ignore the intrinsic relationships between the content attributes of items.

Disclosure of Invention

The prior art ignores the various behavior types of users, does not consider the differences in user interest implied by users' different feedback behaviors, and, when mining dynamic changes in user preferences, obtains item embeddings from the feature side while often ignoring the intrinsic relationships between the content attributes of items. To address these problems, the invention provides a radio and television program recommendation method based on a knowledge graph and user microscopic behaviors. By combining item2vec with random walks, the random walk sequences obtained from knowledge graph attribute subgraphs are trained together with session-based user behavior sequences, and the resulting embedding vectors fuse similarity at two levels: program content attributes and user interaction sessions. Then, given the program embeddings and the classified, fused microscopic behavior embeddings, each behavior embedding is concatenated with the corresponding program embedding according to the historical interaction records between users and programs, and a semantic space network maps each behavior-program pair to a semantic representation of the same dimensionality, forming the embedding sequence of the user's historical behaviors. Finally, based on a Transformer encoder-decoder mechanism, self-attention encoding is applied to the user's historical behavior sequence to obtain user semantic features that imply dynamic preference, and these features are decoded with attention from the target program to obtain the user's semantic preference. Through these operations, the invention achieves content recommendation that is more deeply tied to user behavior, program content attributes and other factors, so that recommendations are closer to the user's interests and preferences.

The specific implementation content of the invention is as follows:

the invention provides a radio and television program recommendation method based on a knowledge graph and user microscopic behaviors, which specifically comprises the following steps of:

step S1: constructing a knowledge graph in the field of broadcasting and television;

step S2: organizing user microscopic behavior data: dividing the behavior interaction data of a user into continuous microscopic behaviors and discrete microscopic behaviors according to behavior duration;

step S3: extracting the attribute subgraphs of the knowledge graph in the broadcasting and television field constructed in step S1, and extracting random walk program sequences Q from the attribute subgraphs by random walk; one random walk sequence Q is extracted from each attribute subgraph, and the random walk sequences Q of all the attribute subgraphs form an item sequence set $H_k$;

step S4: constructing a time session-behavior type matrix from the behavior interaction data of the user organized in step S2, and calculating the session similarity; finally, extracting a session-based user interactive program sequence H;

step S5: taking the item sequence set $H_k$ obtained in step S3 together with the user interactive program sequence H obtained in step S4 as the input of the Item2vec algorithm, and obtaining program embeddings through training;

step S6: generating user microscopic behavior embeddings from the behavior interaction data of the user organized in step S2; then, according to the historical interaction records between the user and programs, concatenating each user microscopic behavior embedding with the corresponding program embedding, and generating a semantic representation of the same dimensionality for each behavior-program pair through semantic space network mapping, wherein the semantic representations of all behavior-program pairs form the embedding sequence of the user's historical behaviors, namely the user-behavior latent semantic sequence;

step S7: performing user dynamic preference learning based on a Transformer mechanism: the Transformer mechanism comprises a self-attention mechanism and an ordinary attention mechanism, both based on multi-head attention; the self-attention mechanism learns the relevance among the items in the user-behavior latent semantic sequence to obtain the latent semantic features of the user, the ordinary attention mechanism decodes the latent semantic features of the user according to the target program to obtain the semantic preference of the user, and programs are recommended to the user according to that semantic preference.
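Step S5 feeds both sequence sets to Item2vec. The following is a minimal sketch of that merge, assuming a skip-gram style trainer with a fixed context window. The `training_pairs` helper, the `window` parameter and the program identifiers are hypothetical; the source does not specify Item2vec hyperparameters, and the embedding training itself is not shown.

```python
def training_pairs(sequences, window=2):
    """Build (target, context) program pairs from every sequence."""
    pairs = []
    for seq in sequences:
        for i, target in enumerate(seq):
            lo = max(0, i - window)
            hi = min(len(seq), i + window + 1)
            pairs.extend((target, seq[j]) for j in range(lo, hi) if j != i)
    return pairs

# H_k: random walk sequences from attribute subgraphs (step S3).
H_k = [["p1", "p2", "p3"]]
# H: session-based user interaction sequences (step S4).
H = [["p3", "p4"]]

# Both sets are simply pooled into one training corpus.
corpus = H_k + H
pairs = training_pairs(corpus, window=1)
```

Because the two sets share program identifiers, a program such as `p3` receives context from both its attribute neighbours and its session neighbours, which is how the two kinds of similarity end up in one embedding.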

In order to better implement the present invention, further, the specific operation of step S3 includes:

step S3.1: determining the attributes of radio and television programs, and querying the knowledge graph with SPARQL triple query statements according to the attribute keywords to form a plurality of attribute subgraphs, one per attribute; the attributes comprise director, actor, language, genre, screenwriter and region;

step S3.2: performing a random walk on each attribute subgraph to generate a random walk sequence Q; all the random walk sequences obtained together form the item sequence set $H_k = \{Q_1, Q_2, \ldots, Q_n\}$, where $Q_i = \{x_1, x_2, \ldots, x_n\}$ denotes the i-th generated random walk sequence, $i = 1, 2, 3, \ldots, n$;

in the process of random walk, given the starting node $c_0 = v$ and denoting the i-th node of the walk by $c_i$, the probability of the random walk moving from node v to node x is:

$$P(c_i = x \mid c_{i-1} = v) = \begin{cases} \dfrac{\pi_{vx}}{Z}, & (v, x) \in E \\ 0, & \text{otherwise} \end{cases}$$

wherein E is the edge set of the attribute subgraph, $\pi_{vx}$ is the unnormalized transition probability between node v and node x, and Z is the normalization constant; $\pi_{vx}$ is the entity edge weight $w_{vx}$ multiplied by a coefficient, expressed as:

$$\pi_{vx} = \alpha_{pq}(t, x) \cdot w_{vx};$$

wherein the coefficient $\alpha_{pq}(t, x)$ is calculated as:

$$\alpha_{pq}(t, x) = \begin{cases} 1/p, & d_{tx} = 0 \\ 1, & d_{tx} = 1 \\ 1/q, & d_{tx} = 2 \end{cases}$$

in the formula for $\alpha_{pq}(t, x)$, t represents the previous node of the random walk, x represents the next node of the random walk, and the values of p and q control the depth-first and breadth-first tendencies of the walk; $d_{tx}$ represents the shortest distance between node t and node x.
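The biased walk of step S3.2 can be sketched as follows. This is an illustration only, assuming the standard node2vec form of the bias (1/p to return to t, 1 when x is also a neighbour of t, 1/q otherwise) and a hypothetical adjacency-dictionary graph; the function names and the toy subgraph are not from the source.

```python
import random

def alpha(p, q, d_tx):
    """Search bias alpha_pq(t, x) as a function of the distance d_tx."""
    if d_tx == 0:
        return 1.0 / p   # step back to the previous node t
    if d_tx == 1:
        return 1.0       # x is also a neighbour of t
    return 1.0 / q       # d_tx == 2: move further away from t

def biased_walk(graph, start, length, p=1.0, q=1.0, rng=None):
    """graph maps node -> {neighbour: edge weight w_vx} (undirected)."""
    rng = rng or random.Random()
    walk = [start]
    while len(walk) < length:
        v = walk[-1]
        neighbours = list(graph[v])
        if not neighbours:
            break
        if len(walk) == 1:
            # No previous node yet: use the edge weights alone.
            weights = [graph[v][x] for x in neighbours]
        else:
            t = walk[-2]
            weights = []
            for x in neighbours:
                d_tx = 0 if x == t else (1 if x in graph[t] else 2)
                weights.append(alpha(p, q, d_tx) * graph[v][x])
        # random.choices normalizes the weights, which plays the role of Z.
        walk.append(rng.choices(neighbours, weights=weights, k=1)[0])
    return walk

# Toy attribute subgraph: programs linked because they share an attribute.
graph = {
    "prog_a": {"prog_b": 1.0, "prog_c": 1.0},
    "prog_b": {"prog_a": 1.0, "prog_c": 1.0},
    "prog_c": {"prog_a": 1.0, "prog_b": 1.0, "prog_d": 2.0},
    "prog_d": {"prog_c": 2.0},
}
Q = biased_walk(graph, "prog_a", length=6, p=2.0, q=0.5, rng=random.Random(0))
```

With q below 1 the walk favours nodes farther from t (depth-first tendency); with p below 1 it tends to return to t and stay local.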

In order to better implement the present invention, further, the step S4 specifically includes the following steps:

step S4.1: constructing a time session-behavior type matrix: assuming that the length of a user's historical interaction sequence is h, the number of behavior types is n, and the number of programs interacted with per session is k, the user's historical interaction record contains m = h/k sessions, and the time period of the i-th session is $t_i$, $i = 1, 2, \ldots, m$; the time session-behavior type matrix TSA is then formed as:

$$TSA = \begin{pmatrix} x_{11} & x_{12} & \cdots & x_{1n} \\ x_{21} & x_{22} & \cdots & x_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ x_{m1} & x_{m2} & \cdots & x_{mn} \end{pmatrix}$$

namely: $TSA = \{x_{ij}\}_{m \times n}$, $1 \le i \le m$, $1 \le j \le n$;

wherein $x_{ij}$ is the frequency with which the user performs microscopic behavior type j in the i-th session, within time period $t_i$; m and n are the numbers of rows and columns of the matrix and represent, respectively, the number of sessions obtained with session length k and the number of behavior types;
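A small sketch of how the TSA matrix could be filled, assuming the history is a time-ordered list with one behavior label per interacted program. The `build_tsa` helper and the behavior labels are hypothetical.

```python
def build_tsa(interactions, behavior_types, k):
    """Return the m x n count matrix, with m = len(interactions) // k."""
    column = {b: j for j, b in enumerate(behavior_types)}
    m = len(interactions) // k
    tsa = [[0] * len(behavior_types) for _ in range(m)]
    for idx, behavior in enumerate(interactions[: m * k]):
        # Interaction idx falls into session idx // k.
        tsa[idx // k][column[behavior]] += 1
    return tsa

behavior_types = ["live", "vod", "like"]
history = ["live", "live", "vod", "like", "vod", "vod"]  # h = 6
TSA = build_tsa(history, behavior_types, k=3)            # m = 2, n = 3
```

Each row of `TSA` is the behavior-type histogram of one session, which is the input to the session similarity of step S4.2.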

step S4.2: first, the behavior similarity between time period $t_i$ and time period $t_j$ is calculated from the time session-behavior type matrix.

The specific calculation formula is as follows:

wherein a is a behavior type, A is the set of behavior types, and the frequency term in the formula represents how often the user generates behavior type a in session s during time period $t_i$, which is equivalent to $x_{ij}$ in the TSA matrix;

then, the interval $dist(t_i, t_j)$ between time period $t_i$ and time period $t_j$ is calculated; the specific calculation formula is as follows:

then, the similarity of session i and session j is calculated; the specific calculation formula is as follows:

step S4.3: sorting out the user interactive program sequence: sessions are divided in the direction of increasing time, and the similarity between adjacent sessions is calculated in turn; the calculation formula of the accumulated sum of session similarities is as follows:

wherein m is the number of sessions obtained with session length k; if the threshold for the accumulated sum of session similarities is Y, the value of k at which sum(se) of user u exceeds Y is taken as the session length of that user, and the user's interactive program sequence is extracted as $H = \{S_1, S_2, \ldots, S_n\}$.
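The selection of the session length in step S4.3 can be illustrated with the sketch below. The similarity formulas of step S4.2 are not reproduced in this text, so plain cosine similarity between adjacent session rows is swapped in as a stand-in; it ignores the time interval term and is an assumption, not the patented measure. The helper names and candidate lengths are hypothetical.

```python
import math

def cosine(u, v):
    """Stand-in session similarity (assumption, not the source formula)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def choose_session_length(interactions, n_types, candidate_ks, threshold,
                          similarity=cosine):
    """Return the first k whose summed adjacent-session similarity exceeds Y."""
    for k in candidate_ks:
        m = len(interactions) // k
        rows = [[0] * n_types for _ in range(m)]
        for idx, behavior in enumerate(interactions[: m * k]):
            rows[idx // k][behavior] += 1
        total = sum(similarity(rows[i], rows[i + 1]) for i in range(m - 1))
        if total > threshold:
            return k
    return None

# Behavior type indices of one user, in time order.
interactions = [0, 0, 1, 1, 0, 0, 1, 1]
k_user = choose_session_length(interactions, n_types=2,
                               candidate_ks=[2, 4], threshold=0.5)
```

Here sessions of length 2 alternate between two disjoint behavior patterns and score zero, while sessions of length 4 look alike, so `k_user` is 4.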

In order to better implement the present invention, further, the specific operations of step S6 are:

step S6.1: calculating the user microscopic behavior weight:

firstly, let N be the number of user microscopic behavior types obtained in step S2, comprising both continuous and discrete microscopic behaviors; the total numbers of occurrences of each microscopic behavior type are denoted $A_1, A_2, \ldots, A_N$;

secondly, the normalized weights $a_1, a_2, \ldots, a_N$ corresponding to each user microscopic behavior type are calculated; the specific calculation formula is as follows:
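As an illustration of step S6.1, the sketch below assumes the simplest normalization, each count divided by the sum of all counts. The weight formula is not reproduced in this text, so this choice, the `behavior_weights` helper and the counts are assumptions.

```python
def behavior_weights(counts):
    """Assumed normalization: a_g = A_g / sum of all A."""
    total = sum(counts.values())
    return {behavior: count / total for behavior, count in counts.items()}

# Hypothetical total occurrence counts A_1..A_N per behavior type.
counts = {"live": 50, "vod": 30, "favorite": 15, "purchase": 5}
weights = behavior_weights(counts)
```

Under this assumption the weights sum to 1, and a behavior type that occurs more often receives a larger weight.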

step S6.2: obtaining the type vector $vec(a_g)$ of the user's microscopic behavior: one-hot encoding is used to obtain a vector representation for each implicit feedback behavior, with the formula:

$$vec(a_g) \in \mathbb{R}^{|A|}, \quad g = 1, 2, \ldots, |A|$$

step S6.3: obtaining the duration vector representation $vec(a_d)$ of the user's microscopic behavior:

for continuous microscopic behaviors, the ratio of the duration of the user's continuous microscopic behavior on a program to the total length of the program is divided into ten levels, [0, 0.1), [0.1, 0.2), [0.2, 0.3), ..., [0.9, 1], and one-hot encoding over these ten levels generates the vector representation $vec(a_d)$, with $vec(a_d) \in \mathbb{R}^{10}$;

for discrete microscopic behaviors, an all-zero vector of the same length is used as a placeholder to generate the vector representation, i.e. $vec(a_d) = [0, 0, 0, \ldots, 0]$;

step S6.4: vectorizing the microscopic behavior of user u on program i, expressed as:

$$vec(a_{u,i}) = a_i \cdot vec(a_g) + vec(a_d)$$

the "+" sign in the formula denotes vector concatenation, $a_i$ is the microscopic behavior weight, $vec(a_g)$ is the type vector of the microscopic behavior, and $vec(a_d)$ is the duration vector of the microscopic behavior;
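Steps S6.2 to S6.4 can be sketched together as follows. The behavior labels, the weights and the helper names are hypothetical; only the one-hot type vector, the ten duration levels, the all-zero placeholder for discrete behaviors and the concatenation come from the text.

```python
BEHAVIOR_TYPES = ["live", "vod", "search", "purchase", "favorite", "like"]
DURATION_LEVELS = 10

def one_hot(index, size):
    vec = [0.0] * size
    vec[index] = 1.0
    return vec

def duration_vector(watched, total):
    """Ten-level one-hot for continuous behaviors, zeros for discrete ones."""
    if watched is None:
        return [0.0] * DURATION_LEVELS
    ratio = min(max(watched / total, 0.0), 1.0)
    # A ratio of exactly 1.0 belongs to the last level [0.9, 1].
    level = min(int(ratio * DURATION_LEVELS), DURATION_LEVELS - 1)
    return one_hot(level, DURATION_LEVELS)

def behavior_vector(behavior, weight, watched=None, total=None):
    """vec(a_{u,i}): weighted type vector concatenated with duration vector."""
    index = BEHAVIOR_TYPES.index(behavior)
    type_vec = [weight * v for v in one_hot(index, len(BEHAVIOR_TYPES))]
    return type_vec + duration_vector(watched, total)

# 21 minutes of a 60-minute on-demand program: ratio 0.35, level [0.3, 0.4).
vod_vec = behavior_vector("vod", weight=0.3, watched=1260, total=3600)
# A discrete "like" has no duration, so its duration part is all zeros.
like_vec = behavior_vector("like", weight=0.1)
```

Both vectors have the same length (6 + 10), so continuous and discrete behaviors can be fed to the same downstream layer.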

step S6.5: performing time encoding: continuous time is discretized, a time embedding is obtained, and the embedding is fed into the model for training and learning:

for continuous microscopic behavior: firstly, the user's historical behavior interaction sequence is extracted;

then the timestamps of the historical behavior interaction sequence are extracted as $T = [t_1, t_2, t_3, \ldots]$, and the timestamp of the clicked target program is denoted $t_p$;

then the sequence $T_\Delta$ of time intervals between the target program and the user's historical interactive programs is calculated; the specific calculation formula is:

$$T_\Delta = [t_p - t_1, t_p - t_2, t_p - t_3, \ldots] = [\Delta t_1, \Delta t_2, \Delta t_3, \ldots];$$

for discrete microscopic behavior: discretized time intervals are defined, denoted $[0, 1), [1, 2), [2, 4), \ldots, [2^k, 2^{k+1}), \ldots$; each time interval is in units of hours; each item of $T_\Delta$ is mapped to a discrete time interval and then to a one-hot code, giving the time code $vec(t_i)$ of the item interaction behavior;
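The interval bucketing of step S6.5 might look like the sketch below. It assumes timestamps in seconds, non-negative intervals and a fixed number of buckets with the last one absorbing everything larger; `num_buckets` and the function names are hypothetical.

```python
import math

def time_bucket(delta_hours, num_buckets):
    """Index of the interval [0,1), [1,2), [2,4), ..., [2^k, 2^(k+1))."""
    if delta_hours < 1:
        return 0
    # [2^k, 2^(k+1)) has index k + 1; cap at the last available bucket.
    return min(int(math.log2(delta_hours)) + 1, num_buckets - 1)

def encode_intervals(history_ts, target_ts, num_buckets=8):
    """One-hot time code vec(t_i) for every historical interaction."""
    codes = []
    for t in history_ts:
        delta_hours = (target_ts - t) / 3600.0
        code = [0] * num_buckets
        code[time_bucket(delta_hours, num_buckets)] = 1
        codes.append(code)
    return codes

# Two past interactions, 6 hours and 1 hour before the target click.
codes = encode_intervals([0, 5 * 3600], target_ts=6 * 3600)
```

The exponentially growing intervals give fine resolution to recent interactions and coarse resolution to old ones.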

step S6.6: performing behavior semantic space embedding: on the basis of the different behavior representations, item embeddings and time codes obtained above, the user behavior $u_i$, i.e. the behavior of user u on item i, is defined as:

$$u_i = vec(a_{u,i}) + vec(x_i) + vec(t_i);$$

wherein $vec(a_{u,i})$ is the embedded representation of the behavior $a_{u,i}$ by which user u interacts with item i, $vec(x_i)$ is the item embedding fusing knowledge graph and user interaction features, $vec(t_i)$ is the time code of the behavior, and "+" denotes vector concatenation; the user-behavior sequence is then calculated according to the following formula:

in the formula, $d_x$ represents the dimension size and |L| represents the length of the user-behavior sequence;

then a fully connected layer converts the user-behavior sequence $L^u$ into the user-behavior latent semantic sequence $B^u$;

the specific conversion formula is:

$$B^u = \sigma_\tau(W_\tau L^u + b_\tau);$$

wherein $W_\tau$ and $b_\tau$ are the weights and bias of the fully connected layer, and $\sigma_\tau$ is the activation function.
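The concatenation and the fully connected mapping of step S6.6 can be sketched as below. The vectors are made-up toy values, the weights are random rather than trained, and tanh stands in for the unspecified activation $\sigma_\tau$; all of these are assumptions for illustration.

```python
import math
import random

def dense_layer(row, W, b):
    """One fully connected layer; tanh is an assumed activation."""
    return [math.tanh(sum(w * x for w, x in zip(w_row, row)) + b_k)
            for w_row, b_k in zip(W, b)]

# Hypothetical parts for two historical interactions of one user.
behavior_vecs = [[0.0, 0.3, 0.0, 1.0], [0.1, 0.0, 0.0, 0.0]]  # vec(a_{u,i})
program_vecs = [[0.2, -0.1, 0.4], [0.0, 0.5, -0.3]]           # vec(x_i)
time_codes = [[0, 1, 0], [1, 0, 0]]                           # vec(t_i)

# u_i concatenates the three parts; L_u stacks every u_i of the user.
L_u = [a + x + t for a, x, t in zip(behavior_vecs, program_vecs, time_codes)]
d_x = len(L_u[0])
K = 4  # dimension of the latent semantic space

rng = random.Random(0)
W_tau = [[rng.uniform(-0.5, 0.5) for _ in range(d_x)] for _ in range(K)]
b_tau = [0.0] * K

# The same layer is applied to every position of the sequence.
B_u = [dense_layer(u_i, W_tau, b_tau) for u_i in L_u]
```

`B_u` has one K-dimensional row per historical behavior, which is the S x K shape consumed by the attention layers of step S7.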

In order to better implement the present invention, further, the step S7 specifically includes the following steps:

step S7.1: calculating the latent semantic features U of the user in matrix form; the specific calculation method is:

wherein Self_Attention denotes the self-attention operation and $B^u$ is the user-behavior latent semantic sequence; with sequence length S and embedding dimension K per item, $B^u \in \mathbb{R}^{S \times K}$; in the formula, Softmax is the function that maps the generated attention scores to values between 0 and 1, and the constant term is a self-set constant used to avoid excessively large values;
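Since the formula of step S7.1 is not reproduced in this text, the sketch below uses the standard scaled dot-product self-attention as a stand-in: queries, keys and values are all taken to be $B^u$ without learned projections, a single head is used, and the scaling constant is assumed to be $\sqrt{K}$. These are assumptions, not the patent's exact computation.

```python
import math

def softmax(scores):
    """Map raw scores to weights in (0, 1) that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(B):
    """Single-head scaled dot-product self-attention over an S x K sequence."""
    K = len(B[0])
    scale = math.sqrt(K)  # assumed constant term
    U = []
    for query in B:
        weights = softmax([sum(a * b for a, b in zip(query, key)) / scale
                           for key in B])
        U.append([sum(w * value[d] for w, value in zip(weights, B))
                  for d in range(K)])
    return U

B_u = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
U = self_attention(B_u)
```

Each output row is a weighted average of all behavior rows, so it mixes in information from the behaviors most related to that position.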

step S7.2: calculating the semantic preference $B_{emb}$ of the user in matrix form using the ordinary attention mechanism; the specific calculation formula is:

wherein Attention denotes the ordinary attention mechanism, U is the latent semantic feature of the user with $U \in \mathbb{R}^{S \times K}$, the constant term serves to avoid excessively large values, and P is the embedding of the target program, which fuses the knowledge graph and user interaction features.
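The decoding of step S7.2 can be illustrated in the same hedged way: the target program embedding P is assumed to act as the query and the rows of U as keys and values, with $\sqrt{K}$ as the scaling constant. The formula is not reproduced in this text, so this is a sketch of ordinary target attention rather than the exact patented form.

```python
import math

def softmax(scores):
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def target_attention(P, U):
    """Weight each row of U by its scaled dot product with the target P."""
    K = len(P)
    weights = softmax([sum(p * u for p, u in zip(P, row)) / math.sqrt(K)
                       for row in U])
    B_emb = [sum(w * row[d] for w, row in zip(weights, U)) for d in range(K)]
    return B_emb, weights

U = [[1.0, 0.0], [0.0, 1.0]]   # latent semantic features of two behaviors
P = [10.0, 0.0]                # hypothetical target program embedding
B_emb, weights = target_attention(P, U)
```

Because P aligns with the first behavior, that behavior dominates `B_emb`; a different target program would yield a different preference vector from the same history.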

In order to better implement the present invention, in step S7, after step S7.2 is performed, the following operations are performed:

step S7.3: performing nonlinear processing: after the self-attention and ordinary attention calculations, a point-wise feed-forward network calculation is added, specifically:

the point-wise feed-forward network added after the self-attention mechanism is calculated as:

$$U = Normalize(Conv1D(Conv1D(U)) + U);$$

wherein Normalize is a normalization operation used to address the vanishing gradient problem, and Conv1D denotes a one-dimensional convolutional network; U is nonlinearly mapped twice through the two convolutional layers; U on the left side of the equation is the user's latent semantic feature after the point-wise feed-forward network, and U on the right side is the user's latent semantic feature before the point-wise feed-forward network;

the point-wise feed-forward network added after the ordinary attention mechanism has the same structure as the one added after the self-attention mechanism;

a residual connection, which prevents loss of the original information, is added after the outputs of the multi-head attention mechanism and of the point-wise feed-forward network, and normalization is applied after the residual connection;
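A minimal sketch of the block Normalize(Conv1D(Conv1D(U)) + U) follows. It assumes kernel size 1 for both convolutions (so each is the same linear map applied at every position), a ReLU after the first convolution, and layer normalization without learned scale or shift for Normalize; none of these details are stated in the text.

```python
import math

def conv1d_k1(X, W, b):
    """Kernel-size-1 convolution: one linear map shared by all positions."""
    return [[sum(w * x for w, x in zip(w_row, pos)) + b_k
             for w_row, b_k in zip(W, b)] for pos in X]

def relu(X):
    return [[max(0.0, v) for v in row] for row in X]

def layer_norm(X, eps=1e-6):
    """Normalize every position to zero mean and unit variance."""
    out = []
    for row in X:
        mean = sum(row) / len(row)
        var = sum((v - mean) ** 2 for v in row) / len(row)
        out.append([(v - mean) / math.sqrt(var + eps) for v in row])
    return out

def feed_forward_block(U, W1, b1, W2, b2):
    hidden = relu(conv1d_k1(U, W1, b1))
    projected = conv1d_k1(hidden, W2, b2)
    # Residual connection keeps the original information of U.
    summed = [[p + u for p, u in zip(p_row, u_row)]
              for p_row, u_row in zip(projected, U)]
    return layer_norm(summed)

identity = [[1.0, 0.0], [0.0, 1.0]]
zero_bias = [0.0, 0.0]
U = [[1.0, 2.0], [3.0, 5.0]]
U_out = feed_forward_block(U, identity, zero_bias, identity, zero_bias)
```

With identity weights the block reduces to normalizing 2U, which makes the effect of the residual connection and the normalization easy to check by hand.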

step S7.4: after the user's semantic preference vector is obtained, the probability that user u interacts with candidate item v is calculated through a prediction function g; the specific calculation formula is:

$$p_{u,v} = \sigma(g(B_{emb}, \tau_v));$$

the prediction function g is an inner product or an L-layer perceptron; $B_{emb}$ is the semantic preference of user u, and $\tau_v$ is the embedding of candidate program v, which fuses the knowledge graph and user interaction features.
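For the inner-product choice of g, the prediction of step S7.4 reduces to a sigmoid over a dot product, as sketched below. The candidate names and embeddings are made up; the perceptron variant of g is not shown.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(B_emb, tau_v):
    """p_{u,v} with g chosen as the inner product."""
    return sigmoid(sum(b * t for b, t in zip(B_emb, tau_v)))

B_emb = [0.8, -0.2]                       # semantic preference of user u
candidates = {                            # hypothetical program embeddings
    "news": [1.0, 0.0],
    "drama": [-1.0, 0.5],
}
scores = {name: predict(B_emb, emb) for name, emb in candidates.items()}
ranking = sorted(scores, key=scores.get, reverse=True)
```

Sorting candidates by `p_{u,v}` gives the recommendation order for the user.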

In order to better implement the present invention, further, after step S7 is performed, the following steps are also required:

step S8: model training and optimization: training and optimizing an algorithm model by adopting a sigmoid cross entropy loss function, wherein a specific calculation formula is as follows:

where u denotes all instances of training users, $y_i = 1$ denotes a positive example, i.e. the user has interacted with the program, and $y_i = 0$ denotes a negative example, i.e. the user has not interacted with the program; $p_{u,v} \in [0, 1]$ is the output of the model and represents the user's click-through rate; optimizing the model means minimizing the above loss function.
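The loss formula is not reproduced in this text; the sketch below uses the usual binary (sigmoid) cross-entropy over predicted probabilities. Averaging over the examples and clipping with `eps` are assumptions made for the example.

```python
import math

def sigmoid_cross_entropy(labels, probs, eps=1e-12):
    """Mean of -[y*log(p) + (1-y)*log(1-p)] over all training examples."""
    total = 0.0
    for y, p in zip(labels, probs):
        p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(labels)

labels = [1, 0, 1]                 # 1: interacted, 0: not interacted
good = sigmoid_cross_entropy(labels, [0.9, 0.1, 0.8])
bad = sigmoid_cross_entropy(labels, [0.2, 0.7, 0.3])
```

Predictions that agree with the labels give a smaller loss, so minimizing it pushes $p_{u,v}$ toward 1 for positive examples and toward 0 for negative ones.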

In order to better implement the present invention, further, the specific operations of step S1 are:

structured data is formed through entity alignment by crawling network resources about radio and television programs, and ontology construction is completed using the ontology modeling tool Protégé; after the ontology is constructed, the data originally stored in the relational database is converted into the corresponding RDF format using D2RQ and then stored in a graph database, completing the construction of the knowledge graph in the field of broadcasting and television.

In order to better implement the invention, further, the knowledge graph in the field of broadcasting and television is constructed top-down: the data schema is built starting from the topmost concepts and gradually refined downward, forming a taxonomy with a clear structure and clear logic and completing the ontology construction; in the ontology, the concepts in the concept layer inherit from the root type and include region, person, genre, program and language; the top-level object properties include program genre, program origin, person's home country, participation, actor, director, screenwriter and program language; the top-level data properties include region number, region name, person's birthday, person's foreign-language name, person's gender, person number, person name, genre number, genre name, program number, program rating, program release date, program introduction, program name, language number and language name.

In order to better implement the present invention, further, the specific operations of step S2 are: dividing the behavior interaction data of a user into continuous microscopic behaviors and discrete microscopic behaviors according to behavior duration; continuous microscopic behaviors are user behaviors that last for a certain time and comprise live viewing, on-demand viewing and search viewing; discrete microscopic behaviors are user behaviors that occur only at a single moment and comprise purchasing, favoriting and liking; the data recorded by the back end of the radio and television system is collected into structured data recording the user number, media asset number, behavior type, behavior timestamp and behavior duration, wherein the behavior duration of a continuous microscopic behavior is the corresponding valid value and the behavior duration of a discrete microscopic behavior is null.
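The structured record described in step S2 could be modelled as in the sketch below. The field names, units and sample values are hypothetical; only the five recorded fields and the null duration for discrete behaviors come from the text.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class BehaviorRecord:
    user_id: str
    asset_id: str              # media asset number
    behavior_type: str
    timestamp: int             # assumed to be Unix seconds
    duration: Optional[float]  # seconds; None (null) for discrete behaviors

def split_behaviors(records):
    """Separate continuous from discrete behaviors by their duration."""
    continuous = [r for r in records if r.duration is not None]
    discrete = [r for r in records if r.duration is None]
    return continuous, discrete

records = [
    BehaviorRecord("u1", "m100", "live", 1610000000, 1800.0),
    BehaviorRecord("u1", "m100", "like", 1610001800, None),
    BehaviorRecord("u1", "m200", "vod", 1610003600, 2400.0),
]
continuous, discrete = split_behaviors(records)
```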

Compared with the prior art, the invention has the following advantages and beneficial effects:

(1) by walking on attribute subgraphs, programs that are associated through a given attribute are more likely to be linked into a path by the walk, so the extracted program sequences better reflect attribute-based similarity between programs. Node2vec has two modes, breadth-first traversal and depth-first traversal, and can capture both the homophily and the structural equivalence between entities, which makes the recommended programs more relevant at an implicit level;

(2) considering that the behavior patterns of the users in each session keep statistical similarity, dividing the session time window according to the behavior pattern similarity of the users in a certain time period, so that the time window of each user is divided based on the behavior pattern of the user, and the consistency of the interactive behavior is fully considered;

(3) the item embedding obtained from sequences extracted through the knowledge graph attribute subgraphs considers the content similarity of items at the item attribute level, while the item embedding obtained from session-based user interaction sequences considers the similarity of items at the interaction context level; fusing the two makes up for the fact that sequence embedding takes little account of item content information, and strengthens the personalization of interaction sequence sessions;

(4) the microscopic behaviors of users are divided into continuous behaviors and discrete behaviors, vectorized differently, and then fused and projected into a common latent semantic space for recommendation, so that the algorithm can capture the influence of different behaviors on user preference. On the one hand, discrete behaviors such as favoriting, liking and purchasing reflect different degrees of user interest from continuous behaviors such as live viewing and on-demand viewing; the behaviors differ in nature and need different vectorization. On the other hand, for continuous behaviors such as live viewing and on-demand viewing, different behavior durations also reflect different degrees of user interest. Analyzing the user's microscopic behaviors both qualitatively and quantitatively and fusing the resulting vectors reflects the differences in user interest implied by the user's different microscopic feedback behaviors toward programs;

(5) by calculating the weight of the microscopic behaviors in the existing scene, the user interest degree of the microscopic behaviors based on the property level can be obtained, and the vectorization of the microscopic behaviors obtained based on the property level is corrected;

(6) time coding is introduced, coding is carried out by considering the difference between the interaction time of the target program and the program interaction time in the historical interaction sequence, and time drift of the sequence can be well modeled;

(7) the finally obtained semantic preference of the user implies attribute-based information of the item side in the historical interaction of the user, session context-based information, interactive microscopic behavior information and interactive time information, and well describes the dynamic preference of the user;

(8) the model is trained and optimized through the sigmoid cross-entropy loss function, which makes the algorithm more accurate.

Drawings

FIG. 1 is a schematic flow diagram of the present invention;

FIG. 2 is a conceptual schematic diagram of a knowledge-graph according to the present invention.

Detailed Description

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it should be understood that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments, and therefore should not be considered as a limitation to the scope of protection. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.

In the description of the present invention, it is to be noted that, unless otherwise explicitly specified or limited, the terms "disposed," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.

Example 1:

the invention provides a radio and television program recommendation method based on a knowledge graph and user microscopic behaviors, which specifically comprises the following steps as shown in figure 1:

step S3: extracting the attribute subgraph of the knowledge graph in the broadcasting and television field constructed in the step S1, and extracting a random walk program sequence Q according to the attribute subgraph in a random walk mode; extracting a random walk sequence Q from one attribute subgraph, wherein the random walk sequences Q of all the attribute subgraphs form an item sequence set H_k；

step S7: performing user dynamic preference learning based on a Transformer mechanism: the Transformer mechanism comprises a self-attention mechanism and an ordinary attention mechanism, both based on multi-head attention; the self-attention mechanism learns the relevance among the items in the user-behavior latent semantic sequence to obtain the latent semantic features of the user, the ordinary attention mechanism then decodes the latent semantic features of the user according to the target program to obtain the semantic preference of the user, and programs are recommended to the user according to that semantic preference;

step S8: model training and optimization: and training and optimizing an algorithm model by adopting a sigmoid cross entropy loss function.

The working principle is as follows: the invention provides a radio and television content recommendation method based on a knowledge graph and user microscopic behaviors, which mainly comprises three parts: program embedding, user micro-behavior embedding, and user dynamic preference learning. The item2vec and the random walk are combined, a random walk sequence obtained by a knowledge map attribute subgraph and a user behavior sequence based on conversation are trained together, and an embedded vector obtained by training combines the similarity of two levels of program content attributes and user interaction conversation. Then, under the condition of obtaining program embedding and classified fusion microscopic behavior embedding, program embedding and splicing corresponding behaviors according to historical interaction records of users and programs, semantic representation of each behavior-program with the same dimensionality is obtained through semantic space network mapping, and an embedding sequence of historical behaviors of the users is formed. Wherein, in consideration of the time sequence of the user behavior sequence, the time coding is also introduced at the user behavior embedding layer in the subsequent embodiment. And finally, performing self-attention coding mapping on the historical behavior sequence of the user based on a transform coding-decoding mechanism to obtain user semantic features with implicit dynamic preference, and decoding the user semantic feature mapping by using the attention of the target program to obtain a user semantic expression vector. And calculating the product of the user semantic expression vector obtained by the text and the target program to predict the click rate. Meanwhile, in step S5, the program sequences obtained in step S3 and step S4 are combined together to be used as the input of item2vec, and finally the embedding vector of the item is obtained. 
The item embedding obtained from sequences extracted on the knowledge graph attribute subgraphs captures content similarity between items at the attribute level, while the item embedding obtained from session-based user interaction sequences captures similarity between items at the interaction-context level. Fusing the two compensates for the fact that sequence embedding alone takes little account of item content information, and strengthens the personalization of the session-based interaction sequences.
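The joint training input described above can be sketched as follows. This is a minimal illustration rather than the patented implementation: the program IDs, the window size and the helper name `skipgram_pairs` are assumptions. It only shows how the attribute-subgraph walk sequences and the session sequences are merged into one corpus and turned into the (center, context) pairs on which a skip-gram model such as item2vec is trained.

```python
from typing import List, Tuple


def skipgram_pairs(sequences: List[List[str]], window: int = 2) -> List[Tuple[str, str]]:
    """Return (center, context) pairs for items at most `window` positions apart."""
    pairs = []
    for seq in sequences:
        for i, center in enumerate(seq):
            lo = max(0, i - window)
            hi = min(len(seq), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    pairs.append((center, seq[j]))
    return pairs


# Hypothetical program IDs, for illustration only.
walk_sequences = [["p1", "p2", "p3"]]         # step S3: walks on attribute subgraphs
session_sequences = [["p2", "p4"]]            # step S4: session-based interaction sequences
corpus = walk_sequences + session_sequences   # step S5: merged input for item2vec
pairs = skipgram_pairs(corpus, window=1)
```

Because both kinds of sequence share one corpus, a program such as `p2` gains context items from its attribute neighbors and from its session neighbors, which is the two-level similarity the text describes.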

Example 2:

in this embodiment, on the basis of the above embodiment 1, in order to better implement the present invention, as shown in fig. 2, the specific operations are as follows:

Step S1: constructing a knowledge graph for the radio and television field: network resources about radio and television programs are crawled and turned into structured data through entity alignment, and the ontology is built with the ontology modeling tool Protégé; after the ontology is built, the data originally stored in a relational database is converted into the corresponding RDF format using D2RQ and then stored in a graph database, which completes the construction of the radio and television knowledge graph.
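Conceptually, the relational-to-RDF conversion in step S1 turns each table row into a set of subject-predicate-object triples. The sketch below illustrates that idea only; it is not D2RQ's API or mapping language, and the namespace `http://example.org/tv/`, the table name and the column names are all hypothetical.

```python
from typing import Dict, List

BASE = "http://example.org/tv/"  # hypothetical namespace, not from the patent


def row_to_ntriples(table: str, key: str, row: Dict[str, str]) -> List[str]:
    """Emit one N-Triples line per non-key column of a relational row."""
    subject = f"<{BASE}{table}/{row[key]}>"
    triples = []
    for column, value in row.items():
        if column == key:
            continue
        predicate = f"<{BASE}{table}#{column}>"
        # Minimal literal escaping (backslash and double quote only).
        escaped = value.replace("\\", "\\\\").replace('"', '\\"')
        triples.append(f'{subject} {predicate} "{escaped}" .')
    return triples


# Hypothetical row of a "program" table.
program_row = {"program_id": "1001", "program_name": "Evening News", "language": "Chinese"}
triples = row_to_ntriples("program", "program_id", program_row)
```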

Other parts of this embodiment are the same as those of embodiment 1, and thus are not described again.

Example 3:

in this embodiment, on the basis of any one of the above embodiments 1-2, in order to better implement the present invention, further, the specific operation of step S2 is: dividing the user's behavior interaction data into continuous microscopic behaviors and discrete microscopic behaviors according to behavior duration; continuous microscopic behaviors are user behaviors that last for a certain time, and comprise live watching, on-demand watching and search watching; discrete microscopic behaviors are user behaviors that occur only at a single moment, and comprise purchasing, favoriting and liking; the data captured by the back end of the radio and television system is collected to form structured data recording the user number, media asset number, behavior type, behavior timestamp and behavior duration, wherein the behavior duration of a continuous microscopic behavior is the corresponding valid value, and the behavior duration of a discrete behavior type is null.
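The structured record described in step S2 can be sketched as a small data class. The field names, the behavior-type labels and the time units below are assumptions chosen for illustration; the text only fixes the five recorded fields and the rule that discrete behaviors carry a null duration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical labels for the behavior types named in the text.
CONTINUOUS = {"live_watch", "vod_watch", "search_watch"}
DISCRETE = {"purchase", "favorite", "like"}


@dataclass
class BehaviorRecord:
    user_id: str
    asset_id: str
    behavior_type: str
    timestamp: int              # Unix seconds (assumed unit)
    duration: Optional[float]   # seconds for continuous behaviors, None for discrete ones


def make_record(user_id, asset_id, behavior_type, timestamp, duration=None) -> BehaviorRecord:
    """Build a record, enforcing the duration rule for each behavior class."""
    if behavior_type in CONTINUOUS:
        if duration is None or duration <= 0:
            raise ValueError("continuous behaviors need a valid duration")
        return BehaviorRecord(user_id, asset_id, behavior_type, timestamp, float(duration))
    if behavior_type in DISCRETE:
        # Discrete behaviors happen at one moment, so any duration is discarded.
        return BehaviorRecord(user_id, asset_id, behavior_type, timestamp, None)
    raise ValueError(f"unknown behavior type: {behavior_type}")


watch = make_record("u1", "m42", "vod_watch", 1610323200, 1800)
like = make_record("u1", "m42", "like", 1610325000, 999)
```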

Other parts of this embodiment are the same as any of embodiments 1-2 described above, and thus are not described again.

Example 4:

in this embodiment, on the basis of any one of the foregoing embodiments 1 to 3, in order to better implement the present invention, further, the specific operation of step S3 includes:

Step S3.2: a random walk is carried out in each attribute subgraph to generate a random walk sequence Q, and all the random walk sequences Q together form an item sequence set H_k = {Q₁, Q₂, ..., Q_n}, where Q_i = {x₁, x₂, ..., x_n} denotes a generated random walk sequence, i = 1, 2, 3, ..., n;

∏_vx＝α_pq(t，x)·w_vx；

wherein the coefficient α_pq(t, x) is calculated by the following formula:

The working principle is as follows: using the knowledge graph from step S1, an attribute subgraph is extracted for each attribute, and program walk sequences are extracted on each attribute subgraph by Node2vec random walk. Walking on attribute subgraphs makes it easier for programs that are related through a given attribute to be linked into a path, so the extracted program sequences better reflect attribute-based similarity between programs. Node2vec supports both breadth-first and depth-first traversal, and can therefore capture both homophily and structural equivalence between entities.
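The biased walk on an attribute subgraph can be sketched as below. The formula image for α_pq(t, x) is not reproduced in this text, so the sketch assumes the standard Node2vec search bias (1/p when d_tx = 0, 1 when d_tx = 1, 1/q when d_tx = 2), which is consistent with the symbols p, q and d_tx used in the description. The graph, the program IDs and the parameter values are hypothetical.

```python
import random
from typing import Dict, List

Graph = Dict[str, Dict[str, float]]  # node -> {neighbor: edge weight w_vx}


def alpha(graph: Graph, t: str, x: str, p: float, q: float) -> float:
    """Assumed Node2vec search bias, based on the shortest distance d_tx."""
    if x == t:           # d_tx = 0: step back to the previous node
        return 1.0 / p
    if x in graph[t]:    # d_tx = 1: x is also a neighbor of t
        return 1.0
    return 1.0 / q       # d_tx = 2: step away from t


def node2vec_walk(graph: Graph, start: str, length: int, p: float, q: float,
                  rng: random.Random) -> List[str]:
    """Sample one walk using unnormalized probabilities alpha_pq(t, x) * w_vx."""
    walk = [start]
    while len(walk) < length:
        v = walk[-1]
        neighbors = sorted(graph[v])
        if not neighbors:
            break
        if len(walk) == 1:
            weights = [graph[v][x] for x in neighbors]   # no previous node yet
        else:
            t = walk[-2]
            weights = [alpha(graph, t, x, p, q) * graph[v][x] for x in neighbors]
        walk.append(rng.choices(neighbors, weights=weights, k=1)[0])
    return walk


# Hypothetical attribute subgraph with unit edge weights.
subgraph = {
    "p1": {"p2": 1.0, "p3": 1.0},
    "p2": {"p1": 1.0, "p3": 1.0},
    "p3": {"p1": 1.0, "p2": 1.0, "p4": 1.0},
    "p4": {"p3": 1.0},
}
walk = node2vec_walk(subgraph, "p1", 5, p=2.0, q=0.5, rng=random.Random(0))
```

`rng.choices` normalizes the weights internally, which plays the role of the normalization constant z. With q < 1 the walk leans toward depth-first exploration, and with q > 1 it stays closer to the previous node.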

Other parts of this embodiment are the same as any of embodiments 1 to 3, and thus are not described again.

Example 5:

in this embodiment, on the basis of any one of the foregoing embodiments 1 to 4, in order to better implement the present invention, further, the step S4 specifically includes the following steps:

Step S4.1: constructing a time session-behavior type matrix: assuming that the length of a user's historical interaction sequence is h, the number of behavior types is n, and the number of programs interacted with per session is k, the user's historical interaction record contains m = h/k sessions, and the time period of each session is t_i, i = 1, 2, ..., m; a time session-behavior type matrix TSA can then be formed as:

namely: TSA = {x_ij}_(m×n), 1 ≤ i ≤ m, 1 ≤ j ≤ n;

The specific calculation formula is as follows:

wherein a is a behavior type, A is a behavior type set,

represents the frequency with which the user generates behavior type a in session s during time period t_i, which is equivalent to x_ij in the TSA matrix;

The working principle is as follows: using the user interaction data from step S2, session-based sequences of programs the user interacted with are extracted. As the user interacts with program content, the viewing history forms a sequence that advances in time and keeps growing. Using the user's entire interaction sequence to capture the interaction context of an item is clearly undesirable: first, the user's interests change over time; second, the computational and storage costs of large amounts of data are too high. Since a user's interests are stable within a certain period and the items interacted with in that period are more strongly correlated, the user interaction sequence is extracted on a session basis. The usual solution is to set a fixed window, defined either by a fixed time length or by a fixed number of interacted items, slide it over the user's behavior interaction sequence, and extract only the items inside the window each time. In practice, however, users do not all interact equally densely, so the window size should differ from user to user. Because a user's behavior patterns remain statistically similar across sessions, the session time windows are divided according to the similarity of the user's behavior patterns over a given period; each user's window is thus derived from that user's own behavior pattern, which takes full account of the consistency of the interactive behavior.
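The session-length selection described above can be sketched as follows. The formula images for x_ij and for the session similarity are not reproduced in this text, so two assumptions are made and labeled in the code: x_ij is taken as the share of behavior type j among the k interactions of session i, and the similarity between two sessions is taken as the cosine of their TSA rows. The function names, the behavior labels and the threshold value are hypothetical.

```python
import math
from typing import List, Optional


def tsa_matrix(behaviors: List[str], types: List[str], k: int) -> List[List[float]]:
    """Row i holds the frequency of each behavior type in session i (assumed: count / k)."""
    rows = []
    for start in range(0, len(behaviors) - k + 1, k):
        session = behaviors[start:start + k]
        rows.append([session.count(a) / k for a in types])
    return rows


def cosine(u: List[float], v: List[float]) -> float:
    """Cosine similarity; an assumed stand-in for the patent's similarity formula."""
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return sum(a * b for a, b in zip(u, v)) / norm if norm else 0.0


def similarity_sum(tsa: List[List[float]]) -> float:
    """Accumulated similarity over all session pairs i < j."""
    return sum(cosine(tsa[i], tsa[j])
               for i in range(len(tsa)) for j in range(i + 1, len(tsa)))


def choose_session_length(behaviors, types, candidates, threshold) -> Optional[int]:
    """Return the first candidate k whose accumulated similarity exceeds the threshold Y."""
    for k in candidates:
        if similarity_sum(tsa_matrix(behaviors, types, k)) > threshold:
            return k
    return None


# Hypothetical history: the user repeats "watch, watch, like" four times (h = 12).
history = ["vod", "vod", "like"] * 4
types = ["vod", "like"]
chosen = choose_session_length(history, types, candidates=[4, 3], threshold=5.5)
```

In this toy history a session length of 3 aligns with the user's repeating pattern, so all four sessions have identical TSA rows and the accumulated similarity is highest for that split.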

Other parts of this embodiment are the same as any of embodiments 1 to 4, and thus are not described again.

Example 6:

in this embodiment, on the basis of any one of the above embodiments 1 to 5, in order to better implement the present invention, further, the specific operation of step S6 is:

step S6.1: calculating the user microscopic behavior weight:

First, let N be the number of user microscopic behavior types obtained in step S2, comprising both continuous and discrete microscopic behaviors; the total number of occurrences of each microscopic behavior type is denoted A₁, A₂, …, A_N respectively;

Second, the normalized weights a₁, a₂, …, a_N corresponding to each user microscopic behavior type are calculated; the specific calculation formula is as follows:

Step S6.2: obtaining the type vector vec(a_g) of the user's microscopic behavior: for user microscopic behaviors, one-hot encoding is used to obtain a vector representation for each implicit feedback behavior, with the following formula:

vec(a_g) ∈ R^|A|, g = 1, 2, …, |A|

vec(a_u，i)＝a_i·vec(a_g)+vec(a_d)

Then the timestamps of the historical behavior interaction sequence are extracted as T = [t₁, t₂, t₃, ...], and the timestamp at which the target program is clicked is denoted t_p;

for discrete microscopic behavior: discretized time intervals are defined, denoted [0, 1), [1, 2), [2, 4), ..., [2^k, 2^(k+1)), ...; each time interval is in units of hours; each item of T_Δ is mapped into a discrete time interval and then into a one-hot code, giving the time code vec(t_i) of the item interaction behavior. For example, if the time interval between the target program and a historical interactive program is 0.5 hours, the time code of that historical interactive program is [1, 0, 0, ..., 0].
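The interval-to-one-hot mapping described above can be sketched as follows. The function names and the number of buckets are assumptions; clipping very long gaps into the last bucket is also an assumption, since the text leaves the upper end of the interval list open.

```python
from typing import List


def time_bucket(delta_hours: float) -> int:
    """Index of the interval [0,1), [1,2), [2,4), ..., [2^k, 2^(k+1)) containing delta_hours."""
    if delta_hours < 0:
        raise ValueError("time interval must be non-negative")
    if delta_hours < 1:
        return 0
    # int() truncates, so 1 <= d < 2 has bit_length 1, 2 <= d < 4 has 2, 4 <= d < 8 has 3, ...
    return int(delta_hours).bit_length()


def time_code(delta_hours: float, num_buckets: int) -> List[int]:
    """One-hot time code; gaps beyond the last interval are clipped into it (assumption)."""
    index = min(time_bucket(delta_hours), num_buckets - 1)
    return [1 if i == index else 0 for i in range(num_buckets)]


code = time_code(0.5, 8)  # the 0.5-hour example from the text
```

The exponentially growing intervals give fine resolution to recent interactions and coarse resolution to old ones, so a small vector covers a long history.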

u_i＝vec(a_u，i)+vec(x_i)+vec(t_i)；

Then, one fully connected layer is used to convert the user behavior sequence L^u into the user-behavior latent semantic sequence B^u.

The specific conversion formula is as follows:

B^u＝σ_τ(W_τL+b_τ)；
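The last two formulas can be sketched for a single history element as below. Several points are assumptions: the "+" in the u_i formula is treated as vector concatenation (the claims state this explicitly only for the vec(a_u,i) formula), the activation σ_τ is not specified in the text and tanh is used as a stand-in, and the weights are random placeholders rather than trained parameters. All vector values are hypothetical.

```python
import math
import random
from typing import List


def concat(*vectors: List[float]) -> List[float]:
    """Join vectors end to end (assumed meaning of '+' in the u_i formula)."""
    out: List[float] = []
    for v in vectors:
        out.extend(v)
    return out


def dense(x: List[float], weights: List[List[float]], bias: List[float]) -> List[float]:
    """One fully connected layer: activation(W x + b), with tanh as an assumed activation."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(weights, bias)]


rng = random.Random(0)
behavior_vec = [0.6, 0.0, 0.0, 1.0]   # vec(a_u,i): hypothetical values
item_vec = [0.1, -0.2, 0.3]           # vec(x_i): hypothetical program embedding
time_vec = [1.0, 0.0, 0.0]            # vec(t_i): hypothetical one-hot time code

u_i = concat(behavior_vec, item_vec, time_vec)
K = 4                                  # latent semantic dimension (assumed)
W = [[rng.uniform(-0.1, 0.1) for _ in u_i] for _ in range(K)]  # untrained placeholder weights
b = [0.0] * K
b_i = dense(u_i, W, b)                 # one row of B^u
```

Applying the same layer to every u_i in L^u gives every element of the sequence the same dimension K, which is what lets the attention layers treat the history as a uniform sequence.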

The working principle is as follows: in radio and television recommendation scenarios, users do not give explicit ratings; the user-item interactions collected by the back-end system are a series of microscopic feedback behaviors (such as browsing, watching and favoriting) that are heterogeneous, ambiguous and dynamic. Given these characteristics, and unlike methods that simply assign intuitive weights to behaviors or convert them into ratings, the method divides the user's microscopic behaviors into continuous and discrete behaviors, vectorizes each kind differently, and then fuses and projects them into a common latent semantic space for recommendation, so that the algorithm can capture the influence of different behaviors on user preference. On the one hand, discrete behaviors such as favoriting, liking and purchasing reflect a different degree of user interest from continuous behaviors such as live watching and on-demand watching; the two kinds of behavior differ in nature and need different vectorization. On the other hand, for continuous behaviors such as live watching and on-demand watching, different behavior durations also reflect different degrees of user interest. Analyzing the user's microscopic behaviors both qualitatively and quantitatively and fusing the resulting vectors reflects the differences in user interest implied by the different microscopic feedback behaviors a user performs on a program. Calculating the weights of the microscopic behaviors in the existing scenario gives the degree of user interest of each microscopic behavior at the type level, and this weight is used to correct the type-based vectorization of the microscopic behaviors.
Time encoding is introduced: the encoding is based on the difference between the interaction time of the target program and the interaction time of each program in the historical interaction sequence, which models the time drift of the sequence well. For continuous behaviors such as on-demand watching and live watching, the proportion of the program's total duration taken up by the user's behavior is divided into ten levels, [0, 0.1), [0.1, 0.2), ..., [0.9, 1], and one-hot encoding is then used to obtain a vector representation of the behavior duration over these ten levels, namely vec(a_d) ∈ R¹⁰. For continuous behaviors, different duration vectors represent the different degrees of interest implied by how long the behavior lasts: the longer a program is watched, the more interested the user is in it. For discrete behaviors such as favoriting and liking, an all-zero vector of the same length is used as a placeholder for the duration representation, namely vec(a_d) = [0, 0, 0, ..., 0]. For discrete behaviors, the measure of user preference is the behavior type itself; for example, favoriting and liking a program represent different degrees of user interest in it. The all-zero vector carries no extra information, and the placeholder alignment simplifies subsequent model training.
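The microscopic behavior vector vec(a_u,i) = a_i·vec(a_g) + vec(a_d), with "+" read as concatenation, can be sketched as below. The formula image for the normalized weights is not reproduced in this text, so the sketch assumes a_i = A_i / ΣA purely as a placeholder; the patent's actual normalization may differ. The behavior labels and counts are hypothetical.

```python
from typing import Dict, List, Optional


def normalized_weights(counts: Dict[str, int]) -> Dict[str, float]:
    """Assumed normalization a_i = A_i / sum(A); a placeholder for the missing formula."""
    total = sum(counts.values())
    return {a: c / total for a, c in counts.items()}


def duration_vector(watched: Optional[float], total: Optional[float]) -> List[int]:
    """Ten-level one-hot of watched/total; all zeros for discrete behaviors."""
    vec = [0] * 10
    if watched is None or not total:
        return vec
    ratio = min(max(watched / total, 0.0), 1.0)
    vec[min(int(ratio * 10), 9)] = 1   # a ratio of exactly 1.0 falls into the last level [0.9, 1]
    return vec


def behavior_vector(behavior: str, types: List[str], weights: Dict[str, float],
                    watched: Optional[float] = None,
                    total: Optional[float] = None) -> List[float]:
    """Weighted one-hot type vector concatenated with the duration vector."""
    type_part = [weights[behavior] if a == behavior else 0.0 for a in types]
    return type_part + duration_vector(watched, total)


counts = {"vod_watch": 60, "like": 30, "purchase": 10}   # hypothetical totals A_1..A_N
types = list(counts)
weights = normalized_weights(counts)
v_watch = behavior_vector("vod_watch", types, weights, watched=1350, total=1800)
v_like = behavior_vector("like", types, weights)
```

Both vectors have the same length (3 + 10 here), so continuous and discrete behaviors can be fed to the same downstream layer, which is the alignment role the all-zero placeholder plays in the text.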

Other parts of this embodiment are the same as any of embodiments 1 to 5, and thus are not described again.

Example 7:

in this embodiment, on the basis of any one of the foregoing embodiments 1 to 6, in order to better implement the present invention, further, the step S7 specifically includes the following steps:

Step S7.2: using a common attention mechanism, the user semantic preference B_emb is computed in matrix form; the specific calculation formula is as follows:

U＝Normalize(Conv1D(Conv1D(U))+U)；

p_u，v＝σ(g(B_emb，τ_v))；

The working principle is as follows: the user semantic preference incorporates the item-side attribute-based information, the session-context information, the interactive microscopic behavior information and the interaction time information contained in the user's historical interactions, and therefore describes the user's dynamic preference well.
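The encode-then-decode flow of step S7 can be sketched as below. This is a simplified illustration under stated assumptions, not the patented network: the attention uses scaled dot products without learned projection matrices, the Conv1D and normalization layers are omitted, and the scoring function g is taken to be an inner product, following the description's statement that the click-through rate is predicted from the product of the user vector and the target program. All vectors are hypothetical.

```python
import math
from typing import List

Vector = List[float]


def dot(u: Vector, v: Vector) -> float:
    return sum(a * b for a, b in zip(u, v))


def softmax(scores: List[float]) -> List[float]:
    m = max(scores)                      # subtract the maximum for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query: Vector, keys: List[Vector], values: List[Vector]) -> Vector:
    """Scaled dot-product attention of one query over a sequence."""
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, k) / scale for k in keys])
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(len(values[0]))]


def self_attention(seq: List[Vector]) -> List[Vector]:
    """Encoder step: every history element attends to the whole history."""
    return [attend(x, seq, seq) for x in seq]


def click_probability(history: List[Vector], target: Vector) -> float:
    encoded = self_attention(history)
    user_pref = attend(target, encoded, encoded)   # decoder step: target program as query
    return 1.0 / (1.0 + math.exp(-dot(user_pref, target)))   # sigmoid of assumed g


B_u = [[1.0, 0.0], [0.9, 0.1]]                     # hypothetical latent behavior sequence
p_similar = click_probability(B_u, [1.0, 0.0])
p_opposite = click_probability(B_u, [-1.0, 0.0])
```

Using the target program as the query means the same history produces a different user vector for each candidate program, which is how the model expresses preference relative to what is being recommended.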

Other parts of this embodiment are the same as any of embodiments 1 to 6, and thus are not described again.

Example 8:

this embodiment is based on any one of embodiments 1 to 7 described above, and further,

The working principle is as follows: the model is a click-through rate prediction model; the recommendation task is defined as a binary classification problem, and a sigmoid cross-entropy loss function is used to train and optimize the model.
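The loss named above can be written out as follows. The sketch computes the mean sigmoid cross-entropy directly from logits in the usual numerically stable form; the function name and the sample values are illustrative, and the optimizer and training loop are outside its scope.

```python
import math
from typing import Sequence


def sigmoid_cross_entropy(logits: Sequence[float], labels: Sequence[int]) -> float:
    """Mean of -[y*log(s(z)) + (1-y)*log(1-s(z))], computed stably from the logits z."""
    total = 0.0
    for z, y in zip(logits, labels):
        # Equivalent stable form: max(z, 0) - z*y + log(1 + exp(-|z|))
        total += max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))
    return total / len(logits)


loss_uncertain = sigmoid_cross_entropy([0.0], [1])        # p = 0.5 for a clicked item
loss_good = sigmoid_cross_entropy([3.0, -3.0], [1, 0])    # confident and correct
loss_bad = sigmoid_cross_entropy([3.0, -3.0], [0, 1])     # confident and wrong
```

Here label 1 stands for a clicked program and label 0 for one that was not clicked, matching the binary classification framing in the text.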

The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention in any way, and all simple modifications and equivalent variations of the above embodiments according to the technical spirit of the present invention are included in the scope of the present invention.

Claims

1. A radio and television program recommendation method based on knowledge graph and user microscopic behaviors is characterized by comprising the following steps:

the specific operation of step S3 includes:

wherein Π_vx is the unnormalized transition probability between node v and node x, and z is the normalization constant; Π_vx is obtained by multiplying the entity edge weight w_vx by a coefficient, expressed as:

Π_vx＝α_pq(t，x)·w_vx；

wherein the coefficient α_pq(t, x) is calculated by the following formula:

in the calculation formula of the coefficient α_pq(t, x), t represents the previous node of the random walk, x represents the next node of the random walk, and the depth-first and breadth-first tendencies of the walk are controlled by the values of p and q; d_tx represents the shortest distance between node t and node x;

the step S4 specifically includes the following steps:

namely: TSA = {x_ij}_(m×n), 1 ≤ i ≤ m, 1 ≤ j ≤ n;

The specific calculation formula is as follows:

wherein a is a behavior type, A is a behavior type set,

then, the similarity between the session i and the session j is calculated, and the specific calculation formula is as follows:

wherein m is the number of sessions obtained by dividing according to a given session length k; if the threshold of the accumulated session similarity sum is Y, the value of k at which sum(se) of user u exceeds Y is taken as the session length of that user, and the user's interactive program sequence is extracted as H = {S₁, S₂, ..., S_n};

The specific operation of step S6 is:

step S6.1: calculating the user microscopic behavior weight:

step S6.2: obtaining the type vector vec(a_g) of the user's microscopic behavior: for user microscopic behaviors, one-hot encoding is used to obtain a vector representation for each implicit feedback behavior, with the following formula:

vec(a_u，i)＝a_i·vec(a_g)+vec(a_d)

the "+" sign in the formula represents the vector concatenation operation, a_i is the microscopic behavior weight, vec(a_g) is the type vector of the microscopic behavior, and vec(a_d) is the duration vector of the microscopic behavior;

Then the historical behaviors are interacted with sequences

for discrete microscopic behavior: discretized time intervals are defined, denoted [0, 1), [1, 2), [2, 4), ..., [2^k, 2^(k+1)), ...; each time interval is in units of hours; each item of T_Δ is mapped into a discrete time interval and then into a one-hot code, giving the time code vec(t_i) of the item interaction behavior;

u_i＝vec(a_u，i)+vec(x_i)+vec(t_i)；

The specific conversion formula is as follows:

B^u＝σ_τ(W_τL+b_τ)；

2. The radio and television program recommendation method based on knowledge-graph and user microscopic behaviors as claimed in claim 1, wherein the step S7 specifically comprises the steps of:

wherein Self_Attention represents the self-attention mechanism operation and B^u is the user-behavior latent semantic sequence; with the sequence length set to S and the embedding dimension of each item set to K, B^u ∈ R^(S×K);

3. The radio and television program recommendation method based on knowledge-graph and user microscopic behaviors as claimed in claim 2, wherein in step S7, after step S7.2, the following operations are further performed:

U＝Normalize(Conv1D(Conv1D(U))+U)；

p_u，v＝σ(g(B_emb，τ_v))；

4. The radio and television program recommendation method based on knowledge-graph and user microscopic behaviors as claimed in claim 3, wherein after the step S7, the following steps are required:

5. The broadcasting program recommending method based on the knowledge-graph and the user microscopic behaviors as claimed in any one of claims 1 to 4, wherein the specific operations of the step S1 are:

6. The radio and television program recommendation method based on the knowledge graph and the user microscopic behaviors as claimed in claim 5, wherein when the radio and television field knowledge graph is constructed, the knowledge graph is constructed in a top-down manner: a data schema is built starting from the topmost concept and is refined step by step downwards to form a taxonomy hierarchy with a clear structure and clear logic, so as to complete the ontology construction; in the ontology construction, the concept layer inherits the type of the event, such as region, person, type, program and language; the top-level concepts of the object attributes comprise program category, program origin, person home country, participation, actor, director, screenwriter and program language; the top-level concepts of the data attributes comprise region number, region name, person birthday, person foreign-language name, person gender, person number, person name, kind number, kind name, program number, program rating, program release date, program introduction, program name, language number, and language name.

7. The radio and television program recommendation method based on the knowledge graph and the user microscopic behaviors as claimed in any one of claims 1 to 4, wherein the specific operation of step S2 is: dividing the user's behavior interaction data into continuous microscopic behaviors and discrete microscopic behaviors according to behavior duration; continuous microscopic behaviors are user behaviors that last for a certain time, and comprise live watching, on-demand watching and search watching; discrete microscopic behaviors are user behaviors that occur only at a single moment, and comprise purchasing, favoriting and liking; the data captured by the back end of the radio and television system is collected to form structured data recording the user number, media asset number, behavior type, behavior timestamp and behavior duration, wherein the behavior duration of a continuous microscopic behavior is the corresponding valid value, and the behavior duration of a discrete behavior type is null.