CN112801751B

CN112801751B - Personalized scenic spot recommendation method of multitask graph neural network

Info

Publication number: CN112801751B
Application number: CN202110155597.7A
Authority: CN
Inventors: 许国良; 李家浩; 雒江涛
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Hunan Shuzhi Cultural Tourism Development Co.,Ltd.
Priority date: 2021-02-04
Filing date: 2021-02-04
Publication date: 2022-12-23
Anticipated expiration: 2041-02-04
Also published as: CN112801751A

Abstract

The invention belongs to the field of big data mining and particularly relates to a personalized scenic spot recommendation method based on a multi-task graph neural network. The method comprises: acquiring user-scenic spot interaction data, user attribute data and scenic spot attribute data; constructing a scenic spot knowledge graph and a user knowledge graph from the user and scenic spot data; learning vector representations of the entities and relations in the knowledge graphs through two graph neural networks, and constructing a deep neural network on that basis to predict users' scores for scenic spots; and training the three networks by multi-task alternating training over the recommendation network scoring task and the user and scenic spot knowledge graph representation learning tasks to complete model optimization. By introducing scenic spot and user attribute information, constructing knowledge graphs and training the networks in a multi-task alternating manner, the invention accurately learns the relationship between user features and scenic spot features; multi-task alternating training enhances the scalability of the model, avoids overfitting of the model, and can effectively improve recommendation performance.

Description

Personalized scenic spot recommendation method of multitask graph neural network

Technical Field

The invention belongs to the field of big data mining, and particularly relates to a personalized scenic spot recommendation method of a multitask graph neural network.

Background

With social development and rising living standards, people travel more and more. The domestic tourism industry is currently in a key period of transformation and upgrading toward global tourism and modern tourism. Tourism is an information-intensive industry that is highly comprehensive and highly interrelated, so exploiting massive multi-domain tourism information and using information technologies such as the Internet and cloud computing to innovate business and service models is a necessary choice for this transformation and upgrading. At the same time, the rapid growth of tourism brings the industry both great business opportunities and new challenges. As the construction of smart tourism accelerates, massive tourism data are generated from information acquisition, consumer reviews, product recommendation and the like. How to mine the value of these data with mature big data technology has become central to the development of smart tourism.

To realize smart tourism, tourist attractions can be recommended by personalized recommendation technology. Personalized recommendation systems have been widely and successfully used in e-commerce, where 35% of Amazon's sales are attributed to its recommendation system. Outside e-commerce, however, personalized recommendation has so far been less effective. How to find, from massive travel service information, the information that meets the personalized needs of tourists according to user preferences and recommend it to the user has therefore become a problem to be solved urgently. Traditional recommendation technology includes the collaborative filtering family of techniques and factorization machine (FM) techniques, but these methods cannot effectively use auxiliary information to address cold start and data sparsity, extract the features of tourist attractions and users insufficiently, and give poor recommendation results.

Disclosure of Invention

In order to solve the problems in the prior art, the invention provides a personalized scenic spot recommendation method of a multitask graph neural network, which comprises the following steps: acquiring user data in real time, and preprocessing the acquired user data; inputting the preprocessed data into a trained recommendation model to obtain a recommendation result; the recommendation model is composed of a user graph neural network, a scenic spot graph neural network, a recommendation network and a cross unit;

the process of training the recommendation model includes:

s1: acquiring original data, and preprocessing the original data; the original data comprises user attribute data, scenic spot attribute data and scenic spot interaction data;

s2: extracting a user feature set and a scenic spot feature set from the preprocessed data; constructing a user knowledge graph according to the user feature set, and constructing a scenic spot knowledge graph according to the scenic spot feature set;

s3: inputting the triple data in the user knowledge graph into the user graph neural network for training, and learning the vector representation of the user in the user knowledge graph; inputting the triple data in the scenic spot knowledge graph into the scenic spot graph neural network for training, and learning the vector representation of the scenic spot in the scenic spot knowledge graph;

s4: inputting the user latent features extracted by the user graph neural network and the scenic spot latent features extracted by the scenic spot graph neural network into the recommendation network through the cross units, respectively, to obtain a fused user latent feature vector and a fused scenic spot latent feature vector; forming a prediction score of the user for the scenic spot according to the fused user latent feature vector and the fused scenic spot latent feature vector;

s5: in the training process of the recommendation model, performing multi-task training on the score prediction task of the recommendation network, the representation learning task of the user graph neural network on the user knowledge graph, and the representation learning task of the scenic spot graph neural network on the scenic spot knowledge graph;

s6: calculating the loss function of the model during multi-task training, wherein the loss function of the model comprises the scenic spot graph neural network loss, the user graph neural network loss, the recommendation network loss and the regularization term loss;

s7: when the loss function value of the model reaches its minimum, the training of the model is finished.

Preferably, the process of preprocessing the user data includes: cleaning the user data, and deleting invalid data and abnormal data; the cleaned data are standardized by z-score.

Preferably, the extracted user feature set comprises a biological attribute feature and a social attribute feature; the extracted scenic spot feature set comprises scenic spot resource features and scenic spot leading function features.

Preferably, the structure of the user graph neural network comprises two parts, namely a neural network comprising L fully connected layers and a neural network comprising H fully connected layers; the neural network comprising L fully connected layers is used for extracting the latent feature vectors of the head entities and relations in the user knowledge graph; the neural network comprising H fully connected layers takes the latent feature vectors of the head entity and the relation from the feature extraction layers and performs high-order feature combination to form a predicted tail entity; wherein L and H are model hyperparameters.

Preferably, the structure of the scenic spot graph neural network is the same as that of the user graph neural network; the neural network comprising L fully connected layers is used for extracting the latent feature vectors of the head entities and relations in the scenic spot knowledge graph; the neural network comprising H fully connected layers takes the latent feature vectors of the head entity and the relation from the feature extraction layers and performs high-order feature combination to form a predicted tail entity; wherein L and H are model hyperparameters.

Preferably, the structure of the recommendation network comprises two parts, namely a neural network comprising L fully connected layers and a neural network comprising H fully connected layers; the neural network comprising L fully connected layers is used for extracting the latent features of the user and the scenic spot input to the recommendation network, wherein the user corresponds to the head entity input to the user graph neural network, and the scenic spot corresponds to the head entity input to the scenic spot graph neural network; the neural network comprising H fully connected layers takes the latent feature vectors of the user and the scenic spot from the feature extraction layers, performs high-order feature combination, and predicts the score of the user for the scenic spot.

Preferably, the structure of the cross unit comprises a user cross unit and a scenic spot cross unit; the user cross unit connects the feature extraction layers of the user graph neural network and the recommendation network, and fuses the features extracted for the same user by the user graph neural network and the recommendation network through feature crossing and feature compression to obtain a fused user latent feature vector; the scenic spot cross unit connects the feature extraction layers of the scenic spot graph neural network and the recommendation network, and fuses the features extracted for the same scenic spot by the scenic spot graph neural network and the recommendation network through feature crossing and feature compression to obtain a fused scenic spot latent feature vector.

Further, feature crossing and feature compression are expressed as:

Feature crossing: $C_l = v_l e_l^{T}$

Feature compression:

preferably, the process of obtaining the personalized scores of the user on the scenic spot comprises the following steps:

preferably, the loss function is expressed as:

The beneficial effects of the invention are:

1) The method constructs knowledge graphs from the attribute data of users and scenic spots, learns the feature representations of the scenic spots and users in the knowledge graphs through graph neural networks, and introduces the knowledge describing scenic spots and users in the knowledge graphs into the recommendation network, so that the relationship between user features and scenic spot features is learned accurately and the information in the data is fully mined;

2) The invention designs two cross units as the links between the scenic spot graph neural network and the recommendation network and between the user graph neural network and the recommendation network, and learns the latent interaction features of scenic spots and users across the knowledge graphs and the recommendation network. Multi-task alternating training enhances the scalability of the model, avoids overfitting of the model, and can effectively improve recommendation performance.

Drawings

FIG. 1 is a schematic diagram of the steps of the method of the present invention;

FIG. 2 is a structural diagram of the user graph neural network of the method of the present invention;

FIG. 3 is a structural diagram of the scenic spot graph neural network of the method of the present invention;

FIG. 4 is a structural diagram of the recommendation network of the method of the present invention;

FIG. 5 is a structural diagram of the cross unit of the method of the present invention;

FIG. 6 is a diagram of the overall network architecture of the method of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions in the embodiments of the present invention are clearly and completely described below with reference to the accompanying drawings. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The invention provides a personalized scenic spot recommendation method of a multitask graph neural network, which mainly comprises the following steps: acquiring and preprocessing data; selecting features to establish knowledge graphs of users and scenic spots; learning vector representations of the entity nodes and relations in the knowledge graphs by using graph neural networks; establishing a deep neural network from the historical interaction data of users with scenic spots to predict a given user's personalized score for a given scenic spot; designing two cross units to connect the three networks, so that the information in the knowledge graphs of the given user and the given scenic spot is effectively integrated into the recommendation system; and finally, training the three networks by multi-task alternating training over the recommendation network scoring task and the user and scenic spot knowledge graph representation learning tasks to complete model optimization and form the users' personalized scores for scenic spots.

A personalized scenic spot recommendation method of a multitask graph neural network comprises the following steps: acquiring user data in real time, and preprocessing the acquired user data; inputting the preprocessed data into a trained recommendation model to obtain a recommendation result; the recommendation model is composed of a user graph neural network, a scenic spot graph neural network, a recommendation network and a cross unit.

As shown in fig. 1, the process of training the recommendation model includes:

s1: acquiring original data and preprocessing the original data; the original data comprises user attribute data, scenic spot attribute data and scenic spot interaction data;

s3: inputting the (head entity, relation, tail entity) triples in the user knowledge graph into the user graph neural network for training, and learning the vector representation of the user in the user knowledge graph; inputting the (head entity, relation, tail entity) triples in the scenic spot knowledge graph into the scenic spot graph neural network for training, and learning the vector representation of the scenic spot in the scenic spot knowledge graph;

s4: designing cross units to fuse the user latent features extracted by the user graph neural network and the scenic spot latent features extracted by the scenic spot graph neural network into the recommendation network, obtaining a fused user latent feature vector and a fused scenic spot latent feature vector; forming a prediction score of the user for the scenic spot from the fused user latent feature vector and the fused scenic spot latent feature vector;

s5: in each round of training, performing multi-task training on the score prediction task of the recommendation network, the representation learning task of the user graph neural network on the user knowledge graph, and the representation learning task of the scenic spot graph neural network on the scenic spot knowledge graph; specifically, when a given task is trained, the parameters of the other two networks are kept unchanged and only the parameters of the network for that task are updated; the three tasks are trained alternately in sequence to update the parameters of all three networks until the model converges;

s6: the loss function guiding the whole personalized scenic spot recommendation method of the multitask graph neural network is the sum of the scenic spot graph neural network loss, the user graph neural network loss, the recommendation network loss and the regularization term loss.

User attribute data, scenic spot attribute data and user-scenic spot interaction data are acquired in various ways, including but not limited to collecting tourism website data, volunteer data, public transportation data, climate website data, map software data, social software data and the like by methods such as web crawlers, event tracking (data burying points) and questionnaires. The acquired data are preprocessed, which includes cleaning the user data and deleting invalid data and abnormal data. Because the data come from diverse sources, z-score standardization is applied to eliminate differences in dimension among data from different sources and differences in value range among data from the same source. The standardization formula is:

$$z = \frac{x - u}{\sigma}$$

where x represents the raw data, u represents the mean of the raw data, σ represents the standard deviation of the raw data, and z represents the processed data, whose mean is 0 and standard deviation is 1.
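The standardization step can be illustrated with a minimal Python sketch. The function name `z_score` and the sample ticket prices are hypothetical, and the use of the population standard deviation (dividing by n) is an assumption, since the text does not say which estimator is used.

```python
import math


def z_score(values):
    """Standardize numbers so that the result has mean 0 and standard deviation 1."""
    n = len(values)
    u = sum(values) / n  # mean of the raw data
    # Population standard deviation (assumption: the text does not specify n or n-1).
    sigma = math.sqrt(sum((x - u) ** 2 for x in values) / n)
    if sigma == 0:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(x - u) / sigma for x in values]


# Hypothetical raw attribute column, e.g. ticket prices from one data source.
ticket_prices = [40.0, 60.0, 80.0, 120.0]
z = z_score(ticket_prices)
```

Applying the same function to each numeric attribute separately puts attributes from different sources on a comparable scale before the feature sets are selected.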

A user feature set and a scenic spot feature set are selected from the preprocessed data. The scenic spot features comprise scenic spot resource features and scenic spot leading function features, and the user features comprise biological attribute features and social attribute features.

The scenic spot resource features comprise natural tourism resources, such as landscape resources, geographical location resources, climate resources, greening resources and biological species resources; humanistic resources, such as religious culture resources, historical culture resources, ethnic customs resources and cultural relic resources; and social resources, such as scenic spot traffic resources, modern science and technology resources, modern construction resources and peripheral supporting facility resources.

The scenic spot leading function features comprise sightseeing scenic spots, vacation scenic spots, scientific scenic spots, amusement scenic spots, ecological scenic spots, science and technology scenic spots, adventure scenic spots and the like.

The user biological attribute features include age, gender, height, race, weight, language, and physical and mental health.

The user social attribute features include educational background, occupation, marital and family status, relatives, city of residence, income, social status, religious belief, ethnicity and the like.

Based on these features, the knowledge graphs are constructed in the form of (head entity, relation, tail entity) triples.
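As an illustration of the triple form, the following Python sketch turns an attribute record into (head entity, relation, tail entity) triples. The helper `build_triples`, the entity identifiers and the attribute names are hypothetical and are not taken from the patent; the sketch only assumes that each attribute of a user or scenic spot becomes one relation whose tail is the attribute value.

```python
def build_triples(entity_id, attributes):
    """Turn one entity's attribute record into (head, relation, tail) triples."""
    return [(entity_id, relation, value) for relation, value in attributes.items()]


# Hypothetical user record drawn from biological and social attribute features.
user_kg = build_triples(
    "user_001",
    {"age_group": "25-34", "living_city": "Chongqing", "occupation": "teacher"},
)

# Hypothetical scenic spot record drawn from resource and leading function features.
spot_kg = build_triples(
    "spot_001",
    {"leading_function": "sightseeing", "landscape_resource": "mountain"},
)
```

Each list is a small knowledge graph whose triples can be fed to the corresponding graph neural network.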

As shown in FIG. 2 and FIG. 3, the user graph neural network and the scenic spot graph neural network take the head entity and the relation of a knowledge graph triple as input. Each network is trained with the objective of minimizing the distance between the predicted tail entity $\hat{t}$ and the real tail entity $t$, finally yielding the vector representations of the entities in the user knowledge graph and the scenic spot knowledge graph.

The user graph neural network and the scenic spot graph neural network both comprise lower L fully connected layers for extracting the latent features of the relations in the user knowledge graph and the scenic spot knowledge graph. Each fully connected layer has a weight parameter W and a bias term parameter b and applies a nonlinear activation function σ(x).

The user graph neural network and the scenic spot graph neural network also comprise upper H fully connected layers, which are used to obtain the vector representations of the predicted tail entities in the user knowledge graph and the scenic spot knowledge graph, respectively. Their input is the concatenation (|| denotes the vector concatenation operator) of the head entity latent feature vector with the corresponding relation latent feature vector, where $w_L$ and $e_L$ are the latent feature vectors of the user and the scenic spot in the user knowledge graph and the scenic spot knowledge graph, respectively, after the lower L fully connected layers.
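The forward pass of one such graph network can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions rather than the patented implementation: tanh is used as the activation σ, the distance is taken as squared Euclidean distance, weights are random, and the names `dense`, `init_layer`, `predict_tail` and `squared_distance` are hypothetical.

```python
import math
import random


def dense(x, W, b):
    """One fully connected layer sigma(Wx + b), with tanh as an assumed sigma."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]


def init_layer(rng, n_out, n_in):
    """Random weights and zero biases for one layer (illustrative initialization)."""
    W = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    return W, [0.0] * n_out


def predict_tail(head, relation, head_layers, rel_layers, top_layers):
    """Lower L layers on head and relation, concatenation, then upper H layers."""
    for W, b in head_layers:
        head = dense(head, W, b)
    for W, b in rel_layers:
        relation = dense(relation, W, b)
    x = head + relation  # list concatenation plays the role of ||
    for W, b in top_layers:
        x = dense(x, W, b)
    return x


def squared_distance(a, b):
    """Assumed distance between predicted and real tail entity vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


rng = random.Random(0)
d, L, H = 4, 2, 2  # vector length and layer counts (hyperparameters)
head_layers = [init_layer(rng, d, d) for _ in range(L)]
rel_layers = [init_layer(rng, d, d) for _ in range(L)]
# The last upper layer maps the concatenated vector back to length d.
top_layers = [init_layer(rng, 2 * d, 2 * d) for _ in range(H - 1)] + [init_layer(rng, d, 2 * d)]

head = [rng.uniform(-1, 1) for _ in range(d)]
relation = [rng.uniform(-1, 1) for _ in range(d)]
true_tail = [rng.uniform(-1, 1) for _ in range(d)]

t_hat = predict_tail(head, relation, head_layers, rel_layers, top_layers)
loss = squared_distance(t_hat, true_tail)
```

Training would adjust the layer weights to reduce `loss` over all triples of the knowledge graph; the optimizer is omitted here.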

As shown in FIG. 4, the recommendation network comprises lower L feature extraction layers, in which cross units learn the interaction feature vectors between the knowledge graphs and the corresponding scenic spots and users in the recommendation network, so that the information in the knowledge graphs is merged into the recommendation network. The recommendation network also comprises upper H fully connected layers that learn the high-order combined features of users and scenic spots, and finally outputs the predicted score $\hat{y}_{uv}$ through a nonlinear activation function. The input of the upper layers is formed from $u_L$ and $v_L$, the user and scenic spot feature vectors obtained after the lower L layers of cross units, which carry the latent interaction features of the knowledge graphs and the recommendation network.
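The scoring head of the recommendation network can be sketched as below. The sketch assumes ReLU hidden layers and a sigmoid output as the final nonlinear activation; neither choice is stated in the text, and the names `dense_relu`, `sigmoid` and `predict_score` as well as the example weights are hypothetical.

```python
import math


def dense_relu(x, W, b):
    """One fully connected layer with ReLU activation (assumed choice)."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]


def sigmoid(z):
    """Assumed output activation that maps the logit to a score in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_score(u_L, v_L, hidden_layers, w_out, b_out):
    """Concatenate fused user and scenic spot vectors, apply upper layers, output a score."""
    x = list(u_L) + list(v_L)  # u_L || v_L
    for W, b in hidden_layers:
        x = dense_relu(x, W, b)
    logit = sum(w * xi for w, xi in zip(w_out, x)) + b_out
    return sigmoid(logit)


# Hypothetical fused vectors and hand-picked weights, for illustration only.
identity = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
hidden_layers = [(identity, [0.0] * 4)]
score = predict_score([0.2, -0.1], [0.4, 0.3], hidden_layers, [1.0] * 4, 0.0)
```

Ranking all candidate scenic spots by this score for one user would give that user's recommendation list.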

As shown in FIG. 5, the model proposed by the present invention involves two cross units that connect the lower L feature extraction layers of the three networks. The cross unit connecting the scenic spot graph neural network and the recommendation network takes as input the previous-layer latent feature vector of the scenic spot in the recommendation network and the previous-layer latent feature vector of the same scenic spot in the graph neural network, and learns the high-order latent interaction features of the scenic spot in the recommendation network and the graph neural network through the two steps of feature crossing and feature compression, thereby introducing the information of the scenic spot in the knowledge graph into the recommendation system. Feature crossing and feature compression satisfy:

Feature crossing: $C_l = v_l e_l^{T}$

Feature compression: the feature cross matrix $C_l$ is compressed into the layer $l+1$ latent feature vectors $v_{l+1}$ and $e_{l+1}$ using the model parameters.

Here $e_l$ is the l-th layer latent feature vector of the scenic spot in the graph neural network, and $v_l$ is the l-th layer latent feature vector of the scenic spot in the recommendation network. $C_l$ is the feature cross matrix, the result of pairwise crossing between the latent features of the scenic spot in the recommendation network and the latent features of the scenic spot in the knowledge graph. $e_{l+1}$ is the layer $l+1$ latent feature vector of the scenic spot in the graph neural network, and $v_{l+1}$ is the layer $l+1$ latent feature vector of the scenic spot in the recommendation network. The weights and bias terms used for compression are model parameters, and d is the length of the latent feature vector. The cross unit connecting the user graph neural network and the recommendation network has the same structure as the cross unit connecting the scenic spot graph neural network and the recommendation network.
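The two steps of a cross unit can be sketched as follows. The crossing step follows the outer product $C_l = v_l e_l^{T}$ given in the text. The compression step is an assumption: the patent's own compression expression is not reproduced in this text, so the sketch uses one common form of cross-and-compress unit, projecting both $C_l$ and its transpose with weight vectors and adding a bias. All function and parameter names are hypothetical.

```python
def cross(v, e):
    """Feature crossing: outer product C = v e^T, a d x d matrix."""
    return [[vi * ej for ej in e] for vi in v]


def transpose(C):
    return [list(col) for col in zip(*C)]


def matvec(C, w):
    return [sum(c * wi for c, wi in zip(row, w)) for row in C]


def compress(C, w_a, w_b, bias):
    """Assumed compression: C w_a + C^T w_b + bias, giving a vector of length d."""
    Ct = transpose(C)
    return [x + y + b for x, y, b in zip(matvec(C, w_a), matvec(Ct, w_b), bias)]


def cross_unit(v, e, p):
    """Return the layer l+1 vectors (v_next, e_next) from the layer l vectors."""
    C = cross(v, e)
    v_next = compress(C, p["w_vv"], p["w_ev"], p["b_v"])
    e_next = compress(C, p["w_ve"], p["w_ee"], p["b_e"])
    return v_next, e_next


# Hand-picked values with d = 2, for illustration only.
v_l = [1.0, 2.0]  # scenic spot vector in the recommendation network
e_l = [3.0, 4.0]  # same scenic spot in the graph neural network
params = {
    "w_vv": [1.0, 0.0], "w_ev": [0.0, 1.0], "b_v": [0.0, 0.0],
    "w_ve": [0.0, 1.0], "w_ee": [1.0, 0.0], "b_e": [0.5, 0.5],
}
C_l = cross(v_l, e_l)
v_next, e_next = cross_unit(v_l, e_l, params)
```

Because both outputs are computed from the same cross matrix, each network's next-layer vector depends on the features of the other network, which is how knowledge graph information reaches the recommendation network.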

As shown in FIG. 6, the three networks are alternately trained layer by layer, and the loss function is defined as the sum of the losses of the three networks and a regularization term:

$$\mathcal{L} = \mathcal{L}_{RS} + \mathcal{L}_{U\text{-}GNN} + \mathcal{L}_{I\text{-}GNN} + \mathcal{L}_{REG}$$

where $\mathcal{L}_{RS}$ is the recommendation network loss, computed with a cross-entropy function; $\mathcal{L}_{U\text{-}GNN}$ and $\mathcal{L}_{I\text{-}GNN}$ are the losses of the user graph neural network and the scenic spot graph neural network; $\mathcal{L}_{REG}$ is the regularization term loss; $\lambda_1$, $\lambda_2$ and $\lambda_3$ are hyperparameters of the model; and $W_\theta$ denotes the regularization term parameters. During alternating training, the parameters of two networks are kept unchanged while the parameters of the remaining network are updated. The three networks are trained by multi-task alternating training over the recommendation network scoring task and the user and scenic spot knowledge graph representation learning tasks, which completes model optimization. Once the model parameters are determined, the personalized score of a specific user for each scenic spot can be obtained, and from it the personalized scenic spot recommendation list for the given user.
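The alternating schedule can be illustrated with a toy Python sketch. It is not the patented training procedure: each network is reduced to a single scalar parameter, the three task losses are stand-in quadratic functions, the λ values are arbitrary, and gradients are taken by finite differences. It only shows the pattern of updating one network while the other two are frozen.

```python
def total_loss(p, lam1=0.5, lam2=0.5, lam3=1e-3):
    """Toy combined loss: recommendation + two graph network terms + regularization."""
    loss_rs = (p["rs"] - 1.0) ** 2            # stand-in for the recommendation loss
    loss_u = (p["user_gnn"] - p["rs"]) ** 2   # stand-in for the user graph loss
    loss_i = (p["spot_gnn"] - p["rs"]) ** 2   # stand-in for the scenic spot graph loss
    reg = sum(x * x for x in p.values())      # stand-in for the regularization term
    return loss_rs + lam1 * loss_u + lam2 * loss_i + lam3 * reg


def update_one(p, name, lr=0.1, eps=1e-6):
    """Gradient step on a single network's parameter; the other two stay frozen."""
    up, down = dict(p), dict(p)
    up[name] += eps
    down[name] -= eps
    grad = (total_loss(up) - total_loss(down)) / (2 * eps)
    new_p = dict(p)
    new_p[name] -= lr * grad
    return new_p


params = {"rs": 0.0, "user_gnn": 0.5, "spot_gnn": -0.5}
history = [total_loss(params)]
for epoch in range(50):
    # Alternate the three tasks in sequence within each round of training.
    for task in ("rs", "user_gnn", "spot_gnn"):
        params = update_one(params, task)
    history.append(total_loss(params))
```

In a real model each entry of `params` would be a full set of network weights, and `history` would be monitored to decide when the model has converged.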

Those skilled in the art will appreciate that all or part of the steps in the methods of the above embodiments may be implemented by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium, and the storage medium may include ROM, RAM, magnetic disks, optical disks and the like.

The above embodiments further describe the objects, technical solutions and advantages of the present invention in detail. It should be understood that they are only preferred embodiments of the present invention and should not be construed as limiting it; any modifications, equivalents, improvements and the like made within the spirit and principle of the present invention shall be included in the protection scope of the present invention.

Claims

1. A personalized scenic spot recommendation method of a multitask graph neural network, characterized by comprising the following steps: acquiring user data in real time, and preprocessing the acquired user data; inputting the preprocessed data into a trained recommendation model to obtain a recommendation result; the recommendation model is composed of a user graph neural network, a scenic spot graph neural network, a recommendation network and a cross unit; the structure of the user graph neural network comprises two parts, namely a neural network comprising L fully connected layers and a neural network comprising H fully connected layers, wherein L and H are model hyperparameters; the structure of the scenic spot graph neural network is the same as that of the user graph neural network; the structure of the recommendation network comprises two parts, namely a neural network comprising L fully connected layers and a neural network comprising H fully connected layers; the structure of the cross unit comprises a user cross unit and a scenic spot cross unit; the user cross unit connects the feature extraction layers of the user graph neural network and the recommendation network, and fuses the features extracted for the same user by the user graph neural network and the recommendation network through feature crossing and feature compression to obtain a fused user latent feature vector; the scenic spot cross unit connects the feature extraction layers of the scenic spot graph neural network and the recommendation network, and fuses the features extracted for the same scenic spot by the scenic spot graph neural network and the recommendation network through feature crossing and feature compression to obtain a fused scenic spot latent feature vector;

the process of training the recommendation model includes:

s3: inputting the triple data in the user knowledge graph into the user graph neural network for training, and learning the vector representation of the user in the user knowledge graph; inputting the triple data in the scenic spot knowledge graph into the scenic spot graph neural network for training, and learning the vector representation of the scenic spot in the scenic spot knowledge graph;

s4: inputting the user latent features extracted by the user graph neural network and the scenic spot latent features extracted by the scenic spot graph neural network into the recommendation network through the cross units, respectively, to obtain a fused user latent feature vector and a fused scenic spot latent feature vector; forming a prediction score of the user for the scenic spot according to the fused user latent feature vector and the fused scenic spot latent feature vector; concatenating the scenic spot and user latent features fused by the cross units, inputting the result into the neural network comprising H fully connected layers, and forming the score $\hat{y}_{uv}$ through a nonlinear activation function; wherein || is the vector concatenation symbol, $w_L$ and $e_L$ are the latent feature vectors of the user and the scenic spot in the user knowledge graph and the scenic spot knowledge graph, respectively, after the lower L fully connected layers, $u_L$ and $v_L$ are the latent feature vectors of the user and the scenic spot, respectively, obtained after the cross units, and σ(x) is a nonlinear activation function;

s5: in the training process of the recommendation model, performing multi-task training on the score prediction task of the recommendation network, the representation learning task of the user graph neural network on the user knowledge graph, and the representation learning task of the scenic spot graph neural network on the scenic spot knowledge graph;

s6: calculating the loss function of the model, wherein the loss function of the model comprises the scenic spot graph neural network loss, the user graph neural network loss, the recommendation network loss and the regularization term loss; the expression for the loss function is:

$$\mathcal{L} = \mathcal{L}_{RS} + \mathcal{L}_{U\text{-}GNN} + \mathcal{L}_{I\text{-}GNN} + \mathcal{L}_{REG}$$

wherein $\mathcal{L}_{RS}$ represents the loss function of the recommendation network, $\mathcal{L}_{U\text{-}GNN}$ represents the loss function of the user graph neural network, $\mathcal{L}_{I\text{-}GNN}$ represents the loss function of the scenic spot graph neural network, and $\mathcal{L}_{REG}$ represents the regularization term loss function; the recommendation network loss is the cross-entropy between the prediction score $\hat{y}_{uv}$ and the true score $y_{uv}$; φ(x) represents the difference function between the predicted tail entity and the real tail entity in the knowledge graph; $(h_u, r^u, t_u)$ represents an entity relationship that exists in the user knowledge graph, and $(h_u', r^u, t_u')$ represents an entity relationship that does not exist in the user knowledge graph; $(h_v, r^v, t_v)$ represents an entity relationship that exists in the scenic spot knowledge graph, and $(h_v', r^v, t_v')$ represents an entity relationship that does not exist in the scenic spot knowledge graph; $W_\theta$ represents the regularization term parameters; and $\lambda_1$, $\lambda_2$ and $\lambda_3$ are model hyperparameters;

2. The personalized scenic spot recommendation method of the multitask graph neural network as claimed in claim 1, wherein the preprocessing of the user data comprises: cleaning the user data, and deleting invalid data and abnormal data; the cleaned data are standardized by z-score.

3. The personalized scenic spot recommendation method of the multitask graph neural network as claimed in claim 1, wherein the extracted user feature set comprises a biological attribute feature and a social attribute feature; the extracted scenic spot feature set comprises scenic spot resource features and scenic spot leading function features.

4. The personalized scenic spot recommendation method of the multitask graph neural network as claimed in claim 1, wherein the expressions of feature crossing and feature compression are:

Feature crossing: $C_l = v_l e_l^{T}$

Feature compression: the feature cross matrix $C_l$ is compressed into the layer $l+1$ latent feature vectors $v_{l+1}$ and $e_{l+1}$ using the model parameters;

wherein $e_l$ represents the l-th layer latent feature vector of the scenic spot in the graph neural network, $v_l$ represents the l-th layer latent feature vector of the scenic spot in the recommendation network, and $C_l$ is the feature cross matrix; $v_{l+1}$ represents the layer $l+1$ latent feature vector of the scenic spot in the recommendation network, and $e_{l+1}$ represents the layer $l+1$ latent feature vector of the scenic spot in the graph neural network; the weights and bias terms used for compression are model parameters, d represents the length of the latent feature vector, $T$ indicates the transpose symbol, and V and E each indicate that the current cross unit is a user cross unit.