CN112508256B

CN112508256B - User demand active prediction method and system based on crowdsourcing

Info

Publication number: CN112508256B
Application number: CN202011387991.5A
Authority: CN
Inventors: 张以文; 储蓓; 王庆人; 沈书泽
Original assignee: Anhui University
Current assignee: Anhui University
Priority date: 2020-12-01
Filing date: 2020-12-01
Publication date: 2023-04-14
Anticipated expiration: 2040-12-01
Also published as: CN112508256A

Abstract

The invention provides a user demand active prediction method and a user demand active prediction system based on crowdsourcing, which comprise the following steps: s1: determining annotators participating in the crowdsourcing task, wherein the annotators receive the crowdsourcing task and complete the task; s2: constructing a heterogeneous information network according to the user preference information; s3: generating a user required data space; s4: learning the expression vectors of the user and the demand object respectively through a graph convolution neural network; s5: and (4) demand prediction. According to the invention, the user directly participates in information production and knowledge sharing through a crowdsourcing technology, the preference information fed back by a crowdsourcing annotator can better reflect the real requirement of the user, and the accuracy of the result can be improved by combining the information to predict the requirement; the attribute characteristics of the users are enriched by the user preference information acquired in the crowdsourcing mode, and attribute completion is performed on new registered users lacking historical behavior data, so that each user can be more accurately represented, and the recommendation result is more personalized.

Description

User demand active prediction method and system based on crowdsourcing

Technical Field

The invention relates to the technical field of computers, in particular to a user demand active prediction method and system based on crowdsourcing.

Background

With the development of the internet and big data technology, the problem of information overload is increasingly serious, and the recommendation system can provide interested commodities or services for the user according to the demand information of the user to help the user to perform effective information processing. Therefore, whether the user requirements can be accurately, comprehensively and actively predicted becomes a key for improving the recommendation performance of the service provider so as to realize the maximization of the commercial profit.

(1) Heterogeneous information network recommendation

The existing recommendation technology mainly analyzes user requirements based on network behavior data of users, and the user network behavior data under the background of a big data era often has multi-source heterogeneity. Heterogeneous information networks can integrate different types of objects and complex interaction relationships between the objects, and have been widely used in the field of recommendation as an effective information fusion method. The application number CN106503028A relates to a recommendation method, which comprises the following steps: modeling the objects in the recommended data set and the relationship between the objects as a heterogeneous information network; acquiring a meta path connecting two objects in the heterogeneous information network; calculating similarity data between the objects according to a meta path connecting the two objects; constructing an objective function according to similarity data between objects, and training the recommended data set through the objective function to obtain a prediction score of the user on the object; and recommending the item to the user according to the prediction score of the user on the item. According to the method, a heterogeneous information network is utilized to model the recommendation data, the problem of data sparseness is effectively relieved, and the recommendation effect is improved.

(2) Crowdsourcing recommendations

Crowdsourcing is a distributed problem solving and production mode which adopts a certain mechanism to enable groups to participate in a certain thing together to achieve a certain target. Crowdsourcing solves the problem difficult to understand through the intelligence of the group, and spreads the problem to the worker group in a public bidding mode. Combining crowdsourcing task information has become a hot issue for research in the field of crowdsourcing recommendations.

A method for crowd-sourced task recommendation, as disclosed in application No. 202010464312.3, comprising the steps of: according to crowdsourcing worker data and historical tasks on a crowdsourcing platform, user portrait updating and user portrait grade updating are carried out on crowdsourcing workers; screening the crowdsourcing workers according to the requirements of the tasks to be processed, and obtaining a crowdsourcing worker list; determining the completion time and price of the task to be processed according to the requirements of the task to be processed and the crowdsourcing worker list; determining a recommendation probability list of the crowdsourcing workers through a task recommendation model according to the completion time and the price; recommending the tasks to be processed to crowdsourcing workers in the crowdsourcing worker list according to the tasks to be processed and the recommendation probability list. According to the method and the device, the crowdsourcing workers are subjected to user portrait, skills of the crowdsourcing workers are graded according to attributes in the user portrait, and the recommendation probability list is generated, so that tasks are automatically pushed to the crowdsourcing workers. The method focuses on task recommendation on a crowdsourcing platform. The crowdsourcing mode can feed back information which can reflect true requirements of the user to a task requester in a mode of direct participation of the user, such as a service provider, more accurate and complete data support can be provided for user requirement prediction of a service platform by combining data collected by the crowdsourcing mode with recommended data, and particularly the data missing problem of a newly registered service platform user can be relieved. Therefore, designing a heterogeneous information network modeling method fusing crowdsourcing acquisition data and recommendation data and a corresponding demand prediction method is a practical demand.

Disclosure of Invention

The invention aims to provide a high-matching-degree demand active prediction method for a new user lacking historical data.

The invention solves the technical problems through the following technical means:

a crowd-sourced user demand active prediction method comprises the following steps:

s1: determining annotators participating in the crowdsourcing task, designing the crowdsourcing task and issuing the crowdsourcing task to a crowdsourcing task platform, wherein the annotators receive the crowdsourcing task and complete the task;

s2: constructing a heterogeneous information network according to the user social relationship, the historical behavior data and the user preference information acquired in the crowdsourcing mode;

s3: uniformly representing different types of entities in a heterogeneous information network to generate a user demand data space;

s4: extracting the interactive semantics of the user and the demand object by using a meta path in a heterogeneous information network, and respectively learning the expression vectors of the user and the demand object through a graph convolution neural network;

s5: and aggregating the neighbor information of the target user according to the social relation of the users in the heterogeneous information network, obtaining the expression vector of the target user from the data space, and predicting the demand.

Further, the step S1 comprises

S11: acquiring a user set of a service provider as a target user, acquiring user social relationship data from a social network and a service provider platform to obtain a social neighbor user set of the target user, and taking all users as annotators for receiving crowdsourcing tasks;

s12: and designing a user preference survey questionnaire from the aspects of demographic information, social requirements for reflecting common interest in social relations and enjoyment requirements for reflecting individual preferences, wherein the content of the questionnaire comprises selection questions expressed in characters and displayed in a graphical mode, a annotator is allowed to submit auxiliary information independently, the questionnaire is published on a crowdsourcing platform in a crowdsourcing task mode, and tasks are published to the annotator obtained in S11.

Further, the step S2 includes:

s21: taking multi-modal data acquired based on a crowdsourcing mode in the S1 as an attribute set of each user, and taking the user and a demand object as nodes;

s22: establishing a connection edge between a user and a demand object according to the following relation:

the relationship R1: direct relationships such as friends and concern exist among users U, and L are used respectively ^-1 Representing relationships between users U, i.e.

And &>

Relationship R2: some users have historical behavior information, such as the user bought a certain article, used a certain service, etc., respectively using B and B ^-1 Representing user U and requirement object O ^k In relation to each other, i.e.

And &>

Wherein k represents a kth class requirement object;

s23: and establishing a multi-mode heterogeneous information network according to the attribute set, the nodes and the relationship among the nodes.

Further, the step S3 includes:

s31: user information, text attribute information and image attribute information of a demand object collected in a crowdsourcing mode are uniformly expressed:

obtaining vector representation of text type information by adopting word2vec method

Wherein e is _u Representing a user text attribute vector representation, e _o Representing the text attribute vector representation of the demand object, wherein N is the quantity of the demand object categories;

the picture type information is represented by vector obtained by adopting convolutional neural network

Wherein, g _u Representing user Picture Attribute vector representation, g _o A picture attribute vector representation representing a demand object;

s32: to the warpFusing the uniformly expressed multi-modal attribute information: the user attribute vector e obtained in the step S31 _u And g _u Performing outer product operation to realize feature intersection, flattening the obtained matrix according to rows, inputting the flattened matrix into a multilayer perceptron to obtain an initial vector representation Z of a user node, and expressing a vector of a demand object attribute

And &>

Repeating the operation to obtain the initial vector representation O of the demand object node ^k And vector representation of all users and requirement objects forms a user requirement data space.

Further, the step S4 includes:

s41: establishing a plurality of user-demand object co-occurrence matrixes T according to historical behavior information of users ^k : user-item co-occurrence matrix

Subscriber-service co-occurrence matrix ≥>

Wherein, | I | is the quantity of articles, | S | is the quantity of services, if the user has bought a certain article or the user has used a certain service, put 1 in the corresponding position of the corresponding co-occurrence matrix;

s42: in the k-th demand active prediction scene of the user, extracting the UO from the heterogeneous information network constructed in the step S2 ^k U-element path, meaning that two users use the semantic information of the kth class demand object together, co-occurrence matrix T ^k To which it is transferred

Multiply, i.e. ->

Deriving an inter-user relationship matrix under that semantic->

Extracting O from the heterogeneous information network constructed in step S2 ^k UO ^k Meta-path representing semantic information used by the same user for two kth class demand objects by &>

Obtain a relationship matrix ^ between the kth class of demand objects under the semantic>

S43: for the relationship matrix between users obtained in S42

And a relationship matrix between demand objects>

The standardization treatment is carried out according to the following formulas respectively,

wherein, the first and the second end of the pipe are connected with each other,

and &>

Are all diagonal matrices, are asserted>

And &>

Are respectively based on>

And &>

A degree matrix of (c);

s44: using a graph-convolution neural network, a user vector representation is learned according to the following formula,

the vector representation of the kth class demand object is learned in accordance with the following formula,

wherein the content of the first and second substances,

vector representations of the ith layer user and kth class requirement object respectively are shown, when l is 0,

is Z, <' > based on>

Is O ^k P and W are weight parameters, wherein, indicates element-by-element multiplication operation, sigma is an activation function, and phi indicates that a vector is converted into a diagonal matrix;

s45: and repeating the operation in the S44, and alternately updating the vector representations of the user and the kth class of demand object respectively until the final layer of convolution is finished to obtain the vector representations of all the user and the kth class of demand object.

Further, the step S5 includes:

s51: s4, obtaining a vector representation of a target user i as

Neighbor user j e of user iN (i), aggregating neighbor user information by using an attention mechanism to obtain final target user vector representation; the weight coefficient of the neighbor to the target user is calculated,

the vector representation of the target user is updated,

wherein, alpha and W are weight parameters, sigma is an activation function, and | l is splicing operation;

s52: for each target user, calculating its relevance prediction score to each kth class demand object

S53: the loss function is a binary cross entropy function:

wherein, Y and Y ^- Positive and negative examples in the data set, Y represents the demand object set used by the user, Y ^- Sampled from the demand objects in the data set that are not used by the user,

indicates whether the user has an interaction, presence interaction @, with the demand object>

Is 1, otherwise is 0; using stochastic gradient descent method to correct the loss functionPerforming optimization solution on the numbers, sequencing the kth class demand objects from high to low according to the prediction score obtained by calculation in the step S52, and selecting the first n demand objects as a kth class demand list of the user;

s54: by repeating the operations of S42-S53, a list of all the categories of demand objects for each user can be obtained, thereby realizing active prediction of user demands.

The invention also provides a user demand active prediction system based on crowdsourcing, which comprises

A crowdsourcing task issuing module: determining a annotator participating in the crowdsourcing task, designing the crowdsourcing task and issuing the crowdsourcing task to a crowdsourcing task platform, and receiving the crowdsourcing task and completing the task by the annotator;

heterogeneous information network construction module: constructing a heterogeneous information network according to the user social relationship, the historical behavior data and the user preference information acquired in the crowdsourcing mode;

the user demand data space generation module: uniformly representing different types of entities in a heterogeneous information network to generate a user demand data space;

the user and demand object representation vector learning module: extracting the interactive semantics of the user and the demand object by using a meta path in the heterogeneous information network, and respectively learning the expression vectors of the user and the demand object;

a demand forecasting module: and aggregating the neighbor information of the target user according to the social relation of the users in the heterogeneous information network to obtain the expression vector of the target user and perform demand prediction.

Further, the specific execution process of the step crowdsourcing task issuing module is as follows:

s11: acquiring a user set of a service provider as a target user, acquiring user social relationship data from a social network and a service provider platform to obtain a social neighbor user set of the target user, and taking all the users as annotators for receiving crowdsourcing tasks;

Further, the heterogeneous information network construction module performs the following steps:

s21: taking multimodal data collected based on a crowdsourcing mode as an attribute set of each user, and taking the user and a demand object as nodes;

the relationship R1: direct relationships such as friends and concerns exist between users U, and L are used respectively ^-1 Representing relationships between users U, i.e.

And &>

The relationship R2: some users have historical behavior information, such as the user bought a certain article, used a certain service, etc., respectively using B and B ^-1 Representing user U and requirement object O ^k In relation to each other, i.e.

And &>

Wherein k represents a kth class requirement object;

Further, the specific execution process of the user requirement data space generation module includes:

s32: fusing the multi-mode attribute information after uniform expression: the user attribute vector e obtained in S31 is used _u And g _u Performing outer product operation to realize feature crossing, flattening the obtained matrix according to rows, and inputting the flattened matrix into a multilayer perceptron to obtain a user node initial vector representation Z and a demand object attribute representation vector

And &>

Further, the specific implementation process of the expression vector learning module for the user and the demand object includes:

Subscriber-service co-occurrence matrix ≥>

s42: in the k-th demand active prediction scene of the user, extracting the UO from the constructed heterogeneous information network ^k U-element path, meaning that two users commonly use semantic information of kth class demand object, co-occurrence matrix T ^k To which it is transferred

Multiply, i.e. ->

Deriving an inter-user relationship matrix under that semantic->

Extracting O from a constructed heterogeneous information network ^k UO ^k Meta-path representing semantic information used by the same user for two kth class demand objects by &>

/>

S43: for the relationship matrix between users obtained in S42

And a relation matrix between demand objects>

wherein the content of the first and second substances,

and &>

Are all diagonal matrices, are asserted>

And &>

Are respectively based on>

And &>

A degree matrix of (c);

s44: using a graph convolution neural network, a user vector representation is learned in accordance with the following formula,

the vector representation of the kth class demand object is learned as follows,

wherein the content of the first and second substances,

is Z->

Further, the specific implementation process of the step demand prediction module includes:

s51: the expression vector learning module of the user and the demand object obtains the vector expression of the target user i as

Neighbor users j of the user i belong to N (i), and the neighbor user information is aggregated by using an attention mechanism to obtain final target user vector representation; the weight coefficient of the neighbor to the target user is calculated,

the vector representation of the target user is updated,

s52: for each target user, calculating the relevance prediction score of the target user and each k-th class demand object

S53: the loss function is a binary cross entropy function:

wherein Y and Y ^- Positive and negative examples in the data set, Y represents the demand object set used by the user, Y ^- Sampled from the demand objects in the data set that are not used by the user,

indicates whether a user has interacted with the demand object, there is an interaction @>

Is 1, otherwise is 0; optimizing and solving the loss function by using a random gradient descent method, sequencing kth demand objects from high to low according to the prediction score calculated in the step S52, and selecting the first n demand objects as a kth demand list of the user; />

S54: by repeating the operations of S42-S53, a list of all the categories of demand objects for each user can be obtained, thereby realizing the active prediction of the demand of the user.

The invention has the advantages that:

according to the invention, the user directly participates in information production and knowledge sharing through a crowdsourcing technology, the preference information fed back by a crowdsourcing annotator can better reflect the real requirement of the user, and the accuracy of the result can be improved by combining the information to predict the requirement; the attribute characteristics of the users are enriched by the user preference information acquired in the crowdsourcing mode, and attribute completion is performed on new registered users lacking historical behavior data, so that each user can be more accurately represented, and the recommendation result is more personalized.

The social relationship, the user historical behavior data and the user preference information collected in the crowdsourcing mode generally have the characteristics of multiple sources, multiple types and multiple relationships, the heterogeneous information network can effectively model multiple types of entities and complex relationships among the entities, and implicit interactive relationships among the entities of different types are extracted through meta-paths, so that the information is fully utilized, the individual characteristics of users and demand objects can be more comprehensively described, and the potential demands of the users are mined.

Drawings

Fig. 1 is a flow chart of a user demand active prediction method based on crowdsourcing in an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the embodiments of the present invention, and it is obvious that the described embodiments are some embodiments of the present invention, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

As shown in fig. 1, the present embodiment provides a method for actively predicting user demand based on crowdsourcing, which includes the following steps:

s1: determining annotators participating in the crowdsourcing task, designing the crowdsourcing task and issuing the crowdsourcing task to a crowdsourcing task platform; specifically comprises

S11: acquiring a user set of a service provider as a target user, acquiring user social relationship data from Twitter, a service provider platform and the like to obtain a social neighbor user set of the target user, and taking all the users as annotators for receiving crowdsourcing tasks;

s12: a user preference survey questionnaire is designed from the aspects of demographic information, social requirements reflecting common interest in social relations, enjoyment requirements reflecting personal preferences and the like, wherein the content of the questionnaire comprises word expressions and selection questions displayed graphically (for example, the interested content is selected from the following options), and a annotator is allowed to independently submit auxiliary information such as texts, videos and pictures. Issuing the questionnaire to a crowdsourcing platform in a crowdsourcing task form, and issuing a task to the annotator acquired in S11;

s2: constructing a heterogeneous information network according to the user social relationship, the historical behavior data and the user preference information acquired in the crowdsourcing mode; specifically comprises

S21: and taking the multi-modal data collected based on the crowdsourcing mode in the S1 as a property set of each user, such as age, gender, favorite movie posters and the like, and the properties of a demand object, such as a manufacturer, a marketing date and the like. The user and demand objects (including goods, services, etc.) are taken as nodes.

the relationship R1: direct relationships such as friends and concern exist among users, and L are used respectively ^-1 Representing relationships between users (U), i.e.

And &>

Relationship R2: some users have historical behavior information, such as the user bought a certain article, used a certain service, etc., and use B and B respectively ^-1 Representing user (U) and requirement object (O) ^k ) In relation to each other, i.e.

And &>

Wherein k represents a kth-class demand object, such as a commodity or a service;

s31: user information and attribute information of demand objects collected based on crowdsourcing mode generally have different expression forms including types of texts, pictures and the like, and the user information and the attribute information of the demand objects need to be acquired according to different modalities without adopting the modesThe same expression learning method is used for unified expression. Obtaining vector representation of text type information by adopting word2vec method

e _u Representing a user text attribute vector representation, e _o And representing the text attribute vector representation of the demand object, wherein N is the number of the demand object categories. The picture type information is expressed by a vector obtained by a convolutional neural network>

g _u Representing user picture attribute vector representation, g _o A picture attribute vector representation representing a demand object;

s32: in order to learn the embedded representation of the nodes, the multimodality attribute information after being uniformly expressed needs to be fused. The user attribute vector e obtained in S31 is used _u And g _u Performing outer product operation to realize feature intersection, flattening the obtained matrix according to rows, inputting the flattened matrix into a multilayer perceptron to obtain an initial vector representation Z of a user node, and expressing a vector of a demand object attribute

And &>

Repeating the operation to obtain the initial vector representation O of the demand object node ^k The vector representations of all users and demand objects form a user demand data space.

S4: extracting the interactive semantics of the user and the demand object, and respectively learning the expression vectors of the user and the demand object; specifically comprises

S41: establishing a plurality of user-demand object co-occurrence matrixes T according to historical behavior information of users ^k E.g. user-item co-occurrence matrix

Subscriber-service co-occurrence matrix ≥>

s42: in the k-th demand active prediction scene of the user, extracting UO from the heterogeneous information network constructed by S2 ^k U-element path, meaning that two users commonly use semantic information of kth class demand object, co-occurrence matrix T ^k And the transpose thereof

Multiply, i.e. ->

Get the inter-user relationship matrix->

Extracting O from S2-constructed heterogeneous information network ^k UO ^k Meta-path representing semantic information used by the same user for two kth class demand objects by &>

Deriving a relationship matrix between class k demand objects under the semantics>

S43: for the relationship matrix between users obtained in S42

And a relation matrix between demand objects>

wherein the content of the first and second substances,

and &>

Are all diagonal matrices, in combination>

And &>

Are respectively based on>

And &>

A degree matrix of (c);

is Z->

S5: aggregating neighbor information of a target user according to user social relations in a heterogeneous information network to obtain an expression vector of the target user and perform demand prediction, specifically comprising

S51: s4, obtaining a vector of the target user i and expressing the vector as

And a neighbor user j of the user i belongs to N (i), and the final target user vector representation is obtained by aggregating neighbor user information by using an attention mechanism. The weight coefficient of the neighbor to the target user is calculated,

the vector representation of the target user is updated,

wherein, α and W are weight parameters, σ is an activation function, and | is a splicing operation.

S53: the loss function is a binary cross entropy function:

wherein Y and Y ^- Positive and negative examples in the data set, Y represents the demand object set used by the user, Y ^- Sampled from unused demand objects in the data set by the user,

Is 1, otherwise is 0. And (4) performing optimization solution on the loss function by using a random gradient descent method, sequencing the kth demand objects from high to low according to the prediction score obtained by S52 calculation, and selecting the first n demand objects as a kth demand list of the user.

The embodiment also provides a crowd-sourced user demand active prediction system, which comprises

A crowdsourcing task issuing module: determining annotators participating in the crowdsourcing task, designing the crowdsourcing task and issuing the crowdsourcing task to a crowdsourcing task platform, wherein the annotators receive the crowdsourcing task and complete the task; in particular to

S11: acquiring a user set of a service provider as a target user, acquiring user social relationship data from Twitter, a service provider platform and the like to obtain a social neighbor user set of the target user, and taking all users as annotators for receiving crowdsourcing tasks;

s12: a user preference survey questionnaire is designed from the aspects of demographic information, social requirements reflecting common interest in social relations, enjoyment requirements reflecting personal preferences and the like, the content of the questionnaire comprises word expressions and selection questions displayed in a graphical mode (for example, the interested content is selected from the following options), and a annotator is allowed to submit auxiliary information such as texts, videos, pictures and the like independently. And issuing the questionnaire to a crowdsourcing platform in a crowdsourcing task form, and issuing a task to the annotator acquired in S11.

Heterogeneous information network construction module: constructing a heterogeneous information network according to the user social relationship, the historical behavior data and the user preference information acquired in the crowdsourcing mode in the heterogeneous information network; in particular to

S21: taking the multi-modal data collected by S1 based on the crowdsourcing mode as a property set of each user, such as age, gender, favorite movie posters and the like, and properties of demand objects, such as manufacturers, dates of marketing and the like. The user and demand objects (including goods, services, etc.) are taken as nodes.

S22: establishing a connecting edge between the user and the demand object according to the following relation:

And &>

And &>

Wherein k represents a kth class requirement object;

The user requirement data space generation module: uniformly representing different types of entities in a heterogeneous information network to generate a user demand data space; in particular to

S31: user information and attribute information of a demand object collected based on a crowdsourcing mode generally have different expression forms, including types such as texts and pictures, and need to be uniformly expressed by adopting different expression learning methods according to different modalities:

s32: in order to learn the embedded representation of the nodes, the multimodality attribute information after being uniformly expressed needs to be fused. The user attribute vector e obtained in S31 is used _u And g _u Performing outer product operation to realize feature crossing, flattening the obtained matrix according to rows, and inputting the flattened matrix into a multilayer perceptron to obtain a user node initial vector representation Z and a demand object attribute representation vector

And &>

Repeating the operation to obtain the initial node of the demand objectVector representation O ^k The vector representations of all users and demand objects form a user demand data space.

The user and demand object representation vector learning module: extracting the interactive semantics of the user and the demand object by using a meta path in a heterogeneous information network, and respectively learning the expression vectors of the user and the demand object; in particular to

S41: establishing a plurality of user-demand object co-occurrence matrixes T according to historical behavior information of users ^k : for example, a user-item co-occurrence matrix

Subscriber-service co-occurrence matrix ≥>

Multiply, i.e. [ means ] in>

Get the inter-user relationship matrix->

Obtaining a relation matrix between kth class demand objects under the semantics/>

S43: for the relationship matrix between users obtained in S42

And a relationship matrix between demand objects>

wherein the content of the first and second substances,

and &>

Are all diagonal matrices, are asserted>

And &>

Are respectively based on>

And &>

A degree matrix of (c);

vector representations of the l-th layer user and the k-th type demand object respectively are shown, when l is 0,

is Z->

The demand prediction module is used for aggregating neighbor information of the target user according to the social relations of the users in the heterogeneous information network to obtain an expression vector of the target user and performing demand prediction, and specifically comprises

A neighbor user j of the user i belongs to N (i), and the final target user vector representation is obtained by aggregating neighbor user information by using an attention mechanism; the weight coefficient of the neighbor to the target user is calculated,

the vector representation of the target user is updated,

wherein alpha and W are weight parameters, sigma is an activation function, and | is splicing operation;

/>

S53: the loss function is a binary cross entropy function:

Is 1, otherwise is 0; optimizing and solving the loss function by using a random gradient descent method, sequencing kth demand objects from high to low according to the prediction score calculated in the step S52, and selecting the first n demand objects as a kth demand list of the user;

The above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims

1. A user demand active prediction method based on crowdsourcing is characterized by comprising the following steps: the method comprises the following steps:

s1: determining a annotator participating in the crowdsourcing task, designing the crowdsourcing task and issuing the crowdsourcing task to a crowdsourcing task platform, and receiving the crowdsourcing task and completing the task by the annotator; the method specifically comprises the following steps:

s12: designing a user preference survey questionnaire from the perspectives of demographic information, social requirements for reflecting common interest and love in social relations and enjoyment requirements for reflecting personal preferences, wherein the content of the questionnaire comprises word expression and selection questions displayed graphically, allowing a annotator to submit auxiliary information independently, publishing the questionnaire to a crowdsourcing platform in a crowdsourcing task form, and publishing tasks to the annotator obtained in S11;

s2: constructing a heterogeneous information network according to the user social relationship, the historical behavior data and the user preference information acquired in the crowdsourcing mode; the method specifically comprises the following steps:

the relationship R1: direct relationships of friends and concerns exist among users U, and L are used respectively ^-1 Representing relationships between users U, i.e.

And &>

Relationship R2: part of the users have historical behavior information, respectively B and B ^-1 Representing user U and demand object O ^k In relation to each other, i.e.

And &>

Wherein k represents a kth class requirement object;

s23: establishing a multi-mode heterogeneous information network according to the attribute set, the nodes and the relationship among the nodes;

s4: extracting the interactive semantics of the user and the demand object by using a meta path in a heterogeneous information network, and respectively learning the expression vectors of the user and the demand object;

s5: and aggregating the neighbor information of the target user according to the social relation of the users in the heterogeneous information network to obtain the expression vector of the target user and perform demand prediction.

2. The crowd-sourced user demand active prediction method of claim 1, wherein: the step S3 includes:

representing vectors for the attributes of the demand objects;

representing vectors for the attributes of the demand objects; />

And &>

3. The crowd-sourced user demand active prediction method of claim 1, wherein: the step S4 includes:

s41: establishing a co-occurrence matrix T of a plurality of user-demand objects according to historical behavior information of users ^k : user-item co-occurrence matrix

Subscriber-service co-occurrence matrix ≥>

s42: in the k-th demand active prediction scene of the user, extracting the UO from the heterogeneous information network constructed in the step S2 ^k U-element path, meaning that two users use the semantic information of the kth class demand object together, co-occurrence matrix T ^k With its transpose (T) ^k ) ^T Multiplication, i.e. T ^k ×(T ^k ) ^T Obtaining the relationship matrix between the users under the semantic meaning

Extracting O from the heterogeneous information network constructed in step S2 ^k UO ^k Meta-path, meaning semantic information used by the same user for two kth class requirement objects, passing through (T) ^k ) ^T ×T ^k Obtain a relationship matrix ^ between the kth class of demand objects under the semantic>

S43: for the relationship matrix between users obtained in S42

And a relation matrix between demand objects>

and &>

Are all diagonal matrices, in combination>

And &>

Are respectively based on>

And &>

A degree matrix of (c);

wherein the content of the first and second substances,

vector representations of the ith layer user and kth class demand object, respectively, are based on the fact that when l is 0, then->

Is a group of a group Z having a structure,

s45: and repeating the operation in the S44, and alternately updating the vector representations of the user and the kth type demand object respectively until the last layer of convolution is finished to obtain the vector representations of all the users and the kth type demand object.

4. The active prediction method of user demand based on crowdsourcing of claim 3, wherein: the step S5 includes:

s51: s4, obtaining a vector of the target user i and expressing the vector as

the vector representation of the target user is updated,

S53: the loss function is a binary cross entropy function:

Is 1, otherwise is 0; optimizing and solving the loss function by using a random gradient descent method, sequencing kth-class demand objects from high to low according to the prediction score obtained by calculation in the step S52, and selecting the first n demand objects as a kth-class demand list of the user;

5. A crowd-sourced based active user demand prediction system is characterized in that: comprises that

the specific execution process of the crowdsourcing task issuing module is as follows:

s12: designing a user preference survey questionnaire from the aspects of demographic information, social requirements for reflecting common interest in social relations and enjoyment requirements for reflecting individual preferences, wherein the content of the questionnaire comprises selection questions expressed in characters and displayed in a graphical mode, a annotator is allowed to submit auxiliary information independently, the questionnaire is issued to a crowdsourcing platform in a crowdsourcing task mode, and tasks are issued to the annotator acquired in S11;

the heterogeneous information network construction module: constructing a heterogeneous information network according to the user social relationship, the historical behavior data and the user preference information acquired in the crowdsourcing mode;

the heterogeneous information network construction module comprises the following execution processes:

And &>

And &>

Wherein k represents a kth class requirement object;

the user requirement data space generation module: uniformly representing different types of entities in a heterogeneous information network to generate a user demand data space;

a demand forecasting module: and aggregating the neighbor information of the target user according to the user social relationship in the heterogeneous information network to obtain the expression vector of the target user and perform demand prediction.

6. The crowd-sourced, user-demand active prediction system of claim 5, wherein: the specific execution process of the user requirement data space generation module comprises the following steps:

Wherein e is _u Representing a userText attribute vector representation, e _o Representing the text attribute vector of the demand object, wherein N is the quantity of the demand object categories;

s32: fusing the multi-mode attribute information after uniform expression: the user attribute vector e obtained in S31 is used _u And g _u Performing outer product operation to realize feature intersection, flattening the obtained matrix according to rows, inputting the flattened matrix into a multilayer perceptron to obtain an initial vector representation Z of a user node, and expressing a vector of a demand object attribute

And &>

7. The crowd-sourced, user-demand active prediction system of claim 6, wherein: the specific implementation process of the expression vector learning module of the user and the demand object comprises the following steps:

Subscriber-service co-occurrence matrix ≥>

s42: in the k-th demand active prediction scene of the user, extracting the UO from the constructed heterogeneous information network ^k U-element path, meaning that two users use the semantic information of the kth class demand object together, co-occurrence matrix T ^k With its transpose (T) ^k ) ^T Multiplication, i.e. T ^k ×(T ^k ) ^T Obtaining the relationship matrix between the users under the semantic meaning

Extracting O from a constructed heterogeneous information network ^k UO ^k Meta-path, meaning semantic information used by the same user for two kth class requirement objects, passing through (T) ^k ) ^T ×T ^k Obtain a relationship matrix ^ between the kth class of demand objects under the semantic>

S43: for the relationship matrix between users obtained in S42

And a relation matrix between demand objects>

wherein the content of the first and second substances,

and &>

Are all diagonal matrices, are asserted>

And &>

Are respectively in>

And &>

A degree matrix of (c);

the vector representation of the kth class demand object is learned as follows,

Is a group of Z and is a group of Z,

8. The crowd-sourced, user-demand active prediction system of claim 7, wherein: the specific execution process of the step demand forecasting module comprises the following steps:

the vector representation of the target user is updated,

S53: the loss function is a binary cross entropy function: