WO2021213156A1 - Method and related apparatus for generating task label on basis of relationship graph convolutional network - Google Patents

Method and related apparatus for generating task label on basis of relationship graph convolutional network Download PDF

Info

Publication number
WO2021213156A1
WO2021213156A1 PCT/CN2021/084223 CN2021084223W WO2021213156A1 WO 2021213156 A1 WO2021213156 A1 WO 2021213156A1 CN 2021084223 W CN2021084223 W CN 2021084223W WO 2021213156 A1 WO2021213156 A1 WO 2021213156A1
Authority
WO
WIPO (PCT)
Prior art keywords
task
user
data
node
graph data
Prior art date
Application number
PCT/CN2021/084223
Other languages
French (fr)
Chinese (zh)
Inventor
张楠
王健宗
瞿晓阳
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021213156A1 publication Critical patent/WO2021213156A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • This application belongs to the field of artificial intelligence technology, and in particular relates to a task label generation method and related devices based on a relational graph convolutional network.
  • Deep learning is to learn the inherent laws and representation levels of sample data.
  • the information obtained in the learning process is of great help to the interpretation of data such as text, images and sounds.
  • An important process of generating sample data is to label the data, that is, to label the data.
  • the labeled data can be used as a pilot experience for deep learning.
  • data labeling is usually carried out by labeling systems or platforms.
  • the administrator distributes the labeling task to a large number of users for labeling through the labeling system, and the same labeling task is usually distributed to multiple users, and the answers are summarized as the labeling result.
  • the allocation efficiency is low and may result in poor matching between tasks and annotators, reducing the efficiency of annotation. .
  • the present application provides a method and related device for generating task tags based on a relational graph convolutional network, which avoids the administrator from manually assigning tasks to users, improves the efficiency of task assignment, and the generated task tags are consistent with The user's relevance is stronger, which improves the matching degree of task assignment.
  • a task label generation method based on a relationship graph convolutional network including:
  • the user task graph data includes at least one user node and at least one task node.
  • the user node corresponds to the user information contained in the user data
  • the task node corresponds to the task contained in the task data. information
  • the task feature corresponds to the task node contained in the user task graph data;
  • the user task matrix contains the probability distribution of user nodes on at least one task label, and the task labels correspond to the task nodes one-to-one;
  • the target task label of the user corresponding to the user node is generated.
  • an apparatus for generating task tags including:
  • the user task graph data module is used to generate user task graph data according to user data and task data.
  • the user task graph data includes at least one user node and at least one task node.
  • the user node corresponds to the user information contained in the user data.
  • the task node Correspond to the task information contained in the task data;
  • the feature representation output module is used to input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolutional network for the user task graph data and at least one task feature, the user feature corresponding to the user task For the user nodes contained in the graph data, the task features correspond to the task nodes contained in the user task graph data;
  • the user task matrix building module is used to construct a user task matrix according to user characteristics and task characteristics.
  • the user task matrix contains the probability distribution of user nodes on at least one task label, and the task labels correspond to the task nodes one by one;
  • the task label generating module is used to generate the target task label of the user corresponding to the user node according to the probability distribution of the user node contained in the user task matrix on at least one task label.
  • a task tag generation device based on a relational graph convolutional network.
  • the device includes: a processor; and a memory for storing executable instructions of the processor; the processor executes The following steps are performed when the computer-readable instructions are:
  • user task graph data is generated.
  • the user task graph data includes at least one user node and at least one task node.
  • the user node corresponds to the user information contained in the user data.
  • the task node Corresponding to the task information contained in the task data;
  • a user task matrix Constructing a user task matrix according to the user characteristics and the task characteristics, the user task matrix containing the probability distribution of the user node on at least one task label, and the task label corresponds to the task node one-to-one;
  • a target task label of the user corresponding to the user node is generated.
  • a computer-readable storage medium stores at least one instruction, and the following steps are executed when the at least one instruction is executed by a processor:
  • user task graph data is generated.
  • the user task graph data includes at least one user node and at least one task node.
  • the user node corresponds to the user information contained in the user data.
  • the task information contained in the task data corresponds to the task information contained in the task data.
  • a user task matrix Constructing a user task matrix according to the user characteristics and the task characteristics, the user task matrix containing the probability distribution of the user node on at least one task label, and the task label corresponds to the task node one-to-one;
  • a target task label of the user corresponding to the user node is generated.
  • the relationship between the user and the user, the relationship between the task and the task, and the relationship between the user and the task, and the relationship between the user and the task, and the relationship between the user and the task are learned through the relationship graph convolutional network.
  • the administrator manually assigns tasks to users, which improves the efficiency of assigning tasks, and the generated task tags are more relevant to users, which improves the matching of task assignments.
  • Figure 1 is a schematic diagram of an exemplary system architecture of the technical solution of the present application in an application scenario.
  • Fig. 2 is a flowchart of a method for generating task labels based on a relational graph convolutional network according to an embodiment of the present application.
  • Fig. 3 is a schematic diagram of an example of user graph data in an embodiment of the present application.
  • FIG. 4 is a schematic diagram of an example of task graph data in an embodiment of the present application.
  • Fig. 5 is a schematic diagram of an example of user task graph data in an embodiment of the present application.
  • Fig. 6 is a block diagram of the composition of a task tag generating device in an embodiment of the present application.
  • Fig. 7 is a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
  • Figure 1 is a schematic diagram of an exemplary system architecture of the technical solution of the present application in an application scenario.
  • the implementation environment may include: a terminal 101 and a server 102.
  • a labeling system is deployed on the terminal 101 and the server 102.
  • the labeling system is a system for labeling data.
  • the effective labeling data generated by the labeling system provides source data for deep learning training models with strong generalization capabilities.
  • Common labeling tasks in the labeling system include text labeling, voice labeling, translation labeling, image labeling and so on.
  • the server distributes the corresponding labeling task to each client, and the user completes the labeling task through the client and sends it back to the server.
  • the tagging system will score the user according to the task completion status of the user, such as the speed and number of tasks completed, and the review status of the tagging results.
  • a wired or wireless communication connection is established between the terminal 101 and the server 102, and data transmission with the server 102 is realized through this communication connection.
  • a client terminal is running in the terminal 101, and the client terminal can provide a user interaction interface. User interaction can be triggered on the user interaction interface for data annotation.
  • the client may be an image labeling client, and in the process of image labeling, the user receives the distribution through the client.
  • the terminal 101 may be a mobile phone, a tablet computer, a notebook computer, a desktop computer, or any other electronic device that can be operated by the client, and there is no restriction here.
  • the client can be an application client or a web client, and there are no restrictions here.
  • the server 102 is used to provide data support for user interaction operations triggered in the client, so that the client can run normally in the terminal 101. Still taking the above-mentioned image annotation as an example, the server 102 may send the image to be annotated and the corresponding candidate task tag to the client according to the image annotation operation triggered in the client, or may receive the result of the image annotation from the client to provide The customer assigns the content of the follow-up marking task.
  • the server 102 may be a single server device or a server group composed of multiple server devices, which is not limited here.
  • Fig. 2 is a flowchart of a method for generating task labels based on a relational graph convolutional network according to an embodiment of the present application. This method can be executed by the server 103 in the application scenario shown in FIG. 1. As shown in Figure 2, in one embodiment, the method may include the following steps:
  • Step S210 Generate user task graph data according to the user data and task data.
  • the user task graph data includes at least one user node and at least one task node.
  • the user node corresponds to the user information contained in the user data
  • the task node corresponds to the task data. Contains task information.
  • the main manifestation of user task graph data is an undirected weighted graph containing user nodes and task nodes, which express the association relationship and degree of association between users, tasks, and between users and tasks involved in the labeling system.
  • User data includes user information and completion of tasks.
  • User information includes user attributes such as age, resume, profession, professional skills, and foreign language level.
  • the task completion status includes the status of the tasks completed by the user, including information such as the number, type, and time of the task.
  • user nodes contain user information of users, and the edges between user nodes represent the association relationship between users with respect to the completed tasks. The greater the weight of the edge, the closer the association relationship.
  • the task data includes task information and task records.
  • the task information mainly includes task type and task content.
  • Task types include, for example, text annotation tasks, voice annotation tasks, translation annotation tasks, image annotation tasks, and so on.
  • the task content is the task goal of the corresponding task.
  • the task content is the original text and the translated text, as well as the introduction information about the translation and labeling task itself.
  • task nodes include task information about tasks, and the edges between task nodes represent the relationship between tasks relative to the user who completes the task. The greater the weight of the edge, the closer the relationship.
  • the edge between the user node and the task node represents the association relationship between the user and the task. For example, the task completed by the user belongs to the association relationship between the user and the task. The greater the weight of the edge, the closer the association relationship.
  • association relationships between users, tasks, and users and tasks can be determined. Based on these association relationships, user task graph data can be generated.
  • Step S220 Input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to the user task graph data contained
  • the task feature corresponds to the task node contained in the user task graph data.
  • Relational graph convolutional network Graph Convolutional Network is an extension of large-scale relational data based on graph convolutional networks. It is a graph convolutional network that aggregates local neighbor information.
  • a propagation model is defined to calculate the forward update of nodes or entities in a relational multi-graph:
  • the index of the neighbor set of node i in relation r is a regularization constant, which can be learned or extracted, and sparse matrix multiplication can be used to avoid explicit summation of neighbors.
  • the conversion of relationship and feature is introduced, which depends on the type and direction of the edge.
  • the conversion can be a linear message conversion, and other more flexible functions, such as a multilayer neural network, can be used, but at the same time it will increase the amount of calculation required.
  • the relational graph convolutional network calculates the corresponding feature representation for each user and each task according to the user information and task information in the user task graph data and the relationship between the user and the task.
  • the feature representation is usually a one-dimensional vector, and may be a sparse feature vector, for example, a one-hot vector.
  • User feature representation and task feature representation are representations related to user data and task data. Then, learn user embedding (embedding) and task embedding (embedding) through RGCN.
  • the calculation process of RGCN for each node in the user task graph can be, for example, taking any node as a central node, performing a convolution on the central node, and updating the representation of the central node by aggregating information of neighbor nodes.
  • the aggregation of neighbor nodes is classified according to the edge type, and the corresponding conversion is performed according to the different edge types.
  • the collected information undergoes a regularized summation, and finally passes through the activation function.
  • the information of each vertex updates the shared parameters, parallel calculation, and also includes self-connection, that is to say, includes the node's own representation.
  • Step S230 Construct a user task matrix according to the user characteristics and the task characteristics.
  • the user task matrix contains the probability distribution of the user nodes on at least one task label, and the task labels correspond to the task nodes one-to-one.
  • the user task matrix can be constructed by matrix multiplication. If there are 3 users and 3 tasks, a 3x3 user task matrix will be constructed, and each element in the matrix represents the probability that the corresponding user will be better at the corresponding task.
  • the user task matrix is normalized by softmax. For example, the value of the probability distribution of each user on all tasks is between 0 and 1, and the sum is 1.
  • Step S240 According to the probability distribution of the user nodes contained in the user task matrix on at least one task label, generate the target task label of the user corresponding to the user node.
  • the probability distribution of each user for all task labels can be determined, and the task label corresponding to each user can be generated according to the predetermined task allocation principle. For example, if a predetermined number of tasks are assigned to each user, the task labels can be sorted in descending order according to the probability quantity and the task labels can be taken as the result according to the predetermined number, or all task labels with probabilities greater than a predetermined threshold can be taken as the result, or for Each user can select the task corresponding to the highest probability for the user as the task label.
  • the server can dispatch the task corresponding to the task tag to the user.
  • the relationship between the user and the user, the relationship between the task and the task, and the relationship between the user and the task, and the relationship between the user and the task, and the relationship between the user and the task are learned through the relationship graph convolutional network.
  • the administrator manually assigns tasks to users, which improves the efficiency of assigning tasks, and the generated task tags are more relevant to users, which improves the matching of task assignments.
  • the method may further include the following steps:
  • Step S201 Generate historical user task graph data based on historical task data and historical user data.
  • the historical user data corresponds to multiple known task tags, and the known task tags correspond to the historical task information in a one-to-one manner;
  • Step S202 Training the relational graph convolutional network based on historical user task graph data and known task labels.
  • the correspondence between historical user data and task tags can be determined according to the task records in the historical task data. For example, for a certain user included in historical user data, the task label that appears most frequently among the tasks completed by the user is counted as its corresponding task label.
  • the content contained in the historical user task data is the same as the above-mentioned user task diagram, and will not be repeated here.
  • historical user data and historical task data can directly use the historical task record information stored in the annotation system.
  • the relation graph convolutional network can be trained. Specifically, in the training process, historical user task graph data and known task labels are input into the relationship graph convolutional network to obtain a user feature representation and a task feature representation. According to the obtained results and the correspondence between historical user data and known task tags, the solution of the loss function can be determined.
  • the loss function can use various classification loss functions such as cross entropy. The solution of the loss function is minimized by adjusting the parameters in order to determine the value of each parameter in the graph convolutional network.
  • building historical user task graph data and using the historical user task graph data to train the relational graph convolutional network can obtain a specific model that conforms to the task situation of the labeling system and the user's situation, which is beneficial to improve the relational graph convolution.
  • the accuracy of network output results, thereby improving the accuracy of task assignment.
  • step S210 Generate user task graph data according to user data and task data, including:
  • Step S211 Generate user graph data according to the user data, the user graph data includes at least one user node, and generate task graph data according to the task data, the task graph data includes at least one task node;
  • Step S212 The user map data and the task map data are merged to obtain the user task map data.
  • the user data graph is an undirected weighted graph with user information as a node and a collaborative relationship between users relative to a task as an edge.
  • the user graph data includes at least one user node.
  • the task graph is an undirected weighted graph with task information as nodes and the relationship between tasks relative to users as edges.
  • the task graph data includes at least one task node.
  • the user map data and the task map data are generated, merge the two to obtain the user task map data. Specifically, while keeping the relationship between the nodes and edges in the user graph data and the task graph data unchanged, an edge is added between the task node and the user node according to the user's completion of the task to indicate the relationship between the user and the task. The relationship between them. Traverse each user node in the user graph data, determine its association relationship with each task node in the task graph data, and establish corresponding edges according to the association dangling relationship to obtain the user task graph data.
  • the user task graph is obtained by merging the user graph data and the task graph data. While retaining the original data of the user graph data and the task graph data, the relationship between the user and the task is added to maintain The integrity of the data is conducive to improving the accuracy of the graph convolutional network.
  • generating user graph data according to user data in step S211 includes:
  • Step S2111 Obtain the user information contained in the user data and the task record information corresponding to each user information;
  • Step S2112. According to the task record information corresponding to each user information, determine the same task completed between users corresponding to each user information;
  • Step S2113. Generate user nodes in the user graph data according to the user information, and generate edges between the user nodes according to the same tasks completed between users corresponding to the user information to obtain the user graph data.
  • the task record information mainly includes the task type of each task completed by the user.
  • the same task indicates tasks of the same task type.
  • the number of tasks of the same type completed by two users can be determined.
  • the edges between users in the user graph data represent the number of tasks of the same type completed by two users.
  • FIG. 3 is a schematic diagram of an example of user graph data in an embodiment of the present application.
  • the weight value of the edge represents the number of tasks of the same type. For example, if user A has completed 30 voice labeling tasks, and user B has completed 15 voice labeling tasks, the weight value of the edge is 15. For another example, if user A has completed 7 voice labeling tasks and 8 image labeling tasks, and user B has also completed 7 voice labeling tasks and 8 image labeling tasks, the weight value of the edge is also 15, that is, edge The weight value includes the number of all tasks of the same type.
  • the task record information includes task identification of each task completed by the user.
  • the same task indicates tasks with the same task ID, that is, identical tasks.
  • the edges between users in the user graph data represent the number of tasks completed by two users with the same task identifier. For example, if user B has completed 4 image labeling tasks, user C has completed 5 image labeling tasks and the pictures in 3 tasks are the same as the labeling task completed by user B, then there is a margin between user B and user C , And the weight value of the edge is 3.
  • the user map data is established based on the same task completed by the user information and the users corresponding to each user information, which fully reflects the collaborative relationship between multiple users with respect to the task, and is beneficial to dispatching multiple users on the same task.
  • generating task graph data according to the task data in step S211 includes:
  • Step S2114 Obtain the task information contained in the task data and the task history data corresponding to each user information;
  • Step S2115 Determine each task information performed by the same user according to the task history data corresponding to each user information
  • Step S2116 The task nodes in the task graph data are generated according to the task information, and the edges between the task nodes are generated according to the information of each task performed by the same user to obtain the task graph data.
  • the task history data mainly includes the information of the user who completed the task and the task completion time.
  • a time threshold can be set.
  • the edges between tasks in the task graph data represent the number of users who have completed two tasks.
  • FIG. 4 is a schematic diagram of an example of task graph data in an embodiment of the present application.
  • the user graph data is established based on the task information and the information of each task performed by the same user, which fully reflects the association relationship between multiple tasks with respect to the user, and distributes tasks based on the user’s characteristics and skills. So that the user completes the task efficiency.
  • the above step S212 on the basis of the above embodiments, the above step S212.
  • the user map data and the task map data are merged to obtain the user task map data, including:
  • Step S2121. Obtain the task record information corresponding to the user node in the user graph data, and obtain the task history data corresponding to the task node in the task graph data;
  • Step S2122. Determine the association relationship between the user node and the task node according to the task record information and the task history data, and the association relationship indicates that the user corresponding to the user node completes the task corresponding to the task node;
  • Step S2123. According to the association relationship between the user node and the task node, an edge between the user node and the task node is constructed between the user graph data and the task graph data to obtain the user task graph data.
  • the specific tasks completed by each user and the number of times each specific task have been completed can be specifically determined.
  • the association relationship between the user node and the task node can be determined, and for the two nodes that have the association relationship, based on the user graph data and the task graph data, the user can be constructed
  • the edge between the node and the task node represents the situation where the user completes the corresponding task, and the weight of the edge represents the number of times the user completes the task.
  • FIG. 5 is a schematic diagram of an example of user task graph data in an embodiment of the present application. Assuming that task 1 represents a translation and annotation task, if user A has completed 10 different translation and annotation tasks, correspondingly, there is an edge between user A and task 1 on the way of the user task, and its weight is 10.
  • task 1 represents a specific translation and annotation task, for example, annotating the translation of a certain article
  • task 2 represents another specific translation and annotation task.
  • User A completes task 1 10 times
  • user B completes task 2 30 times.
  • user B and task 2 There is an edge between, and its weight value is 30.
  • the user task graph is generated based on the user graph data and task graph data based on the user completing the task, which is beneficial to fully consider the relationship between the user and the task in the calculation process of the relation graph convolutional network, and improve The degree of association between the task and the user in turn makes the calculation of the characteristic representation of the user and the task more accurate.
  • the above step S240 According to the probability distribution of the user node contained in the user task matrix on at least one task label, generate the target task label of the user corresponding to the user node ,include:
  • Step S241. Determine the maximum probability corresponding to the user node according to the user task matrix
  • Step S242 If the maximum probability is greater than the preset probability distribution threshold, the task label corresponding to the maximum probability is used as the target task label of the user corresponding to the user node.
  • the user task matrix contains the assignment probability of each task for each user. If the user task matrix includes 3 users and 3 types of tasks, for one user, there is a 3-dimensional vector, and each feature value in the vector represents the probability that the corresponding task is assigned to the user.
  • the three tasks are text annotation, voice annotation, and image annotation.
  • the three-dimensional vector can be ⁇ 0.7, 0.2, 0.1 ⁇ .
  • the maximum probability can be determined, and if the maximum probability is greater than the probability distribution threshold, it means that the user and the corresponding task are more closely related, and the task label can be used as the The user's task label. For example, for the aforementioned user X, the maximum probability is 0.7, and if the probability distribution threshold is 0.5, it can be determined that the text is labeled as the task tag of the user.
  • the user's task label is determined by comparing the maximum probability of the user node with the probability distribution threshold, which helps to avoid forcibly restricting the user's task type when the user's own tendency is not obvious, and helps to improve the accuracy of task assignment. sex.
  • Fig. 6 is a block diagram of the composition of a task tag generating device in an embodiment of the present application.
  • the task tag generating apparatus 300 may mainly include:
  • the user task graph data module 310 is used to generate user task graph data according to user data and task data.
  • the user task graph data includes at least one user node and at least one task node.
  • the user node corresponds to the user information contained in the user data.
  • the node corresponds to the task information contained in the task data;
  • the feature representation output module 320 is configured to input user task graph data into the relationship graph convolutional network to obtain at least one user feature output by the relationship graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to the user
  • the user nodes contained in the task graph data, and the task characteristics correspond to the task nodes contained in the user task graph data;
  • the user task matrix construction module 330 is configured to construct a user task matrix according to user characteristics and task characteristics.
  • the user task matrix contains the probability distribution of user nodes on at least one task label, and the task labels correspond to the task nodes one-to-one;
  • the task label generating module 340 is configured to generate the target task label of the user corresponding to the user node according to the probability distribution of the user node contained in the user task matrix on at least one task label.
  • the task tag generating device 300 further includes:
  • the historical user task graph data generation module is used to generate historical user task graph data based on historical task data and historical user data.
  • the historical user data corresponds to multiple known task tags, and the known task tags and historical task information are one by one correspond;
  • the relational graph convolutional network training module is used to train the relational graph convolutional network based on historical user task graph data and known task labels.
  • the user task graph data module 310 may include:
  • the user graph data generating unit is configured to generate user graph data according to user data, the user graph data includes at least one user node, and the task graph data is generated according to task data, and the task graph data includes at least one task node;
  • the merging processing unit is used for merging the user map data and the task map data to obtain the user task map data.
  • the user graph data generating unit may include:
  • the user information acquisition subunit is used to acquire the user information contained in the user data and the task record information corresponding to each user information;
  • the same task determination subunit is used to determine the same task completed between users corresponding to each user information according to the task record information corresponding to each user information respectively;
  • the user node generating subunit is used to generate user nodes in the user graph data according to the user information, and generate edges between the user nodes according to the same tasks completed between users corresponding to the user information to obtain the user graph data.
  • the user task graph data module 310 may include:
  • the task information acquisition subunit is used to acquire the task information contained in the task data and the task history data corresponding to each user information;
  • the same user determination subunit is used to determine each task information performed by the same user according to the task history data corresponding to each user information;
  • the task node generating subunit is used to generate task nodes in the task graph data according to task information, and generate edges between task nodes according to the information of each task performed by the same user to obtain task graph data.
  • the merging processing unit may include:
  • the task record information obtaining subunit is used to obtain the task record information corresponding to the user node in the user graph data, and obtain the task history data corresponding to the task node in the task graph data;
  • the association relationship determination subunit is used to determine the association relationship between the user node and the task node according to the task record information and the task history data, and the association relationship instructs the user corresponding to the user node to complete the task corresponding to the task node;
  • the user task graph data construction subunit is used to construct an edge between the user node and the task node between the user graph data and the task graph data according to the association relationship between the user node and the task node to obtain the user task graph data.
  • the task tag generation module 340 may include:
  • the maximum probability determining unit is used to determine the maximum probability corresponding to the user node according to the user task matrix
  • the task label determining unit is configured to, if the maximum probability is greater than the preset probability distribution threshold, use the task label corresponding to the maximum probability as the target task label of the user corresponding to the user node.
  • Fig. 7 is a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
  • the computer system 400 includes a central processing unit (Central Processing Unit (CPU) 401, which can execute according to a program stored in a read-only memory (Read-Only Memory, ROM) 402 or a program loaded from the storage part 408 to a random access memory (Random Access Memory, RAM) 403 Various appropriate actions and processing.
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • various programs and data required for system operation are also stored.
  • the CPU 401, the ROM 402, and the RAM 403 are connected to each other through a bus 404.
  • An input/output (Input/Output, I/O) interface 405 is also connected to the bus 404.
  • the following components are connected to the I/O interface 405: an input part 406 including a keyboard, a mouse, etc.; an output part 407 including a cathode ray tube (Cathode Ray Tube, CRT), a liquid crystal display (LCD), etc., and speakers 407 ; A storage part 408 including a hard disk, etc.; and a communication part 409 including a network interface card such as a LAN (Local Area Network) card, a modem, and the like.
  • the communication section 409 performs communication processing via a network such as the Internet.
  • the driver 410 is also connected to the I/O interface 405 as needed.
  • a removable medium 411 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc., is installed on the drive 410 as required, so that the computer program read therefrom is installed into the storage part 408 as required.
  • the processes described in the flowcharts of the various methods may be implemented as computer software programs.
  • the embodiments of the present application include a computer program product, which includes a computer program carried on a computer-readable medium, and the computer program contains program code for executing the method shown in the flowchart.
  • the computer program may be downloaded and installed from the network through the communication part 409, and/or installed from the removable medium 411.
  • CPU central processing unit
  • the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium or any combination of the two.
  • the computer-readable storage medium may be non-volatile or Can be volatile.
  • the computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or a combination of any of the above.
  • Computer-readable storage media may include, but are not limited to: electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable Erasable Programmable Read Only Memory (EPROM), flash memory, optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the above.
  • the computer-readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in combination with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in a baseband or as a part of a carrier wave, and a computer-readable program code is carried therein.
  • This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • the computer-readable signal medium may also be any computer-readable medium other than the computer-readable storage medium, and the computer-readable medium may send, propagate, or transmit the program for use by or in combination with the instruction execution system, apparatus, or device .
  • the program code contained on the computer-readable medium can be transmitted by any suitable medium, including but not limited to: wireless, wired, etc., or any suitable combination of the foregoing.
  • each block in the flowchart or block diagram may represent a module, program segment, or part of the code, and the above-mentioned module, program segment, or part of the code contains one or more for realizing the specified logic function.
  • Executable instructions may also occur in a different order from the order marked in the drawings. For example, two blocks shown one after another can actually be executed substantially in parallel, and they can sometimes be executed in the reverse order, depending on the functions involved.
  • each block in the block diagram or flowchart, and a combination of blocks in the block diagram or flowchart can be implemented by a dedicated hardware-based system that performs the specified function or operation, or can be implemented by It is realized by a combination of dedicated hardware and computer instructions.
  • modules or units of the device for action execution are mentioned in the above detailed description, this division is not mandatory.
  • the features and functions of two or more modules or units described above may be embodied in one module or unit.
  • the features and functions of a module or unit described above can be further divided into multiple modules or units to be embodied.
  • the example embodiments described here can be implemented by software, or can be implemented by combining software with necessary hardware. Therefore, the technical solution according to the embodiments of the present application can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, U disk, mobile hard disk, etc.) or on the network , Including several instructions to make a computing device (which can be a personal computer, a server, a touch terminal, or a network device, etc.) execute the method according to the embodiment of the application.
  • a non-volatile storage medium which can be a CD-ROM, U disk, mobile hard disk, etc.
  • Including several instructions to make a computing device which can be a personal computer, a server, a touch terminal, or a network device, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Provided are a method and related apparatus for generating a task label on the basis of a relationship graph convolutional network, relating to the field of artificial intelligence. The method comprises: according to user data and task data, generating user task graph data; inputting the user task graph data into a relational graph convolutional network to obtain at least one user feature outputted by the relational graph convolutional network for the user task graph data, and at least one task feature; according to the user feature and task feature, constructing a user task matrix; according to the probability distribution, on at least one task label, of user nodes contained in the user task matrix, generating a target task label of the user corresponding to the user node. By means of the described method, it is unnecessary for an administrator to manually assign tasks to users, improving the efficiency of assigning tasks; furthermore, the generated task labels are more relevant to users, improving the degree of matching of task assignment.

Description

根据关系图卷积网络的任务标签生成方法及相关装置Task label generation method and related device based on relation graph convolutional network
本申请要求于2020年11月25日提交中国专利局、申请号为202011342170.X,申请名称为“根据关系图卷积网络的任务标签生成方法及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on November 25, 2020, the application number is 202011342170.X, and the application title is "Task label generation method and related device based on relational graph convolutional network", which The entire content is incorporated into this application by reference.
技术领域Technical field
本申请属于人工智能技术领域,特别涉及一种根据关系图卷积网络的任务标签生成方法及相关装置。This application belongs to the field of artificial intelligence technology, and in particular relates to a task label generation method and related devices based on a relational graph convolutional network.
背景技术Background technique
深度学习是学习样本数据的内在规律和表示层次,这些学习过程中获得的信息对诸如文字,图像和声音等数据的解释有很大的帮助。生成样本数据的一个重要过程是标注数据,即为数据打标签,标注后的数据可以用作深度学习的先导经验。Deep learning is to learn the inherent laws and representation levels of sample data. The information obtained in the learning process is of great help to the interpretation of data such as text, images and sounds. An important process of generating sample data is to label the data, that is, to label the data. The labeled data can be used as a pilot experience for deep learning.
目前,数据标注通常标注系统或平台进行。管理员通过标注系统将标注任务派发给大量用户进行标注,并且同一个标注任务通常被派发给多个用户,并且将答案汇总作为标注结果。At present, data labeling is usually carried out by labeling systems or platforms. The administrator distributes the labeling task to a large number of users for labeling through the labeling system, and the same labeling task is usually distributed to multiple users, and the answers are summarized as the labeling result.
然而,发明人发现在上述方案中,任务派发的过程通常凭借管理员的主观判断和经验来为目标任务选择标注人员或用户,分配效率低且可能造成任务与标注人员匹配度差,降低标注效率。However, the inventor found that in the above-mentioned solution, the task dispatch process usually relies on the administrator’s subjective judgment and experience to select annotators or users for the target task. The allocation efficiency is low and may result in poor matching between tasks and annotators, reducing the efficiency of annotation. .
技术问题technical problem
根据上述技术问题,本申请提供一种根据关系图卷积网络的任务标签生成方法及相关装置,以避免了由管理员手动向用户分派任务,提升了分派任务的效率,并且生成的任务标签与用户的相关性更强,提升了任务分派的匹配度。According to the above technical problems, the present application provides a method and related device for generating task tags based on a relational graph convolutional network, which avoids the administrator from manually assigning tasks to users, improves the efficiency of task assignment, and the generated task tags are consistent with The user's relevance is stronger, which improves the matching degree of task assignment.
技术解决方案Technical solutions
根据本申请实施例的一个方面,提供一种根据关系图卷积网络的任务标签生成方法,包括:According to an aspect of the embodiments of the present application, there is provided a task label generation method based on a relationship graph convolutional network, including:
根据用户数据以及任务数据,生成用户任务图数据,用户任务图数据包括至少一个用户节点以及至少一个任务节点,用户节点对应于用户数据中含有的用户信息,任务节点对应于任务数据中含有的任务信息;According to user data and task data, generate user task graph data. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data, and the task node corresponds to the task contained in the task data. information;
将用户任务图数据输入关系图卷积网络中,获得关系图卷积网络针对用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,用户特征对应于用户任务图数据中含有的用户节点,任务特征对应于用户任务图数据中含有的任务节点;Input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolution network for the user task graph data and at least one task feature, the user feature corresponding to the user node contained in the user task graph data , The task feature corresponds to the task node contained in the user task graph data;
根据用户特征以及任务特征,构建用户任务矩阵,用户任务矩阵中含有用户节点在至少一个任务标签上的概率分布,任务标签与任务节点一一对应;According to user characteristics and task characteristics, construct a user task matrix, the user task matrix contains the probability distribution of user nodes on at least one task label, and the task labels correspond to the task nodes one-to-one;
根据用户任务矩阵中含有的用户节点在至少一个任务标签上的概率分布,生成用户节点所对应用户的目标任务标签。According to the probability distribution of the user node contained in the user task matrix on at least one task label, the target task label of the user corresponding to the user node is generated.
根据本申请实施例的另一个方面,提供一种任务标签生成装置,包括:According to another aspect of the embodiments of the present application, there is provided an apparatus for generating task tags, including:
用户任务图数据模块,用于根据用户数据以及任务数据,生成用户任务图数据,用户任务图数据包括至少一个用户节点以及至少一个任务节点,用户节点对应于用户数据中含有的用户信息,任务节点对应于任务数据中含有的任务信息;The user task graph data module is used to generate user task graph data according to user data and task data. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data. The task node Correspond to the task information contained in the task data;
特征表示输出模块,用于将用户任务图数据输入关系图卷积网络中,获得关系图卷积网络针对用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,用户特征对应于用户任务图数据中含有的用户节点,任务特征对应于用户任务图数据中含有的任务节点;The feature representation output module is used to input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolutional network for the user task graph data and at least one task feature, the user feature corresponding to the user task For the user nodes contained in the graph data, the task features correspond to the task nodes contained in the user task graph data;
用户任务矩阵构建模块,用于根据用户特征以及任务特征,构建用户任务矩阵,用户任务矩阵中含有用户节点在至少一个任务标签上的概率分布,任务标签与任务节点一一对应;The user task matrix building module is used to construct a user task matrix according to user characteristics and task characteristics. The user task matrix contains the probability distribution of user nodes on at least one task label, and the task labels correspond to the task nodes one by one;
任务标签生成模块,用于根据用户任务矩阵中含有的用户节点在至少一个任务标签上的概率分布,生成用户节点所对应用户的目标任务标签。The task label generating module is used to generate the target task label of the user corresponding to the user node according to the probability distribution of the user node contained in the user task matrix on at least one task label.
根据本申请实施例的另一个方面,提供一种根据关系图卷积网络的任务标签生成设备,该设备包括:处理器;以及存储器,用于存储处理器的可执行指令;所述处理器执行所述计算机可读指令时执行以下步骤:According to another aspect of the embodiments of the present application, there is provided a task tag generation device based on a relational graph convolutional network. The device includes: a processor; and a memory for storing executable instructions of the processor; the processor executes The following steps are performed when the computer-readable instructions are:
根据用户数据以及任务数据,生成用户任务图数据,所述用户任务图数据包括至少一个用户节点以及至少一个任务节点,所述用户节点对应于所述用户数据中含有的用户信息,所述任务节点对应于所述任务数据中含有的任务信息;According to user data and task data, user task graph data is generated. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data. The task node Corresponding to the task information contained in the task data;
将所述用户任务图数据输入关系图卷积网络中,获得所述关系图卷积网络针对所述用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,所述用户特征对应于所述用户任务图数据中含有的用户节点,所述任务特征对应于所述用户任务图数据中含有的任务节点;Input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to all The user node contained in the user task graph data, and the task feature corresponds to the task node contained in the user task graph data;
根据所述用户特征以及所述任务特征,构建用户任务矩阵,所述用户任务矩阵中含有所述用户节点在至少一个任务标签上的概率分布,所述任务标签与所述任务节点一一对应;Constructing a user task matrix according to the user characteristics and the task characteristics, the user task matrix containing the probability distribution of the user node on at least one task label, and the task label corresponds to the task node one-to-one;
根据所述用户任务矩阵中含有的所述用户节点在至少一个任务标签上的概率分布,生成所述用户节点所对应用户的目标任务标签。According to the probability distribution of the user node on at least one task label contained in the user task matrix, a target task label of the user corresponding to the user node is generated.
根据本申请实施例的另一个方面,提供一种计算机可读存储介质,所述计算机可读存储介质存储有至少一个指令,所述至少一个指令被处理器执行时实现时执行以下步骤:According to another aspect of the embodiments of the present application, a computer-readable storage medium is provided, the computer-readable storage medium stores at least one instruction, and the following steps are executed when the at least one instruction is executed by a processor:
根据用户数据以及任务数据,生成用户任务图数据,所述用户任务图数据包括至少一个用户节点以及至少一个任务节点,所述用户节点对应于所述用户数据中含有的用户信息,所述任务节点对应于所述任务数据中含有的任务信息;According to user data and task data, user task graph data is generated. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data. Corresponding to the task information contained in the task data;
将所述用户任务图数据输入关系图卷积网络中,获得所述关系图卷积网络针对所述用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,所述用户特征对应于所述用户任务图数据中含有的用户节点,所述任务特征对应于所述用户任务图数据中含有的任务节点;Input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to all The user node contained in the user task graph data, and the task feature corresponds to the task node contained in the user task graph data;
根据所述用户特征以及所述任务特征,构建用户任务矩阵,所述用户任务矩阵中含有所述用户节点在至少一个任务标签上的概率分布,所述任务标签与所述任务节点一一对应;Constructing a user task matrix according to the user characteristics and the task characteristics, the user task matrix containing the probability distribution of the user node on at least one task label, and the task label corresponds to the task node one-to-one;
根据所述用户任务矩阵中含有的所述用户节点在至少一个任务标签上的概率分布,生成所述用户节点所对应用户的目标任务标签。According to the probability distribution of the user node on at least one task label contained in the user task matrix, a target task label of the user corresponding to the user node is generated.
有益效果Beneficial effect
在本申请的实施例中,通过关系图卷积网络学习用户和用户的相关性,任务和任务的相关性,以及用户和任务的相关性,能够生成与用户相关性强的任务标签,从而避免了由管理员手动向用户分派任务,提升了分派任务的效率,并且生成的任务标签与用户的相关性更强,提升了任务分派的匹配度。In the embodiment of the present application, the relationship between the user and the user, the relationship between the task and the task, and the relationship between the user and the task, and the relationship between the user and the task are learned through the relationship graph convolutional network. The administrator manually assigns tasks to users, which improves the efficiency of assigning tasks, and the generated task tags are more relevant to users, which improves the matching of task assignments.
附图说明Description of the drawings
图1是本申请技术方案在一个应用场景中的示例性系统构架示意图。Figure 1 is a schematic diagram of an exemplary system architecture of the technical solution of the present application in an application scenario.
图2是根据本申请实施例的一种根据关系图卷积网络的任务标签生成方法的流程图。Fig. 2 is a flowchart of a method for generating task labels based on a relational graph convolutional network according to an embodiment of the present application.
图3是本申请实施例中的用户图数据的示例示意图。Fig. 3 is a schematic diagram of an example of user graph data in an embodiment of the present application.
图4是本申请实施例中的任务图数据的示例示意图。FIG. 4 is a schematic diagram of an example of task graph data in an embodiment of the present application.
图5是本申请实施例中用户任务图数据的示例示意图。Fig. 5 is a schematic diagram of an example of user task graph data in an embodiment of the present application.
图6是本申请实施例中任务标签生成装置的组成框图。Fig. 6 is a block diagram of the composition of a task tag generating device in an embodiment of the present application.
图7是适于用来实现本申请实施例的电子设备的计算机系统的结构示意图。Fig. 7 is a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
本发明的实施方式Embodiments of the present invention
现在将参考附图更全面地描述示例实施方式。然而,示例实施方式能够以多种形式实施,且不应被理解为限于在此阐述的范例;相反,提供这些实施方式使得本申请将更加全面和完整,并将示例实施方式的构思全面地传达给本领域的技术人员。Example embodiments will now be described more fully with reference to the accompanying drawings. However, the example embodiments can be implemented in various forms, and should not be construed as being limited to the examples set forth herein; on the contrary, the provision of these embodiments makes this application more comprehensive and complete, and fully conveys the concept of the example embodiments To those skilled in the art.
此外,所描述的特征、结构或特性可以以任何合适的方式结合在一个或更多实施例中。在下面的描述中,提供许多具体细节从而给出对本申请的实施例的充分理解。然而,本领域技术人员将意识到,可以实践本申请的技术方案而没有特定细节中的一个或更多,或者可以采用其它的方法、组元、装置、步骤等。在其它情况下,不详细示出或描述公知方法、装置、实现或者操作以避免模糊本申请的各方面。In addition, the described features, structures, or characteristics may be combined in one or more embodiments in any suitable manner. In the following description, many specific details are provided to give a sufficient understanding of the embodiments of the present application. However, those skilled in the art will realize that the technical solutions of the present application can be practiced without one or more of the specific details, or other methods, components, devices, steps, etc. can be used. In other cases, well-known methods, devices, implementations or operations are not shown or described in detail in order to avoid obscuring various aspects of the present application.
附图中所示的方框图仅仅是功能实体,不一定必须与物理上独立的实体相对应。即,可以采用软件形式来实现这些功能实体,或在一个或多个硬件模块或集成电路中实现这些功能实体,或在不同网络和/或处理器装置和/或微控制器装置中实现这些功能实体。The block diagrams shown in the drawings are merely functional entities, and do not necessarily correspond to physically independent entities. That is, these functional entities can be implemented in the form of software, or implemented in one or more hardware modules or integrated circuits, or implemented in different networks and/or processor devices and/or microcontroller devices. entity.
附图中所示的流程图仅是示例性说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解,而有的操作/步骤可以合并或部分合并,因此实际执行的顺序有可能根据实际情况改变。The flowchart shown in the drawings is only an exemplary description, and does not necessarily include all contents and operations/steps, nor does it have to be performed in the described order. For example, some operations/steps can be decomposed, and some operations/steps can be combined or partially combined, so the actual execution order may be changed according to actual conditions.
图1是本申请技术方案在一个应用场景中的示例性系统构架示意图。如图1所示,在一示例性的实施例中,该实施环境可以包括:终端101和服务器102。终端101与服务器102上部署有标注系统。标注系统是一种为数据打标签的系统,标注系统产生的有效的标注数据为深度学习训练泛化能力强的模型提供源数据。标注系统中常见的标注任务有文本标注,语音标注,翻译标注,图像标注等。在标注系统中,服务器向各个客户端派发相应的标注任务,用户通过客户端完整该标注任务后发送回服务器。在一些实施例中,标注系统会根据用户的完成任务情况对用户进行打分,例如完成任务的速度和数量以及标注结果复核情况等。Figure 1 is a schematic diagram of an exemplary system architecture of the technical solution of the present application in an application scenario. As shown in FIG. 1, in an exemplary embodiment, the implementation environment may include: a terminal 101 and a server 102. A labeling system is deployed on the terminal 101 and the server 102. The labeling system is a system for labeling data. The effective labeling data generated by the labeling system provides source data for deep learning training models with strong generalization capabilities. Common labeling tasks in the labeling system include text labeling, voice labeling, translation labeling, image labeling and so on. In the labeling system, the server distributes the corresponding labeling task to each client, and the user completes the labeling task through the client and sends it back to the server. In some embodiments, the tagging system will score the user according to the task completion status of the user, such as the speed and number of tasks completed, and the review status of the tagging results.
其中,终端101与服务器102之间建立有线或者无线的通信连接,进而通过此通信连接实现与服务器102的数据传输。Wherein, a wired or wireless communication connection is established between the terminal 101 and the server 102, and data transmission with the server 102 is realized through this communication connection.
终端101中运行有客户端,该客户端可以提供一用户交互界面。用户交互界面上可以触发用户交互操作以便进行数据标注。示例性的,该客户端可以为一图像标注客户端,在进行图像标注的过程中,用户通过该客户端接收到分派的。A client terminal is running in the terminal 101, and the client terminal can provide a user interaction interface. User interaction can be triggered on the user interaction interface for data annotation. Exemplarily, the client may be an image labeling client, and in the process of image labeling, the user receives the distribution through the client.
需要说明的是,在本实施环境中,终端101可以是手机、平板电脑、笔记本电脑、台式电脑或者其它任意可供客户端运行的电子设备,本处不进行限制。客户端可以是应用程序客户端,也可以是网页客户端,本处也不进行限制。It should be noted that, in this implementation environment, the terminal 101 may be a mobile phone, a tablet computer, a notebook computer, a desktop computer, or any other electronic device that can be operated by the client, and there is no restriction here. The client can be an application client or a web client, and there are no restrictions here.
服务器102用于为客户端中触发的用户交互操作提供数据支持,从而使客户端可以在终端101中正常运行。仍以上述图像标注为例,服务器102可以根据客户端中触发的图像标注操作,向客户端发送需要标注的图像以及相应的备选任务标签,也可以从客户端接收图像标注的结果,以便为客户分派后续的标注任务内容。The server 102 is used to provide data support for user interaction operations triggered in the client, so that the client can run normally in the terminal 101. Still taking the above-mentioned image annotation as an example, the server 102 may send the image to be annotated and the corresponding candidate task tag to the client according to the image annotation operation triggered in the client, or may receive the result of the image annotation from the client to provide The customer assigns the content of the follow-up marking task.
其中,服务器102可以是单一的服务器设备,也可以是由多台服务器设备组成的服务器群组,本处不进行限定。The server 102 may be a single server device or a server group composed of multiple server devices, which is not limited here.
下面结合具体实施方式对本申请提供的技术方案做出详细说明。The technical solutions provided by this application will be described in detail below in conjunction with specific implementations.
图2是根据本申请实施例的一种根据关系图卷积网络的任务标签生成方法的流程图。该方法可以由图1所示的应用场景中的服务器103执行。如图2所示的,在一个实施例中,该方法可以包括如下步骤:Fig. 2 is a flowchart of a method for generating task labels based on a relational graph convolutional network according to an embodiment of the present application. This method can be executed by the server 103 in the application scenario shown in FIG. 1. As shown in Figure 2, in one embodiment, the method may include the following steps:
步骤S210. 根据用户数据以及任务数据,生成用户任务图数据,用户任务图数据包括至少一个用户节点以及至少一个任务节点,用户节点对应于用户数据中含有的用户信息,任务节点对应于任务数据中含有的任务信息。Step S210. Generate user task graph data according to the user data and task data. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data, and the task node corresponds to the task data. Contains task information.
用户任务图数据的主要表现形式为包含用户节点以及任务节点的无向加权图,其表达标注系统所涉及的用户之间、任务之间以及用户与任务之间的关联关系和关联程度。The main manifestation of user task graph data is an undirected weighted graph containing user nodes and task nodes, which express the association relationship and degree of association between users, tasks, and between users and tasks involved in the labeling system.
用户数据包括用户信息以及完成任务情况。用户信息包括例如年龄、履历、专业、专业技能以及外语水平等用户属性。完成任务情况包括用户所完成的任务的情况,主要包括任务的数量、类型以及时间等信息。在用户任务图数据中,用户节点中包含用户的用户信息,用户节点之间的边表示用户之间相对于所完成的任务的关联关系,边的权值越大,关联关系也越紧密。User data includes user information and completion of tasks. User information includes user attributes such as age, resume, profession, professional skills, and foreign language level. The task completion status includes the status of the tasks completed by the user, including information such as the number, type, and time of the task. In the user task graph data, user nodes contain user information of users, and the edges between user nodes represent the association relationship between users with respect to the completed tasks. The greater the weight of the edge, the closer the association relationship.
任务数据包括任务信息以及任务记录。任务信息主要包括任务类型以及任务内容。任务类型包括例如文本标注任务、语音标注任务,翻译标注任务,图像标注任务等。任务内容即相应任务的任务目标,例如,对于翻译标注任务,任务内容即原文文本以及译文文本、以及关于翻译标注任务本身的介绍信息等。在用户任务图数据中,任务节点中包括任务的任务信息,任务节点之间的边表示任务之间相对于完成任务的用户的关联关系,边的权值越大,关联关系也越紧密。The task data includes task information and task records. The task information mainly includes task type and task content. Task types include, for example, text annotation tasks, voice annotation tasks, translation annotation tasks, image annotation tasks, and so on. The task content is the task goal of the corresponding task. For example, for the translation and labeling task, the task content is the original text and the translated text, as well as the introduction information about the translation and labeling task itself. In the user task graph data, task nodes include task information about tasks, and the edges between task nodes represent the relationship between tasks relative to the user who completes the task. The greater the weight of the edge, the closer the relationship.
用户节点与任务节点之间的边表示用户与任务的关联关系,例如用户完成过的任务属于用户与任务之间的关联关系,边的权值越大,关联关系也越紧密。The edge between the user node and the task node represents the association relationship between the user and the task. For example, the task completed by the user belongs to the association relationship between the user and the task. The greater the weight of the edge, the closer the association relationship.
根据用户数据以及任务数据,可以确定用户之间、任务之间以及用户与任务的关联关系,根据这些关联关系,即可以生成用户任务图数据。According to user data and task data, the association relationships between users, tasks, and users and tasks can be determined. Based on these association relationships, user task graph data can be generated.
步骤S220. 将用户任务图数据输入关系图卷积网络中,获得关系图卷积网络针对用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,用户特征对应于用户任务图数据中含有的用户节点,任务特征对应于用户任务图数据中含有的任务节点。Step S220. Input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to the user task graph data contained The task feature corresponds to the task node contained in the user task graph data.
关系图卷积网络(Relational Graph Convolutional Network,RGCN)是建立在图卷积网络的基础上的在大规模关系数据上的一种扩展,其是将局部邻居信息进行聚合的图卷积网络。在RGCN中,定义了一种传播模型用于计算在一个关系多图中的节点或实体的前向更新:Relational graph convolutional network Graph Convolutional Network (RGCN) is an extension of large-scale relational data based on graph convolutional networks. It is a graph convolutional network that aggregates local neighbor information. In RGCN, a propagation model is defined to calculate the forward update of nodes or entities in a relational multi-graph:
其中,表示在关系r的节点i的邻居集的索引,是一个正则化常量,可以学习或者提取选取为,可以使用稀疏矩阵乘法避免对邻居进行显式求和。在RGCN中引入了关系与特征的转换,其依赖于边的类型和方向。转换可以采用线性消息转换,可以采用其他更灵活的函数,例如多层神经网络,然而同时会增大所需的计算量。Among them, the index of the neighbor set of node i in relation r is a regularization constant, which can be learned or extracted, and sparse matrix multiplication can be used to avoid explicit summation of neighbors. In RGCN, the conversion of relationship and feature is introduced, which depends on the type and direction of the edge. The conversion can be a linear message conversion, and other more flexible functions, such as a multilayer neural network, can be used, but at the same time it will increase the amount of calculation required.
在本申请中,关系图卷积网络根据用户任务图数据中用户信息、任务信息以用户和任务之间的关系,对于每个用户以及每个任务计算得到其相应的特征表示。该特征表示通常是一维向量,并且可以是稀疏特征向量,例如,one-hot向量。用户特征表示和任务特征表示为用户数据以及任务数据相关的表示。随后,通过RGCN来学习用户的嵌入(embedding)和任务的嵌入(embedding)。In this application, the relational graph convolutional network calculates the corresponding feature representation for each user and each task according to the user information and task information in the user task graph data and the relationship between the user and the task. The feature representation is usually a one-dimensional vector, and may be a sparse feature vector, for example, a one-hot vector. User feature representation and task feature representation are representations related to user data and task data. Then, learn user embedding (embedding) and task embedding (embedding) through RGCN.
RGCN对于用户任务图中每个节点的计算过程可以例如为:将任意节点作为中心节点,对中心节点进行一次卷积,通过聚合邻居节点的信息来更新中心节点的表示。其中邻居节点的聚合是按照边的类型进行分类,根据边类型的不同进行相应的转换,收集的信息经过一个正则化的加和,最后通过激活函数。其中每个顶点的信息更新共享参数,并行计算,同时也包括自连接,也就是说包括了节点自身表示。The calculation process of RGCN for each node in the user task graph can be, for example, taking any node as a central node, performing a convolution on the central node, and updating the representation of the central node by aggregating information of neighbor nodes. Among them, the aggregation of neighbor nodes is classified according to the edge type, and the corresponding conversion is performed according to the different edge types. The collected information undergoes a regularized summation, and finally passes through the activation function. The information of each vertex updates the shared parameters, parallel calculation, and also includes self-connection, that is to say, includes the node's own representation.
步骤S230. 根据用户特征以及任务特征,构建用户任务矩阵,用户任务矩阵中含有用户节点在至少一个任务标签上的概率分布,任务标签与任务节点一一对应。Step S230. Construct a user task matrix according to the user characteristics and the task characteristics. The user task matrix contains the probability distribution of the user nodes on at least one task label, and the task labels correspond to the task nodes one-to-one.
具体地,构建用户任务矩阵可以采用矩阵乘法的方式。若存在3个用户以及3种任务,则将会构建出3x3的用户任务矩阵,矩阵中的每个元素表示将所对应的用户对于所对应的任务更为擅长的概率。该用户任务矩阵经过softmax而进行归一化,例如,每个用户在所有任务上的概率分布的值为0至1之间的值并且总和为1。Specifically, the user task matrix can be constructed by matrix multiplication. If there are 3 users and 3 tasks, a 3x3 user task matrix will be constructed, and each element in the matrix represents the probability that the corresponding user will be better at the corresponding task. The user task matrix is normalized by softmax. For example, the value of the probability distribution of each user on all tasks is between 0 and 1, and the sum is 1.
步骤S240. 根据用户任务矩阵中含有的用户节点在至少一个任务标签上的概率分布,生成用户节点所对应用户的目标任务标签。Step S240. According to the probability distribution of the user nodes contained in the user task matrix on at least one task label, generate the target task label of the user corresponding to the user node.
根据用户任务矩阵可以确定每个用户对于所有任务标签的概率分布,根据预定的任务分配原则,即可以生成每个用户对应的任务标签。例如,若为每个用户分配预定数量的任务,则可以按概率量任务标签进行降序排列并按照预定数量取任务标签作为结果,或者,可取概率大于预定阈值的所有任务标签作为结果,或者,对于每个用户,可以选取针对于该用户的最大概率所对应的任务作为任务标签。According to the user task matrix, the probability distribution of each user for all task labels can be determined, and the task label corresponding to each user can be generated according to the predetermined task allocation principle. For example, if a predetermined number of tasks are assigned to each user, the task labels can be sorted in descending order according to the probability quantity and the task labels can be taken as the result according to the predetermined number, or all task labels with probabilities greater than a predetermined threshold can be taken as the result, or for Each user can select the task corresponding to the highest probability for the user as the task label.
在确定用户的任务标签后,服务器可以向用户派发与任务标签相对应的任务。After determining the task tag of the user, the server can dispatch the task corresponding to the task tag to the user.
在本申请的实施例中,通过关系图卷积网络学习用户和用户的相关性,任务和任务的相关性,以及用户和任务的相关性,能够生成与用户相关性强的任务标签,从而避免了由管理员手动向用户分派任务,提升了分派任务的效率,并且生成的任务标签与用户的相关性更强,提升了任务分派的匹配度。In the embodiment of the present application, the relationship between the user and the user, the relationship between the task and the task, and the relationship between the user and the task, and the relationship between the user and the task are learned through the relationship graph convolutional network. The administrator manually assigns tasks to users, which improves the efficiency of assigning tasks, and the generated task tags are more relevant to users, which improves the matching of task assignments.
在本申请的一些实施例中,在以上实施例的基础上,在上述步骤S210. 根据用户数据以及任务数据,生成用户任务图数据之前,该方法还可以包括以下步骤:In some embodiments of the present application, on the basis of the above embodiments, in the above step S210. Before generating user task graph data according to the user data and task data, the method may further include the following steps:
步骤S201. 根据历史任务数据以及历史用户数据,生成历史用户任务图数据,历史用户数据对应于多个已知的任务标签,已知的任务标签与历史任务信息一一对应;Step S201. Generate historical user task graph data based on historical task data and historical user data. The historical user data corresponds to multiple known task tags, and the known task tags correspond to the historical task information in a one-to-one manner;
步骤S202. 根据历史用户任务图数据以及已知的任务标签,对关系图卷积网络进行训练。Step S202. Training the relational graph convolutional network based on historical user task graph data and known task labels.
其中,历史用户数据与任务标签的对应关系可以根据历史任务数据中的任务记录来确定。例如,历史用户数据中所包含的某个用户,统计该用户所完成的任务中出现次数最多的任务标签,作为其对应的任务标签。历史用户任务数据所包含的内容与上述的用户任务图相同次数不在赘述。通常,历史用户数据以及历史任务数据可以直接采用标注系统中所保存的历史任务记录信息。Among them, the correspondence between historical user data and task tags can be determined according to the task records in the historical task data. For example, for a certain user included in historical user data, the task label that appears most frequently among the tasks completed by the user is counted as its corresponding task label. The content contained in the historical user task data is the same as the above-mentioned user task diagram, and will not be repeated here. Generally, historical user data and historical task data can directly use the historical task record information stored in the annotation system.
基于历史用户任务图数据以及已知的任务标签,可以对关系图卷积网络进行训练。具体地,在训练过程中,将历史用户任务图数据以及已知的任务标签输入关系图卷积网络得到用户特征表示以及任务特征表示。根据所得到的结果以及历史用户数据与已知的任务标签的对应关系,可以确定损失函数的求解。损失函数可以采用交叉熵等各类分类损失函数。通过调节参数来使损失函数的求解最小化以便确定关系图卷积网络中各个参数的值。Based on historical user task graph data and known task labels, the relation graph convolutional network can be trained. Specifically, in the training process, historical user task graph data and known task labels are input into the relationship graph convolutional network to obtain a user feature representation and a task feature representation. According to the obtained results and the correspondence between historical user data and known task tags, the solution of the loss function can be determined. The loss function can use various classification loss functions such as cross entropy. The solution of the loss function is minimized by adjusting the parameters in order to determine the value of each parameter in the graph convolutional network.
在本申请的实施例中,构建历史用户任务图数据并利用历史用户任务图数据训练关系图卷积网络,可以获得符合标注系统的任务情况以及用户情况的特定模型,有利于提高关系图卷积网络输出结果的准确性,进而提高任务分派的准确性。In the embodiment of the present application, building historical user task graph data and using the historical user task graph data to train the relational graph convolutional network can obtain a specific model that conforms to the task situation of the labeling system and the user's situation, which is beneficial to improve the relational graph convolution. The accuracy of network output results, thereby improving the accuracy of task assignment.
在本申请的一些实施例中,在以上实施例的基础上,上述步骤S210.根据用户数据以及任务数据,生成用户任务图数据,包括:In some embodiments of the present application, on the basis of the above embodiments, the above step S210. Generate user task graph data according to user data and task data, including:
步骤S211. 根据用户数据生成用户图数据,用户图数据包括至少一个用户节点,以及根据任务数据生成任务图数据,任务图数据包括至少一个任务节点;Step S211. Generate user graph data according to the user data, the user graph data includes at least one user node, and generate task graph data according to the task data, the task graph data includes at least one task node;
步骤S212. 对用户图数据和任务图数据进行合并处理,得到用户任务图数据。Step S212. The user map data and the task map data are merged to obtain the user task map data.
具体地,用户数据图是以用户信息为节点,以用户之间相对于任务的协同关系为边的无向加权图。该用户图数据中包括至少一个用户节点。任务图是以任务信息为节点,以任务之间相对于用户的关联关系为边的无向加权图。任务图数据中包括至少一个任务节点。Specifically, the user data graph is an undirected weighted graph with user information as a node and a collaborative relationship between users relative to a task as an edge. The user graph data includes at least one user node. The task graph is an undirected weighted graph with task information as nodes and the relationship between tasks relative to users as edges. The task graph data includes at least one task node.
在生成用户图数据以及任务图数据后,将二者进行合并,可以得到用户任务图数据。具体地,在保持用户图数据以及任务图数据中的节点以及边的关系不变的情况下,根据用户完成任务的情况在任务节点与用户节点之间添加边,以用于表示用户与任务之间的关联关系。遍历用户图数据中的每个用户节点,确定其与任务图数据中的各个任务节点的关联关系,并且根据关联悬关系建立相应的边,得到用户任务图数据。After the user map data and the task map data are generated, merge the two to obtain the user task map data. Specifically, while keeping the relationship between the nodes and edges in the user graph data and the task graph data unchanged, an edge is added between the task node and the user node according to the user's completion of the task to indicate the relationship between the user and the task. The relationship between them. Traverse each user node in the user graph data, determine its association relationship with each task node in the task graph data, and establish corresponding edges according to the association dangling relationship to obtain the user task graph data.
在本申请的实施例中,通过用户图数据和任务图数据合并得到用户任务图,在保留用户图数据和任务图数据的原有的数据的情况下,添加用户与任务之间的关系,保持的数据的完整性,有利于提升图卷积网络的准确性。In the embodiment of the present application, the user task graph is obtained by merging the user graph data and the task graph data. While retaining the original data of the user graph data and the task graph data, the relationship between the user and the task is added to maintain The integrity of the data is conducive to improving the accuracy of the graph convolutional network.
在本申请的一些实施例中,在以上实施例的基础上,在步骤S211中的根据用户数据生成用户图数据,包括:In some embodiments of the present application, on the basis of the above embodiments, generating user graph data according to user data in step S211 includes:
步骤S2111. 获取用户数据中含有的用户信息,以及各个用户信息所分别对应的任务记录信息;Step S2111. Obtain the user information contained in the user data and the task record information corresponding to each user information;
步骤S2112. 根据各个用户信息所分别对应的任务记录信息,确定各个用户信息对应的用户之间所完成的相同任务;Step S2112. According to the task record information corresponding to each user information, determine the same task completed between users corresponding to each user information;
步骤S2113. 根据用户信息生成用户图数据中的用户节点,并根据用户信息对应的用户之间所完成的相同任务生成用户节点之间的边,以得到用户图数据。Step S2113. Generate user nodes in the user graph data according to the user information, and generate edges between the user nodes according to the same tasks completed between users corresponding to the user information to obtain the user graph data.
其中,在一个实施例中,任务记录信息主要包括用户所完成的各个任务的任务类型。相同任务指示任务类型相同的任务。根据任务类型,可以确定两个用户完成相同类型的任务的数量。在该实施例中,用户图数据中的用户之间的边表示两个用户所完成的相同类型的任务的数量。Among them, in an embodiment, the task record information mainly includes the task type of each task completed by the user. The same task indicates tasks of the same task type. According to the task type, the number of tasks of the same type completed by two users can be determined. In this embodiment, the edges between users in the user graph data represent the number of tasks of the same type completed by two users.
具体地,请参阅图3,图3是本申请实施例中的用户图数据的示例示意图。如图3所示,在该示例中,用户A与用户B之间存在边,即表示用户A与用户B完成过相同类型的任务,并且边的权重值表示相同类型的任务的数量。例如,若用户A完成过30个语音标注任务,而用户B完成过15个语音标注任务,则边的权重值为15。又例如,若用户A完成过7个语音标注任务和8个图像标注任务,而用户B也完成过7个语音标注任务和8个图像标注任务,则边的权重值同样为15,即,边的权重值包括所有类型相同的任务的数量。Specifically, please refer to FIG. 3, which is a schematic diagram of an example of user graph data in an embodiment of the present application. As shown in Figure 3, in this example, there is an edge between user A and user B, which means that user A and user B have completed the same type of task, and the weight value of the edge represents the number of tasks of the same type. For example, if user A has completed 30 voice labeling tasks, and user B has completed 15 voice labeling tasks, the weight value of the edge is 15. For another example, if user A has completed 7 voice labeling tasks and 8 image labeling tasks, and user B has also completed 7 voice labeling tasks and 8 image labeling tasks, the weight value of the edge is also 15, that is, edge The weight value includes the number of all tasks of the same type.
在另一个实施例中,任务记录信息包括用户所完成的各个任务的任务标识。相同任务指示任务标识相同的任务,即完全相同的任务。相应地,用户图数据中的用户之间的边表示两个用户所完成的任务标识相同的任务数量。例如,若用户B完成了4个图片标注任务,用户C完成了5个图片标注任务并且其中3个任务中的图片与用户B所完成的标注任务相同,则用户B与用户C之间存在边,并且边的权重值为3。In another embodiment, the task record information includes task identification of each task completed by the user. The same task indicates tasks with the same task ID, that is, identical tasks. Correspondingly, the edges between users in the user graph data represent the number of tasks completed by two users with the same task identifier. For example, if user B has completed 4 image labeling tasks, user C has completed 5 image labeling tasks and the pictures in 3 tasks are the same as the labeling task completed by user B, then there is a margin between user B and user C , And the weight value of the edge is 3.
在本实施例中,根据用户信息和各个用户信息对应的用户之间所完成的相同任务建立用户图数据,充分体现多个用户相对于任务的协同关系,有利于在相同任务的派发多个用户时的派发合理性。In this embodiment, the user map data is established based on the same task completed by the user information and the users corresponding to each user information, which fully reflects the collaborative relationship between multiple users with respect to the task, and is beneficial to dispatching multiple users on the same task. The rationality of distribution at the time.
在本申请的一些实施例中,在以上实施例的基础上,在步骤S211中的根据任务数据生成任务图数据,包括:In some embodiments of the present application, on the basis of the above embodiments, generating task graph data according to the task data in step S211 includes:
步骤S2114. 获取任务数据中含有的任务信息,以及各个用户信息所分别对应的任务历史数据;Step S2114. Obtain the task information contained in the task data and the task history data corresponding to each user information;
步骤S2115. 根据各个用户信息所分别对应的任务历史数据,确定由相同用户所执行的各个任务信息;Step S2115. Determine each task information performed by the same user according to the task history data corresponding to each user information;
步骤S2116. 根据任务信息生成任务图数据中的任务节点,并根据由相同用户所执行的各个任务信息生成任务节点之间的边,以得到任务图数据。Step S2116. The task nodes in the task graph data are generated according to the task information, and the edges between the task nodes are generated according to the information of each task performed by the same user to obtain the task graph data.
其中,在一个实施例中,任务历史数据主要包括完成该任务的用户信息以及任务完成时间。在确定由相同用户所执行的各个任务信息时,可以设定时间阈值,两个任务由同一个用户完成的时间差在时间阈值内时,才认为两个任务之间相对于该用户存在关联。在该实施例中,任务图数据中的任务之间的边表示完成过着两个任务的用户的数量。Among them, in one embodiment, the task history data mainly includes the information of the user who completed the task and the task completion time. When determining the information of each task performed by the same user, a time threshold can be set. When the time difference between two tasks completed by the same user is within the time threshold, the two tasks are considered to be related to the user. In this embodiment, the edges between tasks in the task graph data represent the number of users who have completed two tasks.
具体地,请参阅图4,图4是本申请实施例中的任务图数据的示例示意图。如图4所示,在该示例中,任务1与任务2之间存在边,即表示任务1与任务2由相同的用户完成,并且边的权重值表示完成过任务1和任务2的用户的数量。例如,若40个用户完成过任务1,并且其中的30个用户完成过任务2,则边的权重值为30。Specifically, please refer to FIG. 4, which is a schematic diagram of an example of task graph data in an embodiment of the present application. As shown in Figure 4, in this example, there is an edge between task 1 and task 2, which means that task 1 and task 2 are completed by the same user, and the weight value of the edge indicates that the user who has completed task 1 and task 2 quantity. For example, if 40 users have completed task 1, and 30 of them have completed task 2, the weight value of the edge is 30.
在本实施例中,根据任务信息和由相同用户所执行的各个任务信息建立用户图数据,充分体现多个任务相对于用户的关联关系,在派发任务时针对用户的特型和技能进行派发,从而用户完成任务的效率。In this embodiment, the user graph data is established based on the task information and the information of each task performed by the same user, which fully reflects the association relationship between multiple tasks with respect to the user, and distributes tasks based on the user’s characteristics and skills. So that the user completes the task efficiency.
在本申请的一些实施例中,在以上实施例的基础上,上述步骤S212. 对用户图数据和任务图数据进行合并处理,得到用户任务图数据,包括:In some embodiments of the present application, on the basis of the above embodiments, the above step S212. The user map data and the task map data are merged to obtain the user task map data, including:
步骤S2121. 获取用户图数据中的用户节点所对应的任务记录信息,以及获取任务图数据中的任务节点所对应的任务历史数据;Step S2121. Obtain the task record information corresponding to the user node in the user graph data, and obtain the task history data corresponding to the task node in the task graph data;
步骤S2122. 根据任务记录信息以及任务历史数据,确定用户节点与任务节点之间的关联关系,关联关系指示用户节点对应的用户完成任务节点对应的任务;Step S2122. Determine the association relationship between the user node and the task node according to the task record information and the task history data, and the association relationship indicates that the user corresponding to the user node completes the task corresponding to the task node;
步骤S2123. 根据用户节点与任务节点之间的关联关系,在用户图数据和任务图数据之间构建用户节点与任务节点之间的边,以得到用户任务图数据。Step S2123. According to the association relationship between the user node and the task node, an edge between the user node and the task node is constructed between the user graph data and the task graph data to obtain the user task graph data.
具体地,根据任务记录信息以及任务历史数据,可以具体确定每个用户所完成的具体任务,以及完成每个具体任务的次数。根据所确定的各个用户完成任务的情况,可以确定用户节点与任务节点之间的关联关系,并且对于存在该关联关系的两个节点,在用户图数据和任务图数据的基础上,可以构建用户节点与任务节点之间的边。该边表示用户完成对应任务的情况,并且边的权重表示用户完成该任务的次数。Specifically, according to the task record information and the task history data, the specific tasks completed by each user and the number of times each specific task have been completed can be specifically determined. According to the determined situation of each user completing the task, the association relationship between the user node and the task node can be determined, and for the two nodes that have the association relationship, based on the user graph data and the task graph data, the user can be constructed The edge between the node and the task node. The edge represents the situation where the user completes the corresponding task, and the weight of the edge represents the number of times the user completes the task.
为了便于介绍,请参阅图5并结合图3以及图4,图5是本申请实施例中用户任务图数据的示例示意图。假定任务1表示翻译标注任务,若用户A完成过10个不同的翻译标注任务,则相应地,在用户任务途中,用户A与任务1之间存在边,并且其权重为10。For ease of introduction, please refer to FIG. 5 in conjunction with FIG. 3 and FIG. 4. FIG. 5 is a schematic diagram of an example of user task graph data in an embodiment of the present application. Assuming that task 1 represents a translation and annotation task, if user A has completed 10 different translation and annotation tasks, correspondingly, there is an edge between user A and task 1 on the way of the user task, and its weight is 10.
在另一个实施例中,任务1表示一个具体翻译标注任务,例如对于某篇文章的翻译进行标注,而任务2表示另一个具体翻译标注任务。用户A将任务1完成了10次,而用户B将任务2完成了30次,则在用户关系图中,用户A与任务1之间存在边,其权重为10,并且用户B与任务2之间存在边,其权重值为30。In another embodiment, task 1 represents a specific translation and annotation task, for example, annotating the translation of a certain article, and task 2 represents another specific translation and annotation task. User A completes task 1 10 times, and user B completes task 2 30 times. In the user relationship graph, there is an edge between user A and task 1, with a weight of 10, and user B and task 2 There is an edge between, and its weight value is 30.
在本实施例中,通过用户完成任务的情况,基于用户图数据以及任务图数据,生成用户任务图,有利于在关系图卷积网络的计算过程中充分考虑用户与任务之间的关系,提升任务与用户之间的关联程度,进而使得计算得到用户和任务的特征表示更准确。In this embodiment, the user task graph is generated based on the user graph data and task graph data based on the user completing the task, which is beneficial to fully consider the relationship between the user and the task in the calculation process of the relation graph convolutional network, and improve The degree of association between the task and the user in turn makes the calculation of the characteristic representation of the user and the task more accurate.
在本申请的一些实施例中,在以上实施例的基础上,上述步骤S240. 根据用户任务矩阵中含有的用户节点在至少一个任务标签上的概率分布,生成用户节点所对应用户的目标任务标签,包括:In some embodiments of the present application, on the basis of the above embodiments, the above step S240. According to the probability distribution of the user node contained in the user task matrix on at least one task label, generate the target task label of the user corresponding to the user node ,include:
步骤S241. 根据用户任务矩阵,确定用户节点对应的最大概率;Step S241. Determine the maximum probability corresponding to the user node according to the user task matrix;
步骤S242. 若最大概率大于预设的概率分布阈值,则将最大概率对应的任务标签作为用户节点所对应用户的目标任务标签。Step S242. If the maximum probability is greater than the preset probability distribution threshold, the task label corresponding to the maximum probability is used as the target task label of the user corresponding to the user node.
具体地,用户任务矩阵中包含各个任务对于每个用户的分派概率。若用户任务矩阵包括3个用户以及3种任务,对于其中一个用户,存在一个3维向量,并且向量中的每个特征值表示对应任务分配给该用户的概率。例如,3种任务分别为文字标注、语音标注和图像标注,对于用户X,三维向量可以是{0.7,0.2,0.1}。Specifically, the user task matrix contains the assignment probability of each task for each user. If the user task matrix includes 3 users and 3 types of tasks, for one user, there is a 3-dimensional vector, and each feature value in the vector represents the probability that the corresponding task is assigned to the user. For example, the three tasks are text annotation, voice annotation, and image annotation. For user X, the three-dimensional vector can be {0.7, 0.2, 0.1}.
在针对于特定用户的向量中,可以确定其中的最大概率,并且若最大概率大于概率分布阈值,则表示该用户与相对应的任务之间的关联关系更加紧密,则可以将该任务标签作为该用户的任务标签。例如,对于上述的用户X,最大概率为0.7,若概率分布阈值为0.5,则可以确定文字标注为该用户的任务标签。In the vector for a specific user, the maximum probability can be determined, and if the maximum probability is greater than the probability distribution threshold, it means that the user and the corresponding task are more closely related, and the task label can be used as the The user's task label. For example, for the aforementioned user X, the maximum probability is 0.7, and if the probability distribution threshold is 0.5, it can be determined that the text is labeled as the task tag of the user.
在本实施例中,通过将用户节点的最大概率与概率分布阈值比较来确定用户的任务标签,有利于避免在用户本身的倾向不明显时强行限制用户的任务种类,有利于提高任务分派的准确性。In this embodiment, the user's task label is determined by comparing the maximum probability of the user node with the probability distribution threshold, which helps to avoid forcibly restricting the user's task type when the user's own tendency is not obvious, and helps to improve the accuracy of task assignment. sex.
应当注意,尽管在附图中以特定顺序描述了本申请中方法的各个步骤,但是,这并非要求或者暗示必须按照该特定顺序来执行这些步骤,或是必须执行全部所示的步骤才能实现期望的结果。附加的或备选的,可以省略某些步骤,将多个步骤合并为一个步骤执行,以及/或者将一个步骤分解为多个步骤执行等。It should be noted that although the various steps of the method in this application are described in a specific order in the drawings, this does not require or imply that these steps must be performed in the specific order, or that all the steps shown must be performed to achieve the expectation. the result of. Additionally or alternatively, some steps may be omitted, multiple steps may be combined into one step for execution, and/or one step may be decomposed into multiple steps for execution, etc.
以下介绍本申请的装置实施,可以用于执行本申请上述实施例中的根据关系图卷积网络的任务标签生成方法。图6是本申请实施例中任务标签生成装置的组成框图。如图6所示,任务标签生成装置300主要可以包括:The following describes the implementation of the device of the present application, which can be used to execute the task label generation method based on the relational graph convolutional network in the foregoing embodiment of the present application. Fig. 6 is a block diagram of the composition of a task tag generating device in an embodiment of the present application. As shown in FIG. 6, the task tag generating apparatus 300 may mainly include:
用户任务图数据模块310,用于根据用户数据以及任务数据,生成用户任务图数据,用户任务图数据包括至少一个用户节点以及至少一个任务节点,用户节点对应于用户数据中含有的用户信息,任务节点对应于任务数据中含有的任务信息;The user task graph data module 310 is used to generate user task graph data according to user data and task data. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data. The node corresponds to the task information contained in the task data;
特征表示输出模块320,用于将用户任务图数据输入关系图卷积网络中,获得关系图卷积网络针对用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,用户特征对应于用户任务图数据中含有的用户节点,任务特征对应于用户任务图数据中含有的任务节点;The feature representation output module 320 is configured to input user task graph data into the relationship graph convolutional network to obtain at least one user feature output by the relationship graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to the user The user nodes contained in the task graph data, and the task characteristics correspond to the task nodes contained in the user task graph data;
用户任务矩阵构建模块330,用于根据用户特征以及任务特征,构建用户任务矩阵,用户任务矩阵中含有用户节点在至少一个任务标签上的概率分布,任务标签与任务节点一一对应;The user task matrix construction module 330 is configured to construct a user task matrix according to user characteristics and task characteristics. The user task matrix contains the probability distribution of user nodes on at least one task label, and the task labels correspond to the task nodes one-to-one;
任务标签生成模块340,用于根据用户任务矩阵中含有的用户节点在至少一个任务标签上的概率分布,生成用户节点所对应用户的目标任务标签。The task label generating module 340 is configured to generate the target task label of the user corresponding to the user node according to the probability distribution of the user node contained in the user task matrix on at least one task label.
在本申请的一些实施例中,根据以上技术方案,任务标签生成装置300还包括:In some embodiments of the present application, according to the above technical solutions, the task tag generating device 300 further includes:
历史用户任务图数据生成模块,用于根据历史任务数据以及历史用户数据,生成历史用户任务图数据,历史用户数据对应于多个已知的任务标签,已知的任务标签与历史任务信息一一对应;The historical user task graph data generation module is used to generate historical user task graph data based on historical task data and historical user data. The historical user data corresponds to multiple known task tags, and the known task tags and historical task information are one by one correspond;
关系图卷积网络进行训练模块,用于根据历史用户任务图数据以及已知的任务标签,对关系图卷积网络进行训练。The relational graph convolutional network training module is used to train the relational graph convolutional network based on historical user task graph data and known task labels.
在本申请的一些实施例中,根据以上技术方案,用户任务图数据模块310可以包括:In some embodiments of the present application, according to the above technical solutions, the user task graph data module 310 may include:
用户图数据生成单元,用于根据用户数据生成用户图数据,用户图数据包括至少一个用户节点,以及根据任务数据生成任务图数据,任务图数据包括至少一个任务节点;The user graph data generating unit is configured to generate user graph data according to user data, the user graph data includes at least one user node, and the task graph data is generated according to task data, and the task graph data includes at least one task node;
合并处理单元,用于对用户图数据和任务图数据进行合并处理,得到用户任务图数据。The merging processing unit is used for merging the user map data and the task map data to obtain the user task map data.
在本申请的一些实施例中,根据以上技术方案,用户图数据生成单元可以包括:In some embodiments of the present application, according to the above technical solutions, the user graph data generating unit may include:
用户信息获取子单元,用于获取用户数据中含有的用户信息,以及各个用户信息所分别对应的任务记录信息;The user information acquisition subunit is used to acquire the user information contained in the user data and the task record information corresponding to each user information;
相同任务确定子单元,用于根据各个用户信息所分别对应的任务记录信息,确定各个用户信息对应的用户之间所完成的相同任务;The same task determination subunit is used to determine the same task completed between users corresponding to each user information according to the task record information corresponding to each user information respectively;
用户节点生成子单元,用于根据用户信息生成用户图数据中的用户节点,并根据用户信息对应的用户之间所完成的相同任务生成用户节点之间的边,以得到用户图数据。The user node generating subunit is used to generate user nodes in the user graph data according to the user information, and generate edges between the user nodes according to the same tasks completed between users corresponding to the user information to obtain the user graph data.
在本申请的一些实施例中,根据以上技术方案,用户任务图数据模块310可以包括:In some embodiments of the present application, according to the above technical solutions, the user task graph data module 310 may include:
任务信息获取子单元,用于获取任务数据中含有的任务信息,以及各个用户信息所分别对应的任务历史数据;The task information acquisition subunit is used to acquire the task information contained in the task data and the task history data corresponding to each user information;
相同用户确定子单元,用于根据各个用户信息所分别对应的任务历史数据,确定由相同用户所执行的各个任务信息;The same user determination subunit is used to determine each task information performed by the same user according to the task history data corresponding to each user information;
任务节点生成子单元,用于根据任务信息生成任务图数据中的任务节点,并根据由相同用户所执行的各个任务信息生成任务节点之间的边,以得到任务图数据。The task node generating subunit is used to generate task nodes in the task graph data according to task information, and generate edges between task nodes according to the information of each task performed by the same user to obtain task graph data.
在本申请的一些实施例中,根据以上技术方案,合并处理单元可以包括:In some embodiments of the present application, according to the above technical solutions, the merging processing unit may include:
任务记录信息获取子单元,用于获取用户图数据中的用户节点所对应的任务记录信息,以及获取任务图数据中的任务节点所对应的任务历史数据;The task record information obtaining subunit is used to obtain the task record information corresponding to the user node in the user graph data, and obtain the task history data corresponding to the task node in the task graph data;
关联关系确定子单元,用于根据任务记录信息以及任务历史数据,确定用户节点与任务节点之间的关联关系,关联关系指示用户节点对应的用户完成任务节点对应的任务;The association relationship determination subunit is used to determine the association relationship between the user node and the task node according to the task record information and the task history data, and the association relationship instructs the user corresponding to the user node to complete the task corresponding to the task node;
用户任务图数据构建子单元,用于根据用户节点与任务节点之间的关联关系,在用户图数据和任务图数据之间构建用户节点与任务节点之间的边,以得到用户任务图数据。The user task graph data construction subunit is used to construct an edge between the user node and the task node between the user graph data and the task graph data according to the association relationship between the user node and the task node to obtain the user task graph data.
在本申请的一些实施例中,根据以上技术方案,任务标签生成模块340可以包括:In some embodiments of the present application, according to the above technical solutions, the task tag generation module 340 may include:
最大概率确定单元,用于根据用户任务矩阵,确定用户节点对应的最大概率;The maximum probability determining unit is used to determine the maximum probability corresponding to the user node according to the user task matrix;
任务标签确定单元,用于若最大概率大于预设的概率分布阈值,则将最大概率对应的任务标签作为用户节点所对应用户的目标任务标签。The task label determining unit is configured to, if the maximum probability is greater than the preset probability distribution threshold, use the task label corresponding to the maximum probability as the target task label of the user corresponding to the user node.
需要说明的是,上述实施例所提供的装置与上述实施例所提供的方法属于同一构思,其中各个模块执行操作的具体方式已经在方法实施例中进行了详细描述,此处不再赘述。It should be noted that the device provided in the foregoing embodiment and the method provided in the foregoing embodiment belong to the same concept, and the specific manners for performing operations of each module have been described in detail in the method embodiment, and will not be repeated here.
图7是适于用来实现本申请实施例的电子设备的计算机系统的结构示意图。Fig. 7 is a schematic structural diagram of a computer system suitable for implementing an electronic device according to an embodiment of the present application.
需要说明的是,图7示出的电子设备的计算机系统400仅是一个示例,不应对本申请实施例的功能和使用范围带来任何限制。It should be noted that the computer system 400 of the electronic device shown in FIG. 7 is only an example, and should not bring any limitation to the functions and scope of use of the embodiments of the present application.
如图7所示,计算机系统400包括中央处理单元(Central Processing Unit,CPU)401,其可以根据存储在只读存储器(Read-Only Memory,ROM)402中的程序或者从储存部分408加载到随机访问存储器(Random Access Memory,RAM)403中的程序而执行各种适当的动作和处理。在RAM 403中,还存储有系统操作所需的各种程序和数据。CPU 401、ROM 402以及RAM 403通过总线404彼此相连。输入/输出(Input /Output,I/O)接口405也连接至总线404。As shown in FIG. 7, the computer system 400 includes a central processing unit (Central Processing Unit (CPU) 401, which can execute according to a program stored in a read-only memory (Read-Only Memory, ROM) 402 or a program loaded from the storage part 408 to a random access memory (Random Access Memory, RAM) 403 Various appropriate actions and processing. In RAM 403, various programs and data required for system operation are also stored. The CPU 401, the ROM 402, and the RAM 403 are connected to each other through a bus 404. An input/output (Input/Output, I/O) interface 405 is also connected to the bus 404.
以下部件连接至I/O接口405:包括键盘、鼠标等的输入部分406;包括诸如阴极射线管(Cathode Ray Tube,CRT)、液晶显示器(Liquid Crystal Display,LCD)等以及扬声器等的输出部分407;包括硬盘等的储存部分408;以及包括诸如LAN(Local Area Network,局域网)卡、调制解调器等的网络接口卡的通信部分409。通信部分409经由诸如因特网的网络执行通信处理。驱动器410也根据需要连接至I/O接口405。可拆卸介质411,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器410上,以便于从其上读出的计算机程序根据需要被安装入储存部分408。The following components are connected to the I/O interface 405: an input part 406 including a keyboard, a mouse, etc.; an output part 407 including a cathode ray tube (Cathode Ray Tube, CRT), a liquid crystal display (LCD), etc., and speakers 407 ; A storage part 408 including a hard disk, etc.; and a communication part 409 including a network interface card such as a LAN (Local Area Network) card, a modem, and the like. The communication section 409 performs communication processing via a network such as the Internet. The driver 410 is also connected to the I/O interface 405 as needed. A removable medium 411, such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc., is installed on the drive 410 as required, so that the computer program read therefrom is installed into the storage part 408 as required.
特别地,根据本申请的实施例,各个方法流程图中所描述的过程可以被实现为计算机软件程序。例如,本申请的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信部分409从网络上被下载和安装,和/或从可拆卸介质411被安装。在该计算机程序被中央处理单元(CPU)401执行时,执行本申请的系统中限定的各种功能。In particular, according to the embodiments of the present application, the processes described in the flowcharts of the various methods may be implemented as computer software programs. For example, the embodiments of the present application include a computer program product, which includes a computer program carried on a computer-readable medium, and the computer program contains program code for executing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network through the communication part 409, and/or installed from the removable medium 411. When the computer program is executed by the central processing unit (CPU) 401, various functions defined in the system of the present application are executed.
需要说明的是,本申请实施例所示的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合,计算机可读存储介质可以是非易失性,也可以是易失性。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(Erasable Programmable Read Only Memory,EPROM)、闪存、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本申请中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本申请中,计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:无线、有线等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium shown in the embodiments of the present application may be a computer-readable signal medium or a computer-readable storage medium or any combination of the two. The computer-readable storage medium may be non-volatile or Can be volatile. The computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or a combination of any of the above. More specific examples of computer-readable storage media may include, but are not limited to: electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable Erasable Programmable Read Only Memory (EPROM), flash memory, optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the above. In this application, the computer-readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in combination with an instruction execution system, apparatus, or device. In this application, a computer-readable signal medium may include a data signal propagated in a baseband or as a part of a carrier wave, and a computer-readable program code is carried therein. This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. The computer-readable signal medium may also be any computer-readable medium other than the computer-readable storage medium, and the computer-readable medium may send, propagate, or transmit the program for use by or in combination with the instruction execution system, apparatus, or device . The program code contained on the computer-readable medium can be transmitted by any suitable medium, including but not limited to: wireless, wired, etc., or any suitable combination of the foregoing.
附图中的流程图和框图,图示了按照本申请各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,上述模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图或流程图中的每个方框、以及框图或流程图中的方框的组合,可以用执行规定的功能或操作的专用的根据硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the accompanying drawings illustrate the possible implementation of the system architecture, functions, and operations of the system, method, and computer program product according to various embodiments of the present application. In this regard, each block in the flowchart or block diagram may represent a module, program segment, or part of the code, and the above-mentioned module, program segment, or part of the code contains one or more for realizing the specified logic function. Executable instructions. It should also be noted that, in some alternative implementations, the functions marked in the block may also occur in a different order from the order marked in the drawings. For example, two blocks shown one after another can actually be executed substantially in parallel, and they can sometimes be executed in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagram or flowchart, and a combination of blocks in the block diagram or flowchart, can be implemented by a dedicated hardware-based system that performs the specified function or operation, or can be implemented by It is realized by a combination of dedicated hardware and computer instructions.
应当注意,尽管在上文详细描述中提及了用于动作执行的设备的若干模块或者单元,但是这种划分并非强制性的。实际上,根据本申请的实施方式,上文描述的两个或更多模块或者单元的特征和功能可以在一个模块或者单元中具体化。反之,上文描述的一个模块或者单元的特征和功能可以进一步划分为由多个模块或者单元来具体化。It should be noted that although several modules or units of the device for action execution are mentioned in the above detailed description, this division is not mandatory. In fact, according to the embodiments of the present application, the features and functions of two or more modules or units described above may be embodied in one module or unit. Conversely, the features and functions of a module or unit described above can be further divided into multiple modules or units to be embodied.
通过以上的实施方式的描述,本领域的技术人员易于理解,这里描述的示例实施方式可以通过软件实现,也可以通过软件结合必要的硬件的方式来实现。因此,根据本申请实施方式的技术方案可以以软件产品的形式体现出来,该软件产品可以存储在一个非易失性存储介质(可以是CD-ROM,U盘,移动硬盘等)中或网络上,包括若干指令以使得一台计算设备(可以是个人计算机、服务器、触控终端、或者网络设备等)执行根据本申请实施方式的方法。Through the description of the above embodiments, those skilled in the art can easily understand that the example embodiments described here can be implemented by software, or can be implemented by combining software with necessary hardware. Therefore, the technical solution according to the embodiments of the present application can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, U disk, mobile hard disk, etc.) or on the network , Including several instructions to make a computing device (which can be a personal computer, a server, a touch terminal, or a network device, etc.) execute the method according to the embodiment of the application.
本领域技术人员在考虑说明书及实践这里公开的申请后,将容易想到本申请的其它实施方案。本申请旨在涵盖本申请的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本申请的一般性原理并包括本申请未公开的本技术领域中的公知常识或惯用技术手段。Those skilled in the art will easily think of other embodiments of the present application after considering the specification and practicing the application disclosed herein. This application is intended to cover any variations, uses, or adaptive changes of this application. These variations, uses, or adaptive changes follow the general principles of this application and include common knowledge or customary technical means in the technical field that are not disclosed in this application. .
应当理解的是,本申请并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本申请的范围仅由所附的权利要求来限制。It should be understood that the present application is not limited to the precise structure that has been described above and shown in the drawings, and various modifications and changes can be made without departing from its scope. The scope of the application is only limited by the appended claims.

Claims (20)

  1. 一种根据关系图卷积网络的任务标签生成方法,其中,包括: A task label generation method based on a relational graph convolutional network, which includes:
    根据用户数据以及任务数据,生成用户任务图数据,所述用户任务图数据包括至少一个用户节点以及至少一个任务节点,所述用户节点对应于所述用户数据中含有的用户信息,所述任务节点对应于所述任务数据中含有的任务信息;According to user data and task data, user task graph data is generated. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data. Corresponding to the task information contained in the task data;
    将所述用户任务图数据输入关系图卷积网络中,获得所述关系图卷积网络针对所述用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,所述用户特征对应于所述用户任务图数据中含有的用户节点,所述任务特征对应于所述用户任务图数据中含有的任务节点;Input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to all The user node contained in the user task graph data, and the task feature corresponds to the task node contained in the user task graph data;
    根据所述用户特征以及所述任务特征,构建用户任务矩阵,所述用户任务矩阵中含有所述用户节点在至少一个任务标签上的概率分布,所述任务标签与所述任务节点一一对应;Constructing a user task matrix according to the user characteristics and the task characteristics, the user task matrix containing the probability distribution of the user node on at least one task label, and the task label corresponds to the task node one-to-one;
    根据所述用户任务矩阵中含有的所述用户节点在至少一个任务标签上的概率分布,生成所述用户节点所对应用户的目标任务标签。According to the probability distribution of the user node on at least one task label contained in the user task matrix, a target task label of the user corresponding to the user node is generated.
  2. 根据权利要求1所述的方法,其中,在所述根据用户数据以及任务数据,生成用户任务图数据之前,所述方法还包括: The method according to claim 1, wherein before said generating user task graph data based on user data and task data, the method further comprises:
    根据历史任务数据以及历史用户数据,生成历史用户任务图数据,所述历史用户数据对应于多个已知的任务标签,所述已知的任务标签与所述历史任务信息一一对应;Generating historical user task graph data according to historical task data and historical user data, where the historical user data corresponds to a plurality of known task tags, and the known task tags correspond to the historical task information in a one-to-one correspondence;
    根据所述历史用户任务图数据以及所述已知的任务标签,对所述关系图卷积网络进行训练。Training the relational graph convolutional network according to the historical user task graph data and the known task label.
  3. 根据权利要求1所述的方法,其中,所述根据用户数据以及任务数据,生成用户任务图数据,包括: The method according to claim 1, wherein said generating user task graph data according to user data and task data comprises:
    根据所述用户数据生成用户图数据,所述用户图数据包括所述至少一个用户节点,以及根据所述任务数据生成任务图数据,所述任务图数据包括所述至少一个任务节点;Generating user graph data according to the user data, the user graph data including the at least one user node, and generating task graph data according to the task data, the task graph data including the at least one task node;
    对所述用户图数据和所述任务图数据进行合并处理,得到所述用户任务图数据。The user map data and the task map data are combined to obtain the user task map data.
  4. 根据权利要求3所述的方法,其中,所述根据所述用户数据生成用户图数据,包括:The method according to claim 3, wherein said generating user graph data according to said user data comprises:
    获取所述用户数据中含有的用户信息,以及各个用户信息所分别对应的任务记录信息;Acquiring user information contained in the user data and task record information corresponding to each user information;
    根据所述各个用户信息所分别对应的任务记录信息,确定所述各个用户信息对应的用户之间所完成的相同任务;Determine, according to the task record information corresponding to the respective user information, the same tasks completed by the users corresponding to the respective user information;
    根据所述用户信息生成所述用户图数据中的用户节点,并根据所述用户信息对应的用户之间所完成的相同任务生成所述用户节点之间的边,以得到所述用户图数据。The user nodes in the user graph data are generated according to the user information, and the edges between the user nodes are generated according to the same tasks completed between users corresponding to the user information to obtain the user graph data.
  5. 根据权利要求3所述的方法,其中,所述根据所述任务数据生成任务图数据,包括:The method according to claim 3, wherein the generating task graph data according to the task data comprises:
    获取所述任务数据中含有的任务信息,以及各个用户信息所分别对应的任务历史数据;Acquiring task information contained in the task data and task historical data corresponding to each user information;
    根据各个用户信息所分别对应的任务历史数据,确定由相同用户所执行的各个任务信息;Determine each task information performed by the same user according to the task history data corresponding to each user information;
    根据所述任务信息生成所述任务图数据中的任务节点,并根据所述由相同用户所执行的各个任务信息生成所述任务节点之间的边,以得到所述任务图数据。The task nodes in the task graph data are generated according to the task information, and the edges between the task nodes are generated according to the information of each task performed by the same user to obtain the task graph data.
  6. 根据权利要求3所述的方法,其中,所述对所述用户图数据和所述任务图数据进行合并处理,得到所述用户任务图数据,包括:The method according to claim 3, wherein said merging process of said user map data and said task map data to obtain said user task map data comprises:
    获取所述用户图数据中的用户节点所对应的任务记录信息,以及获取所述任务图数据中的任务节点所对应的任务历史数据;Obtaining task record information corresponding to the user node in the user graph data, and obtaining task history data corresponding to the task node in the task graph data;
    根据所述任务记录信息以及所述任务历史数据,确定所述用户节点与所述任务节点之间的关联关系,所述关联关系指示所述用户节点对应的用户完成所述任务节点对应的任务;Determining an association relationship between the user node and the task node according to the task record information and the task history data, the association relationship instructing the user corresponding to the user node to complete the task corresponding to the task node;
    根据所述用户节点与所述任务节点之间的关联关系,在所述用户图数据和任务图数据之间构建所述用户节点与所述任务节点之间的边,以得到所述用户任务图数据。According to the association relationship between the user node and the task node, an edge between the user node and the task node is constructed between the user graph data and the task graph data to obtain the user task graph data.
  7. 根据权利要求1所述的方法,其中,所述根据所述用户任务矩阵中含有的所述用户节点在至少一个任务标签上的概率分布,生成所述用户节点所对应用户的目标任务标签,包括: The method according to claim 1, wherein the generating the target task label of the user corresponding to the user node according to the probability distribution of the user node contained in the user task matrix on at least one task label comprises :
    根据所述用户任务矩阵,确定所述用户节点对应的最大概率;Determine the maximum probability corresponding to the user node according to the user task matrix;
    若所述最大概率大于预设的概率分布阈值,则将所述最大概率对应的任务标签作为所述用户节点所对应用户的目标任务标签。If the maximum probability is greater than the preset probability distribution threshold, the task label corresponding to the maximum probability is used as the target task label of the user corresponding to the user node.
  8. 一种任务标签生成装置,其中,包括:A task tag generating device, which includes:
    用户任务图数据模块,用于根据用户数据以及任务数据,生成用户任务图数据,所述用户任务图数据包括至少一个用户节点以及至少一个任务节点,所述用户节点对应于所述用户数据中含有的用户信息,所述任务节点对应于所述任务数据中含有的任务信息;The user task graph data module is used to generate user task graph data according to user data and task data. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user data contained in The user information of the task node corresponds to the task information contained in the task data;
    特征表示输出模块,用于将所述用户任务图数据输入关系图卷积网络中,获得所述关系图卷积网络针对所述用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,所述用户特征对应于所述用户任务图数据中含有的用户节点,所述任务特征对应于所述用户任务图数据中含有的任务节点;The feature representation output module is configured to input the user task graph data into the relation graph convolutional network to obtain at least one user feature and at least one task feature output by the relation graph convolutional network for the user task graph data, The user feature corresponds to the user node contained in the user task graph data, and the task feature corresponds to the task node contained in the user task graph data;
    用户任务矩阵构建模块,用于根据所述用户特征以及所述任务特征,构建用户任务矩阵,所述用户任务矩阵中含有所述用户节点在至少一个任务标签上的概率分布,所述任务标签与所述任务节点一一对应;The user task matrix construction module is configured to construct a user task matrix according to the user characteristics and the task characteristics, the user task matrix contains the probability distribution of the user node on at least one task label, and the task label is The task nodes have a one-to-one correspondence;
    任务标签生成模块,用于根据所述用户任务矩阵中含有的所述用户节点在至少一个任务标签上的概率分布,生成所述用户节点所对应用户的目标任务标签。The task label generating module is configured to generate the target task label of the user corresponding to the user node according to the probability distribution of the user node on at least one task label contained in the user task matrix.
  9. 一种任务标签生成设备,其中,所述任务标签生成设备包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时执行以下步骤:A task tag generating device, wherein the task tag generating device includes a memory, a processor, and computer-readable instructions stored in the memory and running on the processor, and the processor executes the computer Perform the following steps when the instructions are readable:
    根据用户数据以及任务数据,生成用户任务图数据,所述用户任务图数据包括至少一个用户节点以及至少一个任务节点,所述用户节点对应于所述用户数据中含有的用户信息,所述任务节点对应于所述任务数据中含有的任务信息;According to user data and task data, user task graph data is generated. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data. Corresponding to the task information contained in the task data;
    将所述用户任务图数据输入关系图卷积网络中,获得所述关系图卷积网络针对所述用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,所述用户特征对应于所述用户任务图数据中含有的用户节点,所述任务特征对应于所述用户任务图数据中含有的任务节点;Input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to all The user node contained in the user task graph data, and the task feature corresponds to the task node contained in the user task graph data;
    根据所述用户特征以及所述任务特征,构建用户任务矩阵,所述用户任务矩阵中含有所述用户节点在至少一个任务标签上的概率分布,所述任务标签与所述任务节点一一对应;Constructing a user task matrix according to the user characteristics and the task characteristics, the user task matrix containing the probability distribution of the user node on at least one task label, and the task label corresponds to the task node one-to-one;
    根据所述用户任务矩阵中含有的所述用户节点在至少一个任务标签上的概率分布,生成所述用户节点所对应用户的目标任务标签。According to the probability distribution of the user node on at least one task label contained in the user task matrix, a target task label of the user corresponding to the user node is generated.
  10. 根据权利要求9所述的任务标签生成设备,其中,在所述根据用户数据以及任务数据,生成用户任务图数据之前,所述处理器执行所述计算机可读指令时还执行以下步骤:9. The task label generating device according to claim 9, wherein, before the user task graph data is generated based on the user data and task data, the processor further executes the following steps when executing the computer readable instruction:
    根据历史任务数据以及历史用户数据,生成历史用户任务图数据,所述历史用户数据对应于多个已知的任务标签,所述已知的任务标签与所述历史任务信息一一对应;Generating historical user task graph data according to historical task data and historical user data, where the historical user data corresponds to a plurality of known task tags, and the known task tags correspond to the historical task information in a one-to-one correspondence;
    根据所述历史用户任务图数据以及所述已知的任务标签,对所述关系图卷积网络进行训练。Training the relational graph convolutional network according to the historical user task graph data and the known task label.
  11. 根据权利要求9所述的任务标签生成设备,其中,所述根据用户数据以及任务数据,生成用户任务图数据,包括:The task label generating device according to claim 9, wherein said generating user task graph data according to user data and task data comprises:
    根据所述用户数据生成用户图数据,所述用户图数据包括所述至少一个用户节点,以及根据所述任务数据生成任务图数据,所述任务图数据包括所述至少一个任务节点;Generating user graph data according to the user data, the user graph data including the at least one user node, and generating task graph data according to the task data, the task graph data including the at least one task node;
    对所述用户图数据和所述任务图数据进行合并处理,得到所述用户任务图数据。The user map data and the task map data are combined to obtain the user task map data.
  12. 根据权利要求11所述的任务标签生成设备,其中,所述根据所述用户数据生成用户图数据,包括:The task label generating device according to claim 11, wherein said generating user map data according to said user data comprises:
    获取所述用户数据中含有的用户信息,以及各个用户信息所分别对应的任务记录信息;Acquiring user information contained in the user data and task record information corresponding to each user information;
    根据所述各个用户信息所分别对应的任务记录信息,确定所述各个用户信息对应的用户之间所完成的相同任务;Determine, according to the task record information corresponding to the respective user information, the same tasks completed by the users corresponding to the respective user information;
    根据所述用户信息生成所述用户图数据中的用户节点,并根据所述用户信息对应的用户之间所完成的相同任务生成所述用户节点之间的边,以得到所述用户图数据。The user nodes in the user graph data are generated according to the user information, and the edges between the user nodes are generated according to the same tasks completed between users corresponding to the user information to obtain the user graph data.
  13. 根据权利要求11所述的任务标签生成设备,其中,所述根据所述任务数据生成任务图数据,包括:The task label generating device according to claim 11, wherein said generating task graph data according to said task data comprises:
    获取所述任务数据中含有的任务信息,以及各个用户信息所分别对应的任务历史数据;Acquiring task information contained in the task data and task historical data corresponding to each user information;
    根据各个用户信息所分别对应的任务历史数据,确定由相同用户所执行的各个任务信息;Determine each task information performed by the same user according to the task history data corresponding to each user information;
    根据所述任务信息生成所述任务图数据中的任务节点,并根据所述由相同用户所执行的各个任务信息生成所述任务节点之间的边,以得到所述任务图数据。The task nodes in the task graph data are generated according to the task information, and the edges between the task nodes are generated according to the information of each task performed by the same user to obtain the task graph data.
  14. 根据权利要求11所述的任务标签生成设备,其中,所述对所述用户图数据和所述任务图数据进行合并处理,得到所述用户任务图数据,包括:The task label generating device according to claim 11, wherein the merging process of the user map data and the task map data to obtain the user task map data comprises:
    获取所述用户图数据中的用户节点所对应的任务记录信息,以及获取所述任务图数据中的任务节点所对应的任务历史数据;Obtaining task record information corresponding to the user node in the user graph data, and obtaining task history data corresponding to the task node in the task graph data;
    根据所述任务记录信息以及所述任务历史数据,确定所述用户节点与所述任务节点之间的关联关系,所述关联关系指示所述用户节点对应的用户完成所述任务节点对应的任务;Determining an association relationship between the user node and the task node according to the task record information and the task history data, the association relationship instructing the user corresponding to the user node to complete the task corresponding to the task node;
    根据所述用户节点与所述任务节点之间的关联关系,在所述用户图数据和任务图数据之间构建所述用户节点与所述任务节点之间的边,以得到所述用户任务图数据。According to the association relationship between the user node and the task node, an edge between the user node and the task node is constructed between the user graph data and the task graph data to obtain the user task graph data.
  15. 根据权利要求14所述的任务标签生成设备,其中,所述根据所述用户任务矩阵中含有的所述用户节点在至少一个任务标签上的概率分布,生成所述用户节点所对应用户的目标任务标签,包括:The task label generating device according to claim 14, wherein the target task of the user corresponding to the user node is generated based on the probability distribution of the user node on at least one task label contained in the user task matrix Labels, including:
    根据所述用户任务矩阵,确定所述用户节点对应的最大概率;Determine the maximum probability corresponding to the user node according to the user task matrix;
    若所述最大概率大于预设的概率分布阈值,则将所述最大概率对应的任务标签作为所述用户节点所对应用户的目标任务标签。If the maximum probability is greater than the preset probability distribution threshold, the task label corresponding to the maximum probability is used as the target task label of the user corresponding to the user node.
  16. 一种计算机可读存储介质,其中,所述计算机可读存储介质存储有至少一个指令,所述至少一个指令被处理器执行时实现时执行以下步骤:A computer-readable storage medium, wherein the computer-readable storage medium stores at least one instruction, and the following steps are executed when the at least one instruction is executed by a processor:
    根据用户数据以及任务数据,生成用户任务图数据,所述用户任务图数据包括至少一个用户节点以及至少一个任务节点,所述用户节点对应于所述用户数据中含有的用户信息,所述任务节点对应于所述任务数据中含有的任务信息;According to user data and task data, user task graph data is generated. The user task graph data includes at least one user node and at least one task node. The user node corresponds to the user information contained in the user data. Corresponding to the task information contained in the task data;
    将所述用户任务图数据输入关系图卷积网络中,获得所述关系图卷积网络针对所述用户任务图数据输出的至少一个用户特征,以及至少一个任务特征,所述用户特征对应于所述用户任务图数据中含有的用户节点,所述任务特征对应于所述用户任务图数据中含有的任务节点;Input the user task graph data into the relation graph convolutional network to obtain at least one user feature output by the relation graph convolution network for the user task graph data, and at least one task feature, the user feature corresponding to all The user node contained in the user task graph data, and the task feature corresponds to the task node contained in the user task graph data;
    根据所述用户特征以及所述任务特征,构建用户任务矩阵,所述用户任务矩阵中含有所述用户节点在至少一个任务标签上的概率分布,所述任务标签与所述任务节点一一对应;Constructing a user task matrix according to the user characteristics and the task characteristics, the user task matrix containing the probability distribution of the user node on at least one task label, and the task label corresponds to the task node one-to-one;
    根据所述用户任务矩阵中含有的所述用户节点在至少一个任务标签上的概率分布,生成所述用户节点所对应用户的目标任务标签。According to the probability distribution of the user node on at least one task label contained in the user task matrix, a target task label of the user corresponding to the user node is generated.
  17. 根据权利要求16所述的计算机可读存储介质,其中,在所述根据用户数据以及任务数据,生成用户任务图数据之前,所述至少一个指令被处理器执行时还实现时执行以下步骤:16. The computer-readable storage medium according to claim 16, wherein, before the user task graph data is generated according to the user data and task data, the following steps are performed when the at least one instruction is executed by the processor:
    根据历史任务数据以及历史用户数据,生成历史用户任务图数据,所述历史用户数据对应于多个已知的任务标签,所述已知的任务标签与所述历史任务信息一一对应;Generating historical user task graph data according to historical task data and historical user data, where the historical user data corresponds to a plurality of known task tags, and the known task tags correspond to the historical task information in a one-to-one correspondence;
    根据所述历史用户任务图数据以及所述已知的任务标签,对所述关系图卷积网络进行训练。Training the relational graph convolutional network according to the historical user task graph data and the known task label.
  18. 根据权利要求16所述的计算机可读存储介质,其中,所述根据用户数据以及任务数据,生成用户任务图数据,包括:The computer-readable storage medium according to claim 16, wherein said generating user task graph data according to user data and task data comprises:
    根据所述用户数据生成用户图数据,所述用户图数据包括所述至少一个用户节点,以及根据所述任务数据生成任务图数据,所述任务图数据包括所述至少一个任务节点;Generating user graph data according to the user data, the user graph data including the at least one user node, and generating task graph data according to the task data, the task graph data including the at least one task node;
    对所述用户图数据和所述任务图数据进行合并处理,得到所述用户任务图数据。The user map data and the task map data are combined to obtain the user task map data.
  19. 根据权利要求18所述的计算机可读存储介质,其中,所述根据所述用户数据生成用户图数据,包括:18. The computer-readable storage medium according to claim 18, wherein said generating user graph data according to said user data comprises:
    获取所述用户数据中含有的用户信息,以及各个用户信息所分别对应的任务记录信息;Acquiring user information contained in the user data and task record information corresponding to each user information;
    根据所述各个用户信息所分别对应的任务记录信息,确定所述各个用户信息对应的用户之间所完成的相同任务;Determine, according to the task record information corresponding to the respective user information, the same tasks completed by the users corresponding to the respective user information;
    根据所述用户信息生成所述用户图数据中的用户节点,并根据所述用户信息对应的用户之间所完成的相同任务生成所述用户节点之间的边,以得到所述用户图数据。The user nodes in the user graph data are generated according to the user information, and the edges between the user nodes are generated according to the same tasks completed between users corresponding to the user information to obtain the user graph data.
  20. 根据权利要求18所述的计算机可读存储介质,其中,所述根据所述任务数据生成任务图数据,包括:18. The computer-readable storage medium of claim 18, wherein the generating task graph data according to the task data comprises:
    获取所述任务数据中含有的任务信息,以及各个用户信息所分别对应的任务历史数据;Acquiring task information contained in the task data and task historical data corresponding to each user information;
    根据各个用户信息所分别对应的任务历史数据,确定由相同用户所执行的各个任务信息;Determine each task information performed by the same user according to the task history data corresponding to each user information;
    根据所述任务信息生成所述任务图数据中的任务节点,并根据所述由相同用户所执行的各个任务信息生成所述任务节点之间的边,以得到所述任务图数据。The task nodes in the task graph data are generated according to the task information, and the edges between the task nodes are generated according to the information of each task performed by the same user to obtain the task graph data.
PCT/CN2021/084223 2020-11-25 2021-03-31 Method and related apparatus for generating task label on basis of relationship graph convolutional network WO2021213156A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011342170.X 2020-11-25
CN202011342170.XA CN112464042B (en) 2020-11-25 2020-11-25 Task label generating method and related device for convolution network according to relationship graph

Publications (1)

Publication Number Publication Date
WO2021213156A1 true WO2021213156A1 (en) 2021-10-28

Family

ID=74808128

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/084223 WO2021213156A1 (en) 2020-11-25 2021-03-31 Method and related apparatus for generating task label on basis of relationship graph convolutional network

Country Status (2)

Country Link
CN (1) CN112464042B (en)
WO (1) WO2021213156A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114510989A (en) * 2021-12-23 2022-05-17 中国科学院软件研究所 Normative evaluation method, device and equipment for image data set

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112464042B (en) * 2020-11-25 2023-07-28 平安科技(深圳)有限公司 Task label generating method and related device for convolution network according to relationship graph
CN113887575B (en) * 2021-09-13 2024-04-05 华南理工大学 Graph dataset enhancement method based on self-adaptive graph rolling network

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10599686B1 (en) * 2018-09-27 2020-03-24 Babylon Partners Limited Method and system for extracting information from graphs
CN111199474A (en) * 2019-12-16 2020-05-26 北京淇瑀信息科技有限公司 Risk prediction method and device based on network diagram data of two parties and electronic equipment
CN111553215A (en) * 2020-04-20 2020-08-18 深圳云天励飞技术有限公司 Personnel association method and device, and graph convolution network training method and device
CN111639696A (en) * 2020-05-26 2020-09-08 北京邮电大学 User classification method and device
CN111931076A (en) * 2020-09-22 2020-11-13 平安国际智慧城市科技股份有限公司 Method and device for carrying out relationship recommendation based on authorized directed graph and computer equipment
CN111932308A (en) * 2020-08-13 2020-11-13 中国工商银行股份有限公司 Data recommendation method, device and equipment
CN112464042A (en) * 2020-11-25 2021-03-09 平安科技(深圳)有限公司 Task label generation method according to relation graph convolution network and related device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170200101A1 (en) * 2016-01-07 2017-07-13 Tata Consultancy Services Limited Systems and methods for optimized task allocation
CN110363282B (en) * 2019-06-06 2021-10-26 中国科学院信息工程研究所 Network node label active learning method and system based on graph convolution network

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10599686B1 (en) * 2018-09-27 2020-03-24 Babylon Partners Limited Method and system for extracting information from graphs
CN111199474A (en) * 2019-12-16 2020-05-26 北京淇瑀信息科技有限公司 Risk prediction method and device based on network diagram data of two parties and electronic equipment
CN111553215A (en) * 2020-04-20 2020-08-18 深圳云天励飞技术有限公司 Personnel association method and device, and graph convolution network training method and device
CN111639696A (en) * 2020-05-26 2020-09-08 北京邮电大学 User classification method and device
CN111932308A (en) * 2020-08-13 2020-11-13 中国工商银行股份有限公司 Data recommendation method, device and equipment
CN111931076A (en) * 2020-09-22 2020-11-13 平安国际智慧城市科技股份有限公司 Method and device for carrying out relationship recommendation based on authorized directed graph and computer equipment
CN112464042A (en) * 2020-11-25 2021-03-09 平安科技(深圳)有限公司 Task label generation method according to relation graph convolution network and related device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114510989A (en) * 2021-12-23 2022-05-17 中国科学院软件研究所 Normative evaluation method, device and equipment for image data set

Also Published As

Publication number Publication date
CN112464042B (en) 2023-07-28
CN112464042A (en) 2021-03-09

Similar Documents

Publication Publication Date Title
US10963794B2 (en) Concept analysis operations utilizing accelerators
Zhang et al. Agl: a scalable system for industrial-purpose graph machine learning
WO2021213156A1 (en) Method and related apparatus for generating task label on basis of relationship graph convolutional network
WO2023097929A1 (en) Knowledge graph recommendation method and system based on improved kgat model
US9760627B1 (en) Private-public context analysis for natural language content disambiguation
WO2022116441A1 (en) Bert model fine-tuning method and apparatus based on convolutional neural network
US20180068221A1 (en) System and Method of Advising Human Verification of Machine-Annotated Ground Truth - High Entropy Focus
US11763084B2 (en) Automatic formulation of data science problem statements
WO2021120588A1 (en) Method and apparatus for language generation, computer device, and storage medium
CN108694201A (en) A kind of entity alignment schemes and device
US11593665B2 (en) Systems and methods driven by link-specific numeric information for predicting associations based on predicate types
CN111563158A (en) Text sorting method, sorting device, server and computer-readable storage medium
CN111859872A (en) Text labeling method and device
US11436413B2 (en) Modified machine learning model and method for coherent key phrase extraction
CN111008213B (en) Method and apparatus for generating language conversion model
JP2022545335A (en) New word classification technology
US20220230095A1 (en) Active learning via a surrogate machine learning model using knowledge distillation
CN117056612B (en) Lesson preparation data pushing method and system based on AI assistance
CN112269875B (en) Text classification method, device, electronic equipment and storage medium
WO2023174189A1 (en) Method and apparatus for classifying nodes of graph network model, and device and storage medium
CN117391497A (en) News manuscript quality subjective and objective scoring consistency evaluation method and system
WO2023178979A1 (en) Question labeling method and apparatus, electronic device and storage medium
US20220292396A1 (en) Method and system for generating training data for a machine-learning algorithm
US20220207384A1 (en) Extracting Facts from Unstructured Text
JP2023026279A (en) Multi-model method for natural language processing and recommendation generation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21791653

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21791653

Country of ref document: EP

Kind code of ref document: A1