WO2020029412A1 - Tag recommendation method and apparatus, computer device, and computer-readable storage medium - Google Patents

Tag recommendation method and apparatus, computer device, and computer-readable storage medium Download PDF

Info

Publication number
WO2020029412A1
WO2020029412A1 PCT/CN2018/108915 CN2018108915W WO2020029412A1 WO 2020029412 A1 WO2020029412 A1 WO 2020029412A1 CN 2018108915 W CN2018108915 W CN 2018108915W WO 2020029412 A1 WO2020029412 A1 WO 2020029412A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
tags
tag
cluster
target user
Prior art date
Application number
PCT/CN2018/108915
Other languages
French (fr)
Chinese (zh)
Inventor
吴壮伟
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2020029412A1 publication Critical patent/WO2020029412A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0203Market surveys; Market polls
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations

Definitions

  • the present application relates to the field of Internet technologies, and in particular, to a tag recommendation method, device, computer device, and computer-readable storage medium.
  • Tags are a type of data that identifies resources or users in the current era of e-commerce networks.
  • the user's tag data can be used to analyze the user's interest preferences to help e-commerce find products that specific users recommend for their purchase.
  • the tag data is generally provided by the e-commerce platform or social platform for users to choose and use. The number and category are fixed and may not meet the user's situation.
  • the tags provided by the e-commerce platform do not have tags that fit the user's preferences, they are generally Custom tags. Users with the same preferences may have different custom tags for things of the same nature. The more users, the more messy the custom tags are, resulting in diverse and difficult to unify tags, which is not good for e-commerce or social platforms. Subsequent use of tag data to analyze user preferences.
  • the embodiments of the present application provide a label recommendation method, device, computer device, and computer-readable storage medium, which are intended to recommend unified labels to users to avoid the situation where the labels used by users are too scattered.
  • an embodiment of the present application provides a tag recommendation method.
  • the method includes: obtaining a user-item rating matrix, where the user-item rating matrix includes all users and all users ’ratings of all products, and All users include the target user and several other users; calculate the similarity between each other user and the target user according to the user-item rating matrix to obtain a similar user group of the target user; Used first tags; categorizing the first tags to obtain the clusters to which each of the first tags belong; analyzing the first tags in each cluster to be used by users in the similar user group Use case; recommend the tag in the corresponding class cluster to the target user according to the situation where the first tag in each class cluster is used by the similar user group.
  • an embodiment of the present application further provides a label recommendation device, where the label recommendation device includes a unit for implementing the label recommendation method described in the first aspect.
  • an embodiment of the present application further provides a computer device including a memory and a processor connected to the memory; the memory is used to store a computer program that implements a tag recommendation method; and the processor is configured to run an The computer program stored in the memory is described in the method described in the first aspect.
  • an embodiment of the present application provides a computer-readable storage medium, where the storage medium stores one or more computer programs, and the one or more computer programs can be executed by one or more processors. To implement the method described in the first aspect above.
  • the tag recommendation method, device, computer equipment, and computer-readable storage medium provided in the embodiments of the present application recommend the unused tags to the target user based on the tag situation used by the similar user group of the target user, and not only can the similarity be used
  • the common tag preferences of the user group recommend tags that match the personal preferences of the target user, and also realize the unification of the tags used by similar user groups, avoiding the situation where the tags used by users are too scattered, and the unified tag data is conducive to subsequent analysis of users' common preference , To carry out other personalized marketing plans for user groups.
  • FIG. 1 is a schematic flowchart of a label recommendation method according to an embodiment of the present application
  • FIG. 2 is a schematic diagram of a sub-flow of a label recommendation method according to an embodiment of the present application
  • FIG. 3 is a schematic diagram of a sub-flow of a label recommendation method according to another embodiment of the present application.
  • FIG. 4 is a schematic diagram of a sub-flow of a label recommendation method according to another embodiment of the present application.
  • FIG. 5 is a schematic block diagram of a label recommendation device according to an embodiment of the present application.
  • FIG. 6 is a schematic block diagram of a subunit of a tag recommendation device according to an embodiment of the present application.
  • FIG. 7 is a schematic block diagram of a subunit of a label recommendation device according to another embodiment of the present application.
  • FIG. 8 is a schematic block diagram of a subunit of a tag recommendation device according to another embodiment of the present application.
  • FIG. 9 is a schematic block diagram of a structure of a computer device according to an embodiment of the present application.
  • FIG. 1 is a schematic flowchart of a tag recommendation method according to an embodiment of the present application.
  • the method can be applied to a terminal.
  • the terminal may be a smart phone, a tablet computer, a notebook computer, a desktop computer, or other electronic devices with communication functions.
  • the method includes steps S101 to S106.
  • various commodity consumption platforms record the purchase rating records of users purchasing products. These purchase rating records can be crawled through web crawler technology. Statistics of these purchase rating records can be obtained by all users on all products, that is, the user- Product rating matrix. All users refer to all users who have scored for product purchases. All products refer to all products included in the product consumption platform. The other users in step S101 above are relative to the target user, and their identities can be switched. When a product needs to be recommended to one user, that user is the target user, and the remaining users are other users.
  • the embodiment of the present application is to recommend resources to target users based on the user's collaborative filtering idea.
  • the user-based collaborative filtering idea is to use statistical techniques to find neighbors with the same preferences as the target user, that is, similar users (groups), and then according to the target user Of neighbors ’preferences generate recommendations to target users.
  • step S102 includes steps S1021-S1023.
  • U1 is the target user and U2-Um other users.
  • the vector dimension of the user vector corresponding to the other user is equal to the number of products.
  • the vector value of one dimension corresponding to the product that has been scored is 1, and the vector value corresponding to the product that has not been scored is 0.
  • the target user vector of U1 is User vector to be compared for U2 User vector to be compared for U3
  • the vector values omitted by the ellipsis are all 0.
  • the user vector can be simplified based on all the products rated by the two users to be compared. Compared with U1, the two products that have been rated by the two are I1, I2, and I3, so the user vector can be reduced to 3 dimensions, and the target user vector of U1 is User vector to be compared for U2 If U3 and U1 are compared, the two products they have rated are I1, I2, I3, and I4, so the user vector can be reduced to 4 dimensions.
  • the similar users of the target user are found based on the cosine similarity, that is, the similarity between the two users is calculated according to the following formula:
  • the threshold value is 0.5-0.7. In one embodiment, the threshold value is selected as 0.5, 0.6, or 0.7.
  • a similar user group of the target user can be obtained by calculating the similarity between each other user and the target user.
  • Tags are used by users to categorize resources. Users can analyze the user's interest in a certain type of resources by using the tags frequently.
  • an arbitrary label used by a similar user group is referred to as a first label.
  • the labels that have been used on the network need to be clustered to obtain different clusters, and it is clear that the different clusters contain Which tags can then be used to classify the first tag in step S104 and find each class cluster described by the first tag;
  • the class cluster to which the first tag belongs includes other than the first tag.
  • Tags are tags that have not been used by similar user groups.
  • the crawled network can be set, mainly for mainstream networks, such as Sina Weibo, major e-commerce network platforms, Baidu, etc.
  • mainstream networks such as Sina Weibo, major e-commerce network platforms, Baidu, etc.
  • Well-known web pages Since users can initially use any text or phrase as the label of the product, the label is generally messy and wide-ranging.
  • the original label needs to be divided into frequent labels and infrequent labels.
  • Frequent tags refer to tags that have been used by multiple users (for example, more than 100 users) and have been marked on multiple products (for example, more than 100 products); infrequent tags are not It is often used by users, so it is eliminated.
  • Labels are a kind of text resources. Using the existing corpus and word2vec algorithm, you can train word vectors with arbitrary labels. After you get word vectors with frequent labels, you use the DBScan model to cluster word vectors with frequent labels to get the clusters of labels.
  • the tag recommendation method of the present application is used to recommend tags to the user. For example, if the medical care wants to evaluate the purchased product after shopping, the evaluation process requires the user to tag the product, and the user's evaluation operation can be regarded as a trigger event.
  • each first tag After classifying each first tag, analyze the use of the first tag of each type of cluster by similar user groups. Since each user in the similar user group has the same preference for the same resource, it can be based on each The overall situation where the first type of tags of a class cluster is used by similar users to predict which type of cluster tags are more interested by similar user groups, thereby predicting which type of cluster tags the target user is more interested in, and recommending them to the target user More interested tags.
  • step S105 specifically includes: separately calculating a total frequency of the first tag in each type of cluster used by the similar user group.
  • the situation where the first tags of a certain type of cluster are used by similar user groups can be represented by the total frequency of all the first tags contained in them by the similar user group; all the first tags contained in a type of cluster are The total frequency used by the similar user group is calculated according to the situation where each first tag in the cluster is used by the similar user group.
  • step S105 includes steps S1051-S1052.
  • the frequency of the j first tags, si represents the similarity between the i-th similar user and the target user, and Q ij represents the number of times the i-th similar user uses the j-th first tag.
  • the frequency (represented by F j ) that the j-th first label is used by the similar user group is equal to the number of M similar users using the j-th first label.
  • step S106 specifically includes: recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
  • step S106 includes steps S1061-S1064.
  • TopN clusters that is, the first preset (N) clusters with a higher total frequency, where N is 1-4. In one embodiment, the value of N is 2 or 3.
  • the TopN clusters are tags that are frequently used by similar user groups, and also represent the tags that are frequently used by the target user.
  • the tag recommendation method provided in the embodiment of the present application recommends an unused tag to the target user based on the tag situation used by the similar user group of the target user, and not only can use the common tag preference of the similar user group to recommend matching the personality of the target user
  • the preferred tags also realize the unification of the tags used by similar user groups, avoiding the situation where the tags used by users are too scattered, and the unified tag data is conducive to subsequent analysis of the user's common preferences, and other personalized marketing promotion for the user group plan.
  • FIG. 5 is a schematic block diagram of a label recommendation device 100 according to an embodiment of the present application.
  • the tag recommendation device 100 includes a unit for performing the above-mentioned tag recommendation method, and the device may be configured in a desktop computer, a tablet computer, a laptop computer, and other terminals.
  • the tag recommendation device 100 includes a first acquisition unit 101, a first calculation unit 102, a second acquisition unit 103, a classification unit 104, an analysis unit 105, and a recommendation unit 106.
  • the first obtaining unit 101 is configured to obtain a user-item scoring matrix, where the user-item scoring matrix includes all users and the scoring of all products by all users, and all users include a target user and several other users.
  • the first calculation unit 102 is configured to calculate the similarity between each other user and the target user according to the user-item scoring matrix to obtain a similar user group of the target user.
  • the second obtaining unit 103 is configured to obtain a first tag used by the similar user group.
  • the classifying unit 104 is configured to classify the first tags to obtain a class cluster to which each of the first tags belongs.
  • the analysis unit 105 is configured to analyze a situation in which a first tag in each cluster is used by a user in the similar user group.
  • the recommendation unit 106 is configured to recommend the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group.
  • the first calculation unit 102 includes the following subunits:
  • a first calculation subunit 1021 configured to calculate a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-product rating matrix;
  • a second calculation subunit 1022 configured to separately calculate the similarity between each comparison user vector and the target user vector based on the cosine similarity
  • the confirming subunit 1023 is configured to confirm other users corresponding to the similarity as similar users if the similarity is greater than or equal to a threshold, so as to obtain the similar user group.
  • the analysis unit 105 is specifically configured to separately calculate a total frequency of the first tag in each type of cluster used by the similar user group.
  • the recommendation unit 106 is specifically configured to recommend the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
  • the analysis unit 105 includes:
  • a third calculation subunit 1051 configured to calculate, according to the similarity corresponding to each similar user and the number of times each similar user uses a first tag, the frequency with which the first tag is used by the similar user group;
  • a fourth calculation subunit 1052 is configured to calculate a sum of frequencies of all the first tags in the same cluster used by the similar user group, and confirm the sum of the frequencies as the first tags of the corresponding cluster are similar to the The total frequency used by the user community.
  • the recommendation unit 106 includes:
  • a first obtaining subunit 1061 configured to obtain all tags included in a preset number of clusters with a total frequency ranking first
  • a second acquisition subunit 1062 configured to acquire a tag used by the target user
  • a third obtaining subunit 1063 configured to obtain, from all the tags, tags that have not been used by the target user according to the tags that have been used by the target user;
  • the recommendation subunit 1064 is configured to recommend the obtained unused tags to the target user.
  • the above-mentioned label recommendation device 100 corresponds to the foregoing label recommendation method.
  • the details of the label recommendation device 100 in this embodiment reference may be made to the foregoing method embodiment, and details are not described herein.
  • the above-mentioned tag recommendation device 100 may be implemented in the form of a computer program, and the computer program may be run on a computer device as shown in FIG. 9.
  • FIG. 9 is a schematic block diagram of a structure of a computer device 200 according to an embodiment of the present application.
  • the computer device 200 may be a terminal or a server.
  • the terminal may be an electronic device with a communication function, such as a smart phone, a tablet computer, a notebook computer, a desktop computer, a personal digital assistant, and a wearable device.
  • the server can be an independent server or a server cluster consisting of multiple servers.
  • the computer device 200 includes a processor 202, a memory, and a network interface 205 connected through a system bus 201.
  • the memory may include a non-volatile storage medium 203 and an internal memory 204.
  • the non-volatile storage medium 203 of the computer device 200 may store an operating system 2031 and a computer program 2032. When the computer program 2032 is executed, the processor 202 may execute a tag recommendation method.
  • the internal memory 204 provides an environment for running the computer program 2032 in the non-volatile storage medium 203.
  • the processor 202 of the computer device 200 is used to provide computing and control capabilities to support the operation of the entire computer device 200.
  • the network interface 205 of the computer device 200 is used for network communication, such as sending assigned tasks and receiving data.
  • the embodiment of the computer device shown in FIG. 9 does not constitute a limitation on the specific configuration of the computer device.
  • the computer device may include more or fewer components than shown in the figure. Either some parts are combined or different parts are arranged.
  • the computer device may include only a memory and a processor. In such an embodiment, the structure and function of the memory and the processor are consistent with the embodiment shown in FIG. 9, and details are not described herein again.
  • the processor 202 When the processor 202 runs the computer program 2032 in the non-volatile storage medium 203, the processor 202 performs the following steps: obtaining a user-item rating matrix, where the user-item rating matrix includes all users and the all users on all products All users include a target user and several other users; calculating the similarity between each other user and the target user according to the user-item rating matrix to obtain a similar user group of the target user; obtaining the similar user group of the target user; First tags used by similar user groups; categorizing the first tags to obtain the clusters to which each first tag belongs; analyzing the first tags in each class cluster by the similar user groups The situation used by the users in the cluster; recommend the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group.
  • the processor 202 when the processor 202 executes the step of calculating the similarity between each other user and the target user according to the user-item scoring matrix to obtain a similar user group of the target user, the processor 202 specifically Perform the following steps: calculate a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-item scoring matrix; and calculate each comparison user vector and the target separately based on cosine similarity Similarity of user vectors; if the similarity is greater than or equal to a threshold, other users corresponding to the similarity are confirmed as similar users to obtain the similar user group.
  • the processor 202 when the processor 202 executes the step of analyzing the situation in which the first tag in each type of cluster is used by users in the similar user group, the processor 202 specifically performs the following steps: The total number of first tags in a class of clusters used by the similar user population.
  • the processor 202 executes the step of recommending the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group, Specifically, the following steps are performed: recommending tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
  • the processor 202 when the processor 202 executes the step of separately calculating the total frequency of the first tag in each type of cluster used by the similar user group, the processor 202 specifically performs the following steps: according to each similar The user's corresponding similarity and the number of times each similar user uses a first tag to calculate the frequency with which the first tag is used by the similar user group; calculate all first tags in the same type of cluster by the similar user group The sum of the used frequencies confirms the sum as the total frequency of the first tag of the corresponding cluster used by the similar user group.
  • the processor 202 when the processor 202 executes the step of recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster, the processor 202 specifically performs the following steps: All tags included in the preset number of clusters with the total frequency ranked; Get tags used by the target user; Get the target user among all tags according to the tags used by the target user Unused tags; recommend the obtained unused tags to the target user.
  • the processor 202 may be a central processing unit (CPU), and the processor 202 may also be another general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), Application-specific integrated circuits (Application Specific Integrated Circuits, ASICs), ready-made programmable gate arrays (Field-Programmable Gate Arrays, FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
  • the computer program includes program instructions, and the computer program may be stored in a storage medium, and the storage medium is a computer-readable storage medium.
  • the program instructions are executed by at least one processor in the computer system to implement the process steps of the embodiment of the method.
  • the present application also provides a computer-readable storage medium, where the computer-readable storage medium stores one or more computer programs, and the one or more computer programs can be executed by one or more processors.
  • the following steps are implemented: obtaining a user-item rating matrix, the user-item rating matrix including all users and the all user ratings for all products, the all users including target users and several other users; according to the user-item
  • the scoring matrix calculates the similarity between each other user and the target user to obtain a similar user group of the target user; obtains a first tag used by the similar user group; classifies the first tag to obtain The class cluster to which each of the first tags belongs; analyzing the situation in which the first tag in each cluster is used by users in the similar user group; according to the first tag of each class cluster, the similar user is used The situation used by the group recommends the tags in the corresponding cluster to the target user.
  • the following steps are specifically implemented:
  • the user-item scoring matrix calculates a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users; and calculates the similarity between each comparison user vector and the target user vector based on the cosine similarity. If the similarity is greater than or equal to a threshold, confirming other users corresponding to the similarity as similar users to obtain the similar user group.
  • the following steps are specifically implemented: calculating the The total number of first tags used by the similar user population.
  • the following steps are specifically implemented: Recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
  • the step of separately calculating the total frequency of the first tag in each type of cluster used by the similar user group when the step of separately calculating the total frequency of the first tag in each type of cluster used by the similar user group is implemented, the following steps are specifically implemented: according to the similarity corresponding to each similar user And the number of times each first user uses a first tag to calculate the frequency with which the first tag is used by the similar user group; calculate the sum of the frequencies that all first tags in the same cluster are used by the similar user group , The total frequency of the first tag whose corresponding sum is confirmed as the corresponding cluster is used by the similar user group.
  • the following steps are specifically implemented: obtaining the total frequency ranking in All tags included in the previous preset number of clusters; obtaining tags used by the target user; and obtaining tags not used by the target user among all tags according to the tags used by the target user ; Recommending the obtained unused tags to the target user.
  • the computer-readable storage medium may be a non-volatile storage medium, which is an internal storage unit of the foregoing device, such as a hard disk or a memory of the device, and the storage medium may also be an external storage device of the device, such as on the device. Equipped with plug-in hard disk, Smart Memory Card (SMC), Secure Digital (SD) card, Flash Card, U disk, mobile hard disk, Read-Only Memory, A variety of computer-readable storage media, such as ROM), magnetic disks, or optical disks, that can store program codes. Further, the computer-readable storage medium may also include both an internal storage unit of the device and an external storage device.

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • Data Mining & Analysis (AREA)
  • Game Theory and Decision Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Embodiments of the present application provide a tag recommendation method and apparatus, a computer device, and a computer-readable storage medium. According to the embodiments of the present application, recommending tags having not yet been used by a target user to the target user on the basis of tags used by a similar user group of the target user implements not only the recommendation of tags complying with personal preferences of the target user by using common tag preferences of the similar user group, but also the unification of tags used by the similar user group, thereby avoiding excessive dispersion of tags used by users. In addition, unified tag data is conducive to subsequent analysis of common preferences of users and to other personalized marketing and promotion planning for the users.

Description

标签推荐方法、装置、计算机设备及计算机可读存储介质Label recommendation method, device, computer equipment and computer-readable storage medium
本申请要求于2018年08月09日提交中国专利局、申请号为201810902677.2、申请名称为“标签推荐方法、装置、计算机设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed on August 9, 2018 with the Chinese Patent Office, application number 201810902677.2, and application name "Label Recommendation Method, Device, Computer Equipment, and Storage Medium", the entire contents of which are incorporated by reference. In this application.
技术领域Technical field
本申请涉及互联网技术领域,尤其涉及一种标签推荐方法、装置、计算机设备及计算机可读存储介质。The present application relates to the field of Internet technologies, and in particular, to a tag recommendation method, device, computer device, and computer-readable storage medium.
背景技术Background technique
随着电子商务的快速发展,推荐系统已经被广泛研究和应用,推荐系统通过提取分析用户的资料、行为等信息来获取用户的喜好。标签是当前的电子商务网络时代中标识资源或者用户的一种数据,通过用户使用的标签数据可以分析出用户的兴趣喜好,以此帮助电商找到特定用户为其推荐可能购买的产品。而标签数据一般由电商平台或社交平台提供给用户选择使用,数量和类别固定有限,且不一定符合用户的情况,而当电商平台提供的标签没有适合用户喜好的标签时,一般由用户自定义标签,有着相同喜好的用户对相同性质的事物自定义的标签可能不一样,用户越多,自定义的标签越杂乱,造成标签多样化和难以统一化,这不利于电商或社交平台后续利用标签数据分析用户的喜好。With the rapid development of e-commerce, recommendation systems have been widely studied and applied. Recommendation systems obtain user preferences by extracting and analyzing user information and behavior information. Tags are a type of data that identifies resources or users in the current era of e-commerce networks. The user's tag data can be used to analyze the user's interest preferences to help e-commerce find products that specific users recommend for their purchase. The tag data is generally provided by the e-commerce platform or social platform for users to choose and use. The number and category are fixed and may not meet the user's situation. When the tags provided by the e-commerce platform do not have tags that fit the user's preferences, they are generally Custom tags. Users with the same preferences may have different custom tags for things of the same nature. The more users, the more messy the custom tags are, resulting in diverse and difficult to unify tags, which is not good for e-commerce or social platforms. Subsequent use of tag data to analyze user preferences.
发明内容Summary of the invention
本申请实施例提供了一种标签推荐方法、装置、计算机设备及计算机可读存储介质,旨在向用户推荐统一化的标签,以避免用户使用的标签过于分散的情况。The embodiments of the present application provide a label recommendation method, device, computer device, and computer-readable storage medium, which are intended to recommend unified labels to users to avoid the situation where the labels used by users are too scattered.
第一方面,本申请实施例提供了一种标签推荐方法,该方法包括:获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括目标用户以及若干其他用户;根据所述用户-商品评分矩阵计算每一其他用户与所述目标用户的相似度,以得到所述目标 用户的相似用户群体;获取所述相似用户群体所使用过的第一标签;将所述第一标签进行归类以得到每个所述第一标签所属的类簇;分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况;根据每一类簇中的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。In a first aspect, an embodiment of the present application provides a tag recommendation method. The method includes: obtaining a user-item rating matrix, where the user-item rating matrix includes all users and all users ’ratings of all products, and All users include the target user and several other users; calculate the similarity between each other user and the target user according to the user-item rating matrix to obtain a similar user group of the target user; Used first tags; categorizing the first tags to obtain the clusters to which each of the first tags belong; analyzing the first tags in each cluster to be used by users in the similar user group Use case; recommend the tag in the corresponding class cluster to the target user according to the situation where the first tag in each class cluster is used by the similar user group.
第二方面,本申请实施例还提供了一种标签推荐装置,所述标签推荐装置包括用于实现第一方面所述的标签推荐方法的单元。In a second aspect, an embodiment of the present application further provides a label recommendation device, where the label recommendation device includes a unit for implementing the label recommendation method described in the first aspect.
第三方面,本申请实施例还提供了一种计算机设备,包括存储器,以及与所述存储器相连的处理器;所述存储器用于存储实现标签推荐方法的计算机程序;所述处理器用于运行所述存储器中存储的计算机程序,以如上述第一方面所述的方法。In a third aspect, an embodiment of the present application further provides a computer device including a memory and a processor connected to the memory; the memory is used to store a computer program that implements a tag recommendation method; and the processor is configured to run an The computer program stored in the memory is described in the method described in the first aspect.
第四方面,本申请实施例提供了一种计算机可读存储介质,所述存储介质存储有一个或者一个以上计算机程序,所述一个或者一个以上计算机程序可被一个或者一个以上的处理器执行,以实现上述第一方面所述的方法。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, where the storage medium stores one or more computer programs, and the one or more computer programs can be executed by one or more processors. To implement the method described in the first aspect above.
本申请实施例提供的标签推荐方法、装置、计算机设备及计算机可读存储介质,基于目标用户的相似用户群体所使用的标签情况来向该目标用户推荐其未使用过的标签,不仅能利用相似用户群体的共同标签偏好推荐符合目标用户个性偏好的标签,还实现相似用户群体使用标签的统一化,避免用户使用的标签过于分散的情况,同时统一化的标签数据有利于后续分析用户的共同喜好,对用户群体进行其它个性化的营销推广策划。The tag recommendation method, device, computer equipment, and computer-readable storage medium provided in the embodiments of the present application recommend the unused tags to the target user based on the tag situation used by the similar user group of the target user, and not only can the similarity be used The common tag preferences of the user group recommend tags that match the personal preferences of the target user, and also realize the unification of the tags used by similar user groups, avoiding the situation where the tags used by users are too scattered, and the unified tag data is conducive to subsequent analysis of users' common preference , To carry out other personalized marketing plans for user groups.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the drawings used in the description of the embodiments are briefly introduced below. Obviously, the drawings in the following description are some embodiments of the present application. For ordinary technicians, other drawings can be obtained based on these drawings without paying creative work.
图1是本申请一实施例提供的一种标签推荐方法的流程示意图;FIG. 1 is a schematic flowchart of a label recommendation method according to an embodiment of the present application; FIG.
图2是本申请一实施例提供的一种标签推荐方法的子流程示意图;FIG. 2 is a schematic diagram of a sub-flow of a label recommendation method according to an embodiment of the present application; FIG.
图3是本申请另一实施例提供的一种标签推荐方法的子流程示意图;3 is a schematic diagram of a sub-flow of a label recommendation method according to another embodiment of the present application;
图4是本申请另一实施例提供的一种标签推荐方法的子流程示意图;4 is a schematic diagram of a sub-flow of a label recommendation method according to another embodiment of the present application;
图5是本申请一实施例提供的一种标签推荐装置的示意性框图;5 is a schematic block diagram of a label recommendation device according to an embodiment of the present application;
图6是本申请一实施例提供的一种标签推荐装置的子单元示意性框图;6 is a schematic block diagram of a subunit of a tag recommendation device according to an embodiment of the present application;
图7是本申请另一实施例提供的一种标签推荐装置的子单元示意性框图;7 is a schematic block diagram of a subunit of a label recommendation device according to another embodiment of the present application;
图8是本申请另一实施例提供的一种标签推荐装置的子单元示意性框图;8 is a schematic block diagram of a subunit of a tag recommendation device according to another embodiment of the present application;
图9是本申请实施例提供的一种计算机设备的结构示意性框图。FIG. 9 is a schematic block diagram of a structure of a computer device according to an embodiment of the present application.
具体实施方式detailed description
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In the following, the technical solutions in the embodiments of the present application will be clearly and completely described with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.
应当理解,当在本说明书和所附权利要求书中使用时,术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。It should be understood that when used in this specification and the appended claims, the terms "including" and "comprising" indicate the presence of described features, integers, steps, operations, elements and / or components, but do not exclude one or The presence or addition of a number of other features, wholes, steps, operations, elements, components, and / or sets thereof.
还应当理解,在本申请说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。It should also be understood that the term "and / or" used in the specification of the application and the appended claims refers to and includes any combination of one or more of the items listed in association and all possible combinations.
也应当理解,尽管术语第一、第二等可以在此用来描述各种元素,但这些元素不应该受限于这些术语,这些术语仅用来将这些元素彼此区分开。It should also be understood that, although the terms first, second, etc. may be used herein to describe various elements, these elements should not be limited to these terms, these terms are only used to distinguish these elements from each other.
图1为本申请实施例提供的一种标签推荐方法的流程示意图,该方法可应用于终端,终端可以是智能手机、平板电脑、笔记本电脑、台式电脑、等具有通信功能的电子设备。该方法包括步骤S101~S106。FIG. 1 is a schematic flowchart of a tag recommendation method according to an embodiment of the present application. The method can be applied to a terminal. The terminal may be a smart phone, a tablet computer, a notebook computer, a desktop computer, or other electronic devices with communication functions. The method includes steps S101 to S106.
S101、获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括一目标用户以及若干其他用户。S101. Acquire a user-item rating matrix, where the user-item rating matrix includes all users and the all user ratings for all products, and all users include a target user and several other users.
目前各个商品消费平台都记录有用户购买商品的购买评分记录,通过网络爬虫技术可以爬取到这些购买评分记录,将这些购买评分记录进行统计可以得到所有用户对所有商品的评分矩阵,即用户-商品评分矩阵。所有用户指的是进行过商品购买评分的全部用户,所有商品指的是商品消费平台所包括的全部商品,上述步骤S101中的其他用户是相对于目标用户而言,二者的身份可以转换, 当需要向一个用户推荐商品时,该用户则作为目标用户,剩余的用户即为其他用户。At present, various commodity consumption platforms record the purchase rating records of users purchasing products. These purchase rating records can be crawled through web crawler technology. Statistics of these purchase rating records can be obtained by all users on all products, that is, the user- Product rating matrix. All users refer to all users who have scored for product purchases. All products refer to all products included in the product consumption platform. The other users in step S101 above are relative to the target user, and their identities can be switched. When a product needs to be recommended to one user, that user is the target user, and the remaining users are other users.
S102、根据所述用户-商品评分矩阵计算每一其他用户与所述目标用户的相似度,以得到所述目标用户的相似用户群体。S102. Calculate the similarity between each other user and the target user according to the user-product rating matrix to obtain a similar user group of the target user.
本申请实施例是基于用户的协同过滤思想向目标用户进行资源推荐,基于用户的协同过滤思想是通过使用统计技术寻找与目标用户具有相同偏好的邻居,即相似用户(群体),然后根据目标用户的邻居的喜好产生向目标用户的推荐。The embodiment of the present application is to recommend resources to target users based on the user's collaborative filtering idea. The user-based collaborative filtering idea is to use statistical techniques to find neighbors with the same preferences as the target user, that is, similar users (groups), and then according to the target user Of neighbors ’preferences generate recommendations to target users.
如图2所示,步骤S102包括步骤S1021-S1023。As shown in FIG. 2, step S102 includes steps S1021-S1023.
S1021、根据所述用户-商品评分矩阵计算所述目标用户对应的目标用户向量及所述若干其他用户对应的若干比较用户向量。S1021. Calculate a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-product scoring matrix.
假设用户-商品评分矩阵如下表1所示:Assume that the user-item rating matrix is shown in Table 1 below:
表1:Table 1:
Figure PCTCN2018108915-appb-000001
Figure PCTCN2018108915-appb-000001
假设U1为目标用户,U2-Um其他用户,在一实施例中,另一用户对应的用户向量的向量维数等于商品的数量,即有n个商品,用户向量的维数为n,将用户进行过评分的商品对应的一个维度的向量值为1,未进行评分对应的向量值为0,则U1的目标用户向量为
Figure PCTCN2018108915-appb-000002
U2的待比较用户向量
Figure PCTCN2018108915-appb-000003
U3的待比较用户向量
Figure PCTCN2018108915-appb-000004
其中省略号省略的向量值均为0。
Suppose U1 is the target user and U2-Um other users. In one embodiment, the vector dimension of the user vector corresponding to the other user is equal to the number of products. The vector value of one dimension corresponding to the product that has been scored is 1, and the vector value corresponding to the product that has not been scored is 0. The target user vector of U1 is
Figure PCTCN2018108915-appb-000002
User vector to be compared for U2
Figure PCTCN2018108915-appb-000003
User vector to be compared for U3
Figure PCTCN2018108915-appb-000004
The vector values omitted by the ellipsis are all 0.
因用户未进行过评分对应的一维度的向量值为0,因此,为了用户向量的简洁性,可以根据两个待比较的用户进行评分过的所有商品对用户向量进行简化,例如,将U2与U1相比较,两人进行评分过的所用商品为I1、I2和I3共3个,因此可将用户向量简化为为3维,U1的目标用户向量为
Figure PCTCN2018108915-appb-000005
U2的待比较用户向量
Figure PCTCN2018108915-appb-000006
若将U3与U1相比较,两人进行评分过的所用商品为I1、I2、I3和I4共4个,因此可将用户向量简化为4维,则
Figure PCTCN2018108915-appb-000007
Figure PCTCN2018108915-appb-000008
Because the vector value of the one-dimensional vector corresponding to the user's rating has not been 0, for the simplicity of the user vector, the user vector can be simplified based on all the products rated by the two users to be compared. Compared with U1, the two products that have been rated by the two are I1, I2, and I3, so the user vector can be reduced to 3 dimensions, and the target user vector of U1 is
Figure PCTCN2018108915-appb-000005
User vector to be compared for U2
Figure PCTCN2018108915-appb-000006
If U3 and U1 are compared, the two products they have rated are I1, I2, I3, and I4, so the user vector can be reduced to 4 dimensions.
Figure PCTCN2018108915-appb-000007
Figure PCTCN2018108915-appb-000008
S1022、基于余弦相似性分别计算每一比较用户向量与所述目标用户向量的 相似度。S1022. Based on the cosine similarity, respectively calculate the similarity between each of the compared user vectors and the target user vector.
在本实施例中,基于余弦相似性寻找目标用户的相似用户,即根据以下公式计算两个用户之前的相似度:In this embodiment, the similar users of the target user are found based on the cosine similarity, that is, the similarity between the two users is calculated according to the following formula:
Figure PCTCN2018108915-appb-000009
Figure PCTCN2018108915-appb-000009
Figure PCTCN2018108915-appb-000010
Figure PCTCN2018108915-appb-000011
则目标用户U1与其他用户U2之间的相似性
Figure PCTCN2018108915-appb-000012
Figure PCTCN2018108915-appb-000013
Figure PCTCN2018108915-appb-000014
则目标用户U1与其他用户U2之间的相似性
Figure PCTCN2018108915-appb-000015
If
Figure PCTCN2018108915-appb-000010
Figure PCTCN2018108915-appb-000011
The similarity between the target user U1 and other users U2
Figure PCTCN2018108915-appb-000012
If
Figure PCTCN2018108915-appb-000013
Figure PCTCN2018108915-appb-000014
The similarity between the target user U1 and other users U2
Figure PCTCN2018108915-appb-000015
S1023、若所述相似度大于或等于阈值,将所述相似度对应的其他用户确认为相似用户,以得到所述相似用户群体。S1023: If the similarity is greater than or equal to a threshold, confirm other users corresponding to the similarity as similar users to obtain the similar user group.
设置一阈值,若两个用户的相似度大于或等于该阈值,说明这两个用户相似,即互为相似用户。在本申请中,该阈值为0.5-0.7,在一实施例中,该阈值选择为0.5、0.6或0.7。Set a threshold. If the similarity between two users is greater than or equal to the threshold, the two users are similar, that is, they are similar users to each other. In the present application, the threshold value is 0.5-0.7. In one embodiment, the threshold value is selected as 0.5, 0.6, or 0.7.
通过计算每一其他用户与目标用户的相似度即可得到目标用户的相似用户群体。A similar user group of the target user can be obtained by calculating the similarity between each other user and the target user.
S103、获取所述相似用户群体所使用过的第一标签。S103. Obtain a first tag used by the similar user group.
标签是用户用来对资源的分类,通过用户使用标签的频繁程度可以分析出用户对某一类资源的兴趣。在本申请实施例中,将相似用户群体使用过的任意标签称为第一标签。Tags are used by users to categorize resources. Users can analyze the user's interest in a certain type of resources by using the tags frequently. In the embodiment of the present application, an arbitrary label used by a similar user group is referred to as a first label.
S104、将所述第一标签进行归类以得到每个所述第一标签所属的类簇。S104. Classify the first tags to obtain a class cluster to which each of the first tags belongs.
将相似用户群体所使用过的全部标签进行归类,找得不同的第一标签分别属于哪一个类簇的标签,通过此方式可以分析相似用户群体可能对哪些标签类簇中的标签感兴趣。All the tags used by similar user groups are categorized, and the tags of which clusters the different first tags belong to are found. In this way, it is possible to analyze which tag clusters the similar user groups may be interested in.
需要说明的是,在进行第一标签的归类之前,或进行该标签推荐方法之前,需要将网络上曾经被使用过的标签进行聚类,得到不同的类簇,清楚不同类簇中包含有哪些标签,而后才可以进行步骤S104中的将第一标签进行归类,找到每一个第一标签所述的类簇;另外,第一标签所属的类簇中除包含第一标签外还包含其它标签,即未被相似用户群体使用过的标签。It should be noted that before the classification of the first label or the recommendation method of the label, the labels that have been used on the network need to be clustered to obtain different clusters, and it is clear that the different clusters contain Which tags can then be used to classify the first tag in step S104 and find each class cluster described by the first tag; In addition, the class cluster to which the first tag belongs includes other than the first tag. Tags are tags that have not been used by similar user groups.
对网络上的标签进行聚类包括以下过程:Clustering labels on the network includes the following processes:
(1)利用网络爬虫技术爬取网络上的原始标签数据;(1) Use web crawler technology to crawl the original tag data on the network;
(2)将所述原始标签数据分为频繁标签和非频繁标签;(2) dividing the original tag data into frequent tags and infrequent tags;
(3)将频繁标签进行聚类以得到不同的类簇以及每个类簇所包含的频繁标签。(3) Cluster frequent labels to obtain different clusters and the frequent labels contained in each cluster.
首先利用爬虫技术在网络上爬取不同用户使用过的标签数据,得到原始标签数据,爬取的网络可设置,主要为主流的网络,例如新浪微博、各大电商网络平台、百度等当前知名度高的网页。由于用户一开始可以使用任意文字或者词组作为商品的标签,因此,标签一般杂乱、范围广,为了标签的重要性和集中性,需要将原始标签划分为频繁标签和非频繁标签,剔除非频繁的标签,留取频繁标签;频繁标签指的是被多个用户(例如100个用户以上)使用过的标签,且被标注过在多个商品(例如100个商品以上)上;非频繁的标签不常被用户使用,因此剔除。将频繁标签进行聚类后可以得到不同的类簇以及每个类簇所包含的频繁标签。标签是一种文字资源,利用现有的语料库以及word2vec算法可训练得到任意标签的词向量,得到频繁标签的词向量后,利用DBScan模型将频繁标签的词向量进行聚类得到标签的类簇。First, use the crawler technology to crawl the tag data used by different users on the network to obtain the original tag data. The crawled network can be set, mainly for mainstream networks, such as Sina Weibo, major e-commerce network platforms, Baidu, etc. Well-known web pages. Since users can initially use any text or phrase as the label of the product, the label is generally messy and wide-ranging. For the importance and concentration of the label, the original label needs to be divided into frequent labels and infrequent labels. Frequent tags refer to tags that have been used by multiple users (for example, more than 100 users) and have been marked on multiple products (for example, more than 100 products); infrequent tags are not It is often used by users, so it is eliminated. After clustering frequent labels, different clusters and the frequent labels contained in each cluster can be obtained. Labels are a kind of text resources. Using the existing corpus and word2vec algorithm, you can train word vectors with arbitrary labels. After you get word vectors with frequent labels, you use the DBScan model to cluster word vectors with frequent labels to get the clusters of labels.
得到标签的类簇之后,当用户的行为产生触发标签推荐的触发事件,则利用本申请的标签推荐方法对用户行进标签推荐。例如,医用在购物之后欲对购买的商品进行评价,评价过程需要用户对商品标记标签,则用户的评价操作可视为触发事件。After the class of tags is obtained, when a user's behavior generates a trigger event that triggers tag recommendation, the tag recommendation method of the present application is used to recommend tags to the user. For example, if the medical care wants to evaluate the purchased product after shopping, the evaluation process requires the user to tag the product, and the user's evaluation operation can be regarded as a trigger event.
S105、分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况。S105. Analyze a situation in which the first tag in each cluster is used by users in the similar user group.
S106、根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。S106. Recommend the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group.
将每一个第一标签进行归类之后,分析每一类簇的第一标签被相似用户群体使用的情况,由于相似用户群体中每个用户对相同的资源有相同的偏好,因此可根据每一类簇的第一类标签被相似用户使用的整体情况来预测相似用户群体对哪一类簇的标签更感兴趣,从而预测目标用户对哪一类簇的标签更感兴趣,向目标用户推荐其更为感兴趣的标签。After classifying each first tag, analyze the use of the first tag of each type of cluster by similar user groups. Since each user in the similar user group has the same preference for the same resource, it can be based on each The overall situation where the first type of tags of a class cluster is used by similar users to predict which type of cluster tags are more interested by similar user groups, thereby predicting which type of cluster tags the target user is more interested in, and recommending them to the target user More interested tags.
在一实施例中,步骤S105具体包括:分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数。In an embodiment, step S105 specifically includes: separately calculating a total frequency of the first tag in each type of cluster used by the similar user group.
某一类簇的第一标签被相似用户群体所使用的情况可通过其所包含的全部第一标签被相似用户群体所使用的总频数来表示;一类簇中所包含的全部第一标签被相似用户群体所使用的总频数根据该类簇中每个第一标签被该相似用户群体使用的情况来计算。The situation where the first tags of a certain type of cluster are used by similar user groups can be represented by the total frequency of all the first tags contained in them by the similar user group; all the first tags contained in a type of cluster are The total frequency used by the similar user group is calculated according to the situation where each first tag in the cluster is used by the similar user group.
进一步地,如图3所示,步骤S105包括步骤S1051-S1052。Further, as shown in FIG. 3, step S105 includes steps S1051-S1052.
S1051、根据每个相似用户对应的相似度及每个相似用户使用一第一标签的次数计算该一第一标签被所述相似用户群体所使用的频数。S1051. Calculate the frequency with which the first tag is used by the similar user group according to the similarity corresponding to each similar user and the number of times each similar user uses a first tag.
S1052、计算同一类簇中所有第一标签被所述相似用户群体所使用的频数之和,将所述频数之和确认为对应类簇的第一标签被所述相似用户群体所使用的总频数。S1052. Calculate the sum of the frequencies of all the first tags in the same cluster used by the similar user group, and confirm the sum of the frequencies as the total frequency of the first tags of the corresponding cluster used by the similar user group. .
假设该类簇中具有K个第一标签,第j个第一标签被第i个相似用户使用的频数根据公式f ij=si*Q ij计算,其中f ij表示第i个相似用户使用该第j个第一标签的频数,si表示第i个相似用户与目标用户的相似度,Q ij表示第i个相似用户使用该第j个第一标签的使用次数。利用相似度作为标签使用频繁度的加权值,用户之间越相似,用户间的偏好越趋近相同,因此相似度越高,权重越高,则对应的相似用户使用标签的情况的参考度更重要,这对标签推荐的个性化更强。 Assume that there are K first tags in this type of cluster, and the frequency of the j-th first tag used by the i-th similar user is calculated according to the formula f ij = si * Q ij , where f ij indicates that the i-th similar user uses the first The frequency of the j first tags, si represents the similarity between the i-th similar user and the target user, and Q ij represents the number of times the i-th similar user uses the j-th first tag. Using similarity as a weighted value of how frequently tags are used, the more similar between users, the closer the preferences between users are, so the higher the similarity and the higher the weight, the more reference the corresponding similar users use the tags Importantly, this is more personalized for tag recommendations.
假设该相似用户群体中具有M个相似用户,则该第j个第一标签被所述相似用户群体所使用的频数(用F j表示)等于M个相似用户使用该第j个第一标签的频数之和,即 Assuming that there are M similar users in the similar user group, the frequency (represented by F j ) that the j-th first label is used by the similar user group is equal to the number of M similar users using the j-th first label. Sum of frequencies, ie
Figure PCTCN2018108915-appb-000016
Figure PCTCN2018108915-appb-000016
同一类簇中所有第一标签被所述相似用户群体所使用的频数之和,即步骤S1052中的总频数,其计算公式如下:The sum of the frequencies of all the first tags in the same cluster used by the similar user group, that is, the total frequencies in step S1052, the calculation formula is as follows:
Figure PCTCN2018108915-appb-000017
Figure PCTCN2018108915-appb-000017
在一实施例中,步骤S106具体包括:根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签。In an embodiment, step S106 specifically includes: recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
总频数越大,说明对应类簇的第一标签被使用得越频繁,该类簇中的标签被该相似用户群体以及目标用户使用的概率越高,将该类簇中的标签推荐给目标用户,避免同一用户群体自定义的标签导致标签过于分散,从而实现相似用户群体使用标签的统一化。The larger the total frequency, the more frequently the first label of the corresponding cluster is used, and the higher the probability that the labels in the cluster are used by the similar user group and the target user, the labels in the cluster are recommended to the target user , To avoid the tags that are customized by the same user group leading to too scattered tags, thereby realizing the unified use of tags by similar user groups.
进一步地,如图4所示,步骤S106包括步骤S1061-S1064。Further, as shown in FIG. 4, step S106 includes steps S1061-S1064.
S1061、获取总频数排位在前的预设数量的类簇所包含的所有标签。S1061. Obtain all tags included in a preset number of clusters with a total frequency ranking.
依据总频数从高到低的顺序将类簇进行排序,获取TopN个类簇,即总频数较高的前预设个(N个)类簇,N为1-4。在一实施例中,N取值为2或3。Sort the clusters according to the order of the total frequency from high to low to obtain TopN clusters, that is, the first preset (N) clusters with a higher total frequency, where N is 1-4. In one embodiment, the value of N is 2 or 3.
TopN个类簇作为相似用户群体较为频繁使用的标签,也代表了该目标用户较为频繁使用的标签。The TopN clusters are tags that are frequently used by similar user groups, and also represent the tags that are frequently used by the target user.
S1062、获取所述目标用户使用过的标签。S1062. Obtain a tag used by the target user.
S1063、依据所述目标用户使用过的标签,在所述所有标签中获取所述目标用户未使用过的标签。S1063. According to the tags used by the target user, obtain, from all the tags, tags that the target user has not used.
S1064、将所获取的未使用过的标签推荐给所述目标用户。S1064. Recommend the obtained unused tags to the target user.
获取该TopN个类簇中所包含的未被目标用户使用过的标签全部推荐,形成不同类簇的推荐标签列表反馈给目标用户,进而用户可以在不同的推荐标签列表中选择对应类簇的标签。Get all recommended tags in the TopN clusters that have not been used by the target user, form a list of recommended tags for different clusters and feed them back to the target user, and then the user can select the tags of the corresponding cluster in different recommended tag lists .
本申请实施例提供的标签推荐方法,基于目标用户的相似用户群体所使用的标签情况来向该目标用户推荐其未使用过的标签,不仅能利用相似用户群体的共同标签偏好推荐符合目标用户个性偏好的标签,还实现相似用户群体使用标签的统一化,避免用户使用的标签过于分散的情况,同时统一化的标签数据有利于后续分析用户的共同喜好,对用户群体进行其它个性化的营销推广策划。The tag recommendation method provided in the embodiment of the present application recommends an unused tag to the target user based on the tag situation used by the similar user group of the target user, and not only can use the common tag preference of the similar user group to recommend matching the personality of the target user The preferred tags also realize the unification of the tags used by similar user groups, avoiding the situation where the tags used by users are too scattered, and the unified tag data is conducive to subsequent analysis of the user's common preferences, and other personalized marketing promotion for the user group plan.
图5为本申请实施例提供的一种标签推荐装置100的示意性框图。该标签推荐装置100包括用于执行上述标签推荐方法的单元,该装置可以被配置于台式电脑、平板电脑、手提电脑、等终端中。该标签推荐装置100包括第一获取单元101、第一计算单元102、第二获取单元103、归类单元104、分析单元105以及推荐单元106。FIG. 5 is a schematic block diagram of a label recommendation device 100 according to an embodiment of the present application. The tag recommendation device 100 includes a unit for performing the above-mentioned tag recommendation method, and the device may be configured in a desktop computer, a tablet computer, a laptop computer, and other terminals. The tag recommendation device 100 includes a first acquisition unit 101, a first calculation unit 102, a second acquisition unit 103, a classification unit 104, an analysis unit 105, and a recommendation unit 106.
第一获取单元101用于获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括一目标用户以及若干其他用户。The first obtaining unit 101 is configured to obtain a user-item scoring matrix, where the user-item scoring matrix includes all users and the scoring of all products by all users, and all users include a target user and several other users.
第一计算单元102用于根据所述用户-商品评分矩阵计算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体。The first calculation unit 102 is configured to calculate the similarity between each other user and the target user according to the user-item scoring matrix to obtain a similar user group of the target user.
第二获取单元103用于获取所述相似用户群体所使用过的第一标签。The second obtaining unit 103 is configured to obtain a first tag used by the similar user group.
归类单元104用于将所述第一标签进行归类以得到每个所述第一标签所属的类簇。The classifying unit 104 is configured to classify the first tags to obtain a class cluster to which each of the first tags belongs.
分析单元105,用于分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况。The analysis unit 105 is configured to analyze a situation in which a first tag in each cluster is used by a user in the similar user group.
推荐单元106用于根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。The recommendation unit 106 is configured to recommend the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group.
在一实施例中,如图6所示,所述第一计算单元102包括以下子单元:In an embodiment, as shown in FIG. 6, the first calculation unit 102 includes the following subunits:
第一计算子单元1021,用于根据所述用户-商品评分矩阵计算所述目标用户对应的目标用户向量及所述若干其他用户对应的若干比较用户向量;A first calculation subunit 1021, configured to calculate a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-product rating matrix;
第二计算子单元1022,用于基于余弦相似性分别计算每一比较用户向量与所述目标用户向量的相似度;以及A second calculation subunit 1022, configured to separately calculate the similarity between each comparison user vector and the target user vector based on the cosine similarity; and
确认子单元1023,用于若所述相似度大于或等于阈值,将所述相似度对应的其他用户确认为相似用户,以得到所述相似用户群体。The confirming subunit 1023 is configured to confirm other users corresponding to the similarity as similar users if the similarity is greater than or equal to a threshold, so as to obtain the similar user group.
在一实施例中,所述分析单元105具体用于:分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数。In an embodiment, the analysis unit 105 is specifically configured to separately calculate a total frequency of the first tag in each type of cluster used by the similar user group.
所述推荐单元106具体用于:根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签。The recommendation unit 106 is specifically configured to recommend the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
在一实施例中,如图7所示,所述分析单元105包括:In an embodiment, as shown in FIG. 7, the analysis unit 105 includes:
第三计算子单元1051,用于根据每个相似用户对应的相似度及每个相似用户使用一第一标签的次数计算该一第一标签被所述相似用户群体所使用的频数;以及A third calculation subunit 1051, configured to calculate, according to the similarity corresponding to each similar user and the number of times each similar user uses a first tag, the frequency with which the first tag is used by the similar user group; and
第四计算子单元1052,用于计算同一类簇中所有第一标签被所述相似用户群体所使用的频数之和,将所述频数之和确认为对应类簇的第一标签被所述相似用户群体所使用的总频数。A fourth calculation subunit 1052 is configured to calculate a sum of frequencies of all the first tags in the same cluster used by the similar user group, and confirm the sum of the frequencies as the first tags of the corresponding cluster are similar to the The total frequency used by the user community.
在一实施例中,如图8所示,所述推荐单元106包括:In an embodiment, as shown in FIG. 8, the recommendation unit 106 includes:
第一获取子单元1061,用于获取总频数排位在前的预设数量的类簇所包含的所有标签;A first obtaining subunit 1061, configured to obtain all tags included in a preset number of clusters with a total frequency ranking first;
第二获取子单元1062,用于获取所述目标用户使用过的标签;A second acquisition subunit 1062, configured to acquire a tag used by the target user;
第三获取子单元1063,用于依据所述目标用户使用过的标签,在所述所有标签中获取所述目标用户未使用过的标签;以及A third obtaining subunit 1063, configured to obtain, from all the tags, tags that have not been used by the target user according to the tags that have been used by the target user; and
推荐子单元1064,用于将所获取的未使用过的标签推荐给所述目标用户。The recommendation subunit 1064 is configured to recommend the obtained unused tags to the target user.
上述标签推荐装置100与前述标签推荐方法对应,本实施例中对标签推荐 装置100未详尽之处可参考前述方法实施例,此处不做赘述。The above-mentioned label recommendation device 100 corresponds to the foregoing label recommendation method. For the details of the label recommendation device 100 in this embodiment, reference may be made to the foregoing method embodiment, and details are not described herein.
上述标签推荐装置100可以实现为一种计算机程序的形式,计算机程序可以在如图9所示的计算机设备上运行。The above-mentioned tag recommendation device 100 may be implemented in the form of a computer program, and the computer program may be run on a computer device as shown in FIG. 9.
图9为本申请实施例提供的一种计算机设备200的结构示意性框图。该计算机设备200,该计算机设备200可以是终端,也可以是服务器,其中,终端可以是智能手机、平板电脑、笔记本电脑、台式电脑、个人数字助理和穿戴式设备等具有通信功能的电子设备。服务器可以是独立的服务器,也可以是多个服务器组成的服务器集群。FIG. 9 is a schematic block diagram of a structure of a computer device 200 according to an embodiment of the present application. The computer device 200 may be a terminal or a server. The terminal may be an electronic device with a communication function, such as a smart phone, a tablet computer, a notebook computer, a desktop computer, a personal digital assistant, and a wearable device. The server can be an independent server or a server cluster consisting of multiple servers.
该计算机设备200,包括通过系统总线201连接的处理器202、存储器和网络接口205,其中,存储器可以包括非易失性存储介质203和内存储器204。The computer device 200 includes a processor 202, a memory, and a network interface 205 connected through a system bus 201. The memory may include a non-volatile storage medium 203 and an internal memory 204.
该计算机设备200的非易失性存储介质203可存储操作系统2031和计算机程序2032,该计算机程序2032被执行时,可使得处理器202执行一种标签推荐方法。该内存储器204为非易失性存储介质203中的计算机程序2032的运行提供环境。该计算机设备200的处理器202用于提供计算和控制能力,支撑整个计算机设备200的运行。计算机设备200的网络接口205用于进行网络通信,如发送分配的任务、接收数据等。The non-volatile storage medium 203 of the computer device 200 may store an operating system 2031 and a computer program 2032. When the computer program 2032 is executed, the processor 202 may execute a tag recommendation method. The internal memory 204 provides an environment for running the computer program 2032 in the non-volatile storage medium 203. The processor 202 of the computer device 200 is used to provide computing and control capabilities to support the operation of the entire computer device 200. The network interface 205 of the computer device 200 is used for network communication, such as sending assigned tasks and receiving data.
本领域技术人员可以理解,图9中示出的计算机设备的实施例并不构成对计算机设备具体构成的限定,在其他实施例中,计算机设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。例如,在一些实施例中,计算机设备可以仅包括存储器及处理器,在这样的实施例中,存储器及处理器的结构及功能与图9所示实施例一致,在此不再赘述。Those skilled in the art can understand that the embodiment of the computer device shown in FIG. 9 does not constitute a limitation on the specific configuration of the computer device. In other embodiments, the computer device may include more or fewer components than shown in the figure. Either some parts are combined or different parts are arranged. For example, in some embodiments, the computer device may include only a memory and a processor. In such an embodiment, the structure and function of the memory and the processor are consistent with the embodiment shown in FIG. 9, and details are not described herein again.
处理器202运行非易失性存储介质203中的计算机程序2032时,处理器202执行以下步骤:获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括一目标用户以及若干其他用户;根据所述用户-商品评分矩阵计算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体;获取所述相似用户群体所使用过的第一标签;将所述第一标签进行归类以得到每个所述第一标签所属的类簇;分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况;根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。When the processor 202 runs the computer program 2032 in the non-volatile storage medium 203, the processor 202 performs the following steps: obtaining a user-item rating matrix, where the user-item rating matrix includes all users and the all users on all products All users include a target user and several other users; calculating the similarity between each other user and the target user according to the user-item rating matrix to obtain a similar user group of the target user; obtaining the similar user group of the target user; First tags used by similar user groups; categorizing the first tags to obtain the clusters to which each first tag belongs; analyzing the first tags in each class cluster by the similar user groups The situation used by the users in the cluster; recommend the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group.
在一实施例中,所述处理器202在执行所述根据所述用户-商品评分矩阵计 算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体的步骤时,具体执行以下步骤:根据所述用户-商品评分矩阵计算所述目标用户对应的目标用户向量及所述若干其他用户对应的若干比较用户向量;基于余弦相似性分别计算每一比较用户向量与所述目标用户向量的相似度;若所述相似度大于或等于阈值,将所述相似度对应的其他用户确认为相似用户,以得到所述相似用户群体。In an embodiment, when the processor 202 executes the step of calculating the similarity between each other user and the target user according to the user-item scoring matrix to obtain a similar user group of the target user, the processor 202 specifically Perform the following steps: calculate a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-item scoring matrix; and calculate each comparison user vector and the target separately based on cosine similarity Similarity of user vectors; if the similarity is greater than or equal to a threshold, other users corresponding to the similarity are confirmed as similar users to obtain the similar user group.
在一实施例中,所述处理器202在执行所述分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况的步骤时,具体执行以下步骤:分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数。In an embodiment, when the processor 202 executes the step of analyzing the situation in which the first tag in each type of cluster is used by users in the similar user group, the processor 202 specifically performs the following steps: The total number of first tags in a class of clusters used by the similar user population.
在一实施例中,所述处理器202在执行所述根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签的步骤时,具体执行以下步骤:根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签。In an embodiment, when the processor 202 executes the step of recommending the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group, Specifically, the following steps are performed: recommending tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
在一实施例中,所述处理器202在执行所述分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数的步骤时,具体执行以下步骤:根据每个相似用户对应的相似度及每个相似用户使用一第一标签的次数计算该一第一标签被所述相似用户群体所使用的频数;计算同一类簇中所有第一标签被所述相似用户群体所使用的频数之和,将所述和确认为对应类簇的第一标签被所述相似用户群体所使用的总频数。In an embodiment, when the processor 202 executes the step of separately calculating the total frequency of the first tag in each type of cluster used by the similar user group, the processor 202 specifically performs the following steps: according to each similar The user's corresponding similarity and the number of times each similar user uses a first tag to calculate the frequency with which the first tag is used by the similar user group; calculate all first tags in the same type of cluster by the similar user group The sum of the used frequencies confirms the sum as the total frequency of the first tag of the corresponding cluster used by the similar user group.
在一实施例中,所述处理器202在执行所述根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签的步骤时,具体执行以下步骤:获取总频数排位在前的预设数量的类簇所包含的所有标签;获取所述目标用户使用过的标签;依据所述目标用户使用过的标签,在所述所有标签中获取所述目标用户未使用过的标签;将所获取的未使用过的标签推荐给所述目标用户。In an embodiment, when the processor 202 executes the step of recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster, the processor 202 specifically performs the following steps: All tags included in the preset number of clusters with the total frequency ranked; Get tags used by the target user; Get the target user among all tags according to the tags used by the target user Unused tags; recommend the obtained unused tags to the target user.
应当理解,在本申请实施例中,处理器202可以是中央处理单元(Central Processing Unit,CPU),该处理器202还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that, in the embodiment of the present application, the processor 202 may be a central processing unit (CPU), and the processor 202 may also be another general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), Application-specific integrated circuits (Application Specific Integrated Circuits, ASICs), ready-made programmable gate arrays (Field-Programmable Gate Arrays, FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
本领域普通技术人员可以理解的是实现上述实施例的方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成。该计算机程序包括程序指令,计算机程序可存储于一存储介质中,该存储介质为计算机可读存储介质。该程序指令被该计算机系统中的至少一个处理器执行,以实现上述方法的实施例的流程步骤。A person of ordinary skill in the art can understand that all or part of the processes in the method of the foregoing embodiment can be implemented by using a computer program to instruct related hardware. The computer program includes program instructions, and the computer program may be stored in a storage medium, and the storage medium is a computer-readable storage medium. The program instructions are executed by at least one processor in the computer system to implement the process steps of the embodiment of the method.
因此,本申请还提供一种计算机可读存储介质,所述计算机可读存储介质存储有一个或者一个以上计算机程序,所述一个或者一个以上计算机程序可被一个或者一个以上的处理器执行,可实现以下步骤:获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括目标用户以及若干其他用户;根据所述用户-商品评分矩阵计算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体;获取所述相似用户群体所使用过的第一标签;将所述第一标签进行归类以得到每个所述第一标签所属的类簇;分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况;根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。Therefore, the present application also provides a computer-readable storage medium, where the computer-readable storage medium stores one or more computer programs, and the one or more computer programs can be executed by one or more processors. The following steps are implemented: obtaining a user-item rating matrix, the user-item rating matrix including all users and the all user ratings for all products, the all users including target users and several other users; according to the user-item The scoring matrix calculates the similarity between each other user and the target user to obtain a similar user group of the target user; obtains a first tag used by the similar user group; classifies the first tag to obtain The class cluster to which each of the first tags belongs; analyzing the situation in which the first tag in each cluster is used by users in the similar user group; according to the first tag of each class cluster, the similar user is used The situation used by the group recommends the tags in the corresponding cluster to the target user.
在一实施例中,在实现所述根据所述用户-商品评分矩阵计算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体的步骤时,具体实现以下步骤:根据所述用户-商品评分矩阵计算所述目标用户对应的目标用户向量及所述若干其他用户对应的若干比较用户向量;基于余弦相似性分别计算每一比较用户向量与所述目标用户向量的相似度;若所述相似度大于或等于阈值,将所述相似度对应的其他用户确认为相似用户,以得到所述相似用户群体。In an embodiment, when implementing the step of calculating the similarity between each other user and the target user according to the user-item scoring matrix to obtain a similar user group of the target user, the following steps are specifically implemented: The user-item scoring matrix calculates a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users; and calculates the similarity between each comparison user vector and the target user vector based on the cosine similarity. If the similarity is greater than or equal to a threshold, confirming other users corresponding to the similarity as similar users to obtain the similar user group.
在一实施例中,在实现所述分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况的步骤时,具体实现以下步骤:分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数。In an embodiment, when implementing the step of analyzing the situation where the first tag in each type of cluster is used by users in the similar user group, the following steps are specifically implemented: calculating the The total number of first tags used by the similar user population.
在一实施例中,在实现所述根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签的步骤时,具体实现以下步骤:根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签。In an embodiment, when implementing the step of recommending the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group, the following steps are specifically implemented: Recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
在一实施例中,在实现所述分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数的步骤时,具体实现以下步骤:根据每个相似用户对应的相似度及每个相似用户使用一第一标签的次数计算该一第一标签被所述相似用户群体所使用的频数;计算同一类簇中所有第一标签被所述相似用户群体所 使用的频数之和,将所述和确认为对应类簇的第一标签被所述相似用户群体所使用的总频数。In an embodiment, when the step of separately calculating the total frequency of the first tag in each type of cluster used by the similar user group is implemented, the following steps are specifically implemented: according to the similarity corresponding to each similar user And the number of times each first user uses a first tag to calculate the frequency with which the first tag is used by the similar user group; calculate the sum of the frequencies that all first tags in the same cluster are used by the similar user group , The total frequency of the first tag whose corresponding sum is confirmed as the corresponding cluster is used by the similar user group.
在一实施例中,在实现所述根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签的步骤时,具体实现以下步骤:获取总频数排位在前的预设数量的类簇所包含的所有标签;获取所述目标用户使用过的标签;依据所述目标用户使用过的标签,在所述所有标签中获取所述目标用户未使用过的标签;将所获取的未使用过的标签推荐给所述目标用户。In an embodiment, when implementing the step of recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first label of each type of cluster, the following steps are specifically implemented: obtaining the total frequency ranking in All tags included in the previous preset number of clusters; obtaining tags used by the target user; and obtaining tags not used by the target user among all tags according to the tags used by the target user ; Recommending the obtained unused tags to the target user.
所述计算机可读存储介质,可以是非易失存储介质,是前述设备的内部存储单元,例如设备的硬盘或内存,所述存储介质也可以是所述设备的外部存储设备,例如所述设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)、U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、磁碟或者光盘等各种可以存储程序代码的计算机可读存储介质。进一步地,所述计算机可读存储介质还可以既包括所述设备的内部存储单元也包括外部存储设备。The computer-readable storage medium may be a non-volatile storage medium, which is an internal storage unit of the foregoing device, such as a hard disk or a memory of the device, and the storage medium may also be an external storage device of the device, such as on the device. Equipped with plug-in hard disk, Smart Memory Card (SMC), Secure Digital (SD) card, Flash Card, U disk, mobile hard disk, Read-Only Memory, A variety of computer-readable storage media, such as ROM), magnetic disks, or optical disks, that can store program codes. Further, the computer-readable storage medium may also include both an internal storage unit of the device and an external storage device.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above is only a specific implementation of this application, but the scope of protection of this application is not limited to this. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed in this application. Modifications or replacements, and these modifications or replacements should be covered by the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (20)

  1. 一种标签推荐方法,包括:A label recommendation method includes:
    获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括一目标用户以及若干其他用户;Obtaining a user-item rating matrix, where the user-item rating matrix includes all users and the all user ratings for all products, and all users include a target user and several other users;
    根据所述用户-商品评分矩阵计算每一其他用户与所述目标用户的相似度,以得到所述目标用户的相似用户群体;Calculating the similarity between each other user and the target user according to the user-product rating matrix to obtain a similar user group of the target user;
    获取所述相似用户群体所使用过的第一标签;Acquiring a first tag used by the similar user group;
    将所述第一标签进行归类以得到每个所述第一标签所属的类簇;Classify the first tags to obtain a class cluster to which each of the first tags belongs;
    分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况;Analyze the situation where the first label in each cluster is used by users in the similar user group;
    根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。Recommend the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group.
  2. 根据权利要求1所述的标签推荐方法,其中,所述根据所述用户-商品评分矩阵计算每一其他用户与所述目标用户的相似度,以得到所述目标用户的相似用户群体,包括:The tag recommendation method according to claim 1, wherein the calculating the similarity between each other user and the target user according to the user-item scoring matrix to obtain a similar user group of the target user, comprising:
    根据所述用户-商品评分矩阵计算所述目标用户对应的目标用户向量及所述若干其他用户对应的若干比较用户向量;Calculating a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-item scoring matrix;
    基于余弦相似性分别计算每一比较用户向量与所述目标用户向量的相似度;Separately calculating the similarity between each comparison user vector and the target user vector based on the cosine similarity;
    若所述相似度大于或等于阈值,将所述相似度对应的其他用户确认为相似用户,以得到所述相似用户群体。If the similarity is greater than or equal to a threshold, other users corresponding to the similarity are confirmed as similar users to obtain the similar user group.
  3. 根据权利要求1所述的标签推荐方法,其中,所述分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况,包括:The tag recommendation method according to claim 1, wherein the analyzing a situation in which a first tag in each cluster is used by a user in the similar user group comprises:
    分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数;Calculate the total frequency of the first tag in each type of cluster used by the similar user group;
    所述根据不同类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签,包括:The recommending the tags in the corresponding cluster to the target user according to the situation that the first tags of different clusters are used by the similar user group includes:
    根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签。Recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster.
  4. 根据权利要求3所述的标签推荐方法,其中,所述分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数,包括:The tag recommendation method according to claim 3, wherein the calculating the total frequency of the first tag in each type of cluster used by the similar user group comprises:
    根据每个相似用户对应的相似度及每个相似用户使用一第一标签的次数计算该一第一标签被所述相似用户群体所使用的频数;Calculating the frequency with which the first tag is used by the similar user group according to the similarity corresponding to each similar user and the number of times each similar user uses a first tag;
    计算同一类簇中所有第一标签被所述相似用户群体所使用的频数之和,将 所述频数之和确认为对应类簇的第一标签被所述相似用户群体所使用的总频数。Calculate the sum of the frequencies of all the first tags in the same cluster used by the similar user group, and confirm the sum of the frequencies as the total frequency of the first tags of the corresponding cluster used by the similar user group.
  5. 根据权利要求3所述的标签推荐方法,其中,所述根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签,包括:The tag recommendation method according to claim 3, wherein the recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster includes:
    获取总频数排位在前的预设数量的类簇所包含的所有标签;Obtain all tags included in a preset number of clusters with a total frequency rank;
    获取所述目标用户使用过的标签;Acquiring tags used by the target user;
    依据所述目标用户使用过的标签,在所述所有标签中获取所述目标用户未使用过的标签;Obtaining, according to the tags used by the target user, among all tags, tags that have not been used by the target user;
    将所获取的未使用过的标签推荐给所述目标用户。The obtained unused tags are recommended to the target user.
  6. 一种标签推荐装置,包括:A label recommendation device includes:
    第一获取单元,用于获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括一目标用户以及若干其他用户;A first obtaining unit, configured to obtain a user-item scoring matrix, where the user-item scoring matrix includes all users and the scoring of all products by all users, and all users include a target user and several other users;
    第一计算单元,用于根据所述用户-商品评分矩阵计算每一其他用户与所述目标用户的相似度,以得到所述目标用户的相似用户群体;A first calculation unit, configured to calculate the similarity between each other user and the target user according to the user-item scoring matrix to obtain a similar user group of the target user;
    第二获取单元,用于获取所述相似用户群体所使用过的第一标签;A second obtaining unit, configured to obtain a first tag used by the similar user group;
    归类单元,用于将所述第一标签进行归类以得到每个所述第一标签所属的类簇;A classifying unit, configured to classify the first tags to obtain a class cluster to which each of the first tags belongs;
    分析单元,用于分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况;An analysis unit, configured to analyze a situation in which a first tag in each cluster is used by a user in the similar user group;
    推荐单元,用于根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。A recommendation unit is configured to recommend the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group.
  7. 根据权利要求6所述的标签推荐装置,其中,所述第一计算单元,包括:The tag recommendation device according to claim 6, wherein the first calculation unit comprises:
    第一计算子单元,用于根据所述用户-商品评分矩阵计算所述目标用户对应的目标用户向量及所述若干其他用户对应的若干比较用户向量;A first calculation subunit, configured to calculate a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-product rating matrix;
    第二计算子单元,用于基于余弦相似性分别计算每一比较用户向量与所述目标用户向量的相似度;A second calculation subunit, configured to separately calculate the similarity between each comparison user vector and the target user vector based on the cosine similarity;
    确认子单元,用于若所述相似度大于或等于阈值,将所述相似度对应的其他用户确认为相似用户,以得到所述相似用户群体。A confirmation subunit is configured to, if the similarity is greater than or equal to a threshold, confirm other users corresponding to the similarity as similar users to obtain the similar user group.
  8. 根据权利要求6所述的标签推荐装置,其中,所述分析单元具体用于:The tag recommendation device according to claim 6, wherein the analysis unit is specifically configured to:
    分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数;Calculate the total frequency of the first tag in each type of cluster used by the similar user group;
    所述推荐单元具体用于:根据不同类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签。The recommendation unit is specifically configured to recommend the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tags of the different clusters.
  9. 根据权利要求8所述的标签推荐装置,其中,所述分析单元包括:The tag recommendation device according to claim 8, wherein the analysis unit comprises:
    第三计算子单元,用于根据每个相似用户对应的相似度及每个相似用户使用一第一标签的次数计算该一第一标签被所述相似用户群体所使用的频数;A third calculation subunit, configured to calculate, according to the similarity corresponding to each similar user and the number of times each similar user uses a first tag, the frequency with which the first tag is used by the similar user group;
    第四计算子单元,用于计算同一类簇中所有第一标签被所述相似用户群体所使用的频数之和,将所述频数之和确认为对应类簇的第一标签被所述相似用户群体所使用的总频数。A fourth calculation subunit, configured to calculate a sum of frequencies of all first tags in the same cluster used by the similar user group, and confirm the sum of frequencies as the first tags of the corresponding cluster are used by the similar user The total frequency used by the population.
  10. 根据权利要求8所述的标签推荐装置,其中,所述推荐单元包括:The tag recommendation device according to claim 8, wherein the recommendation unit comprises:
    第一获取子单元,用于获取总频数排位在前的预设数量的类簇所包含的所有标签;A first acquisition subunit, configured to acquire all tags included in a preset number of clusters with a total frequency ranking first;
    第二获取子单元,用于获取所述目标用户使用过的标签;A second acquisition subunit, configured to acquire a tag used by the target user;
    第三获取子单元,用于依据所述目标用户使用过的标签,在所述所有标签中获取所述目标用户未使用过的标签;A third obtaining subunit, configured to obtain, from all the tags, tags that have not been used by the target user according to the tags that have been used by the target user;
    推荐子单元,用于将所获取的未使用过的标签推荐给所述目标用户。A recommendation subunit, configured to recommend the obtained unused tags to the target user.
  11. 一种计算机设备,包括存储器,以及与所述存储器相连的处理器;A computer device including a memory and a processor connected to the memory;
    所述存储器用于存储实现标签推荐方法的计算机程序;The memory is used to store a computer program for implementing the tag recommendation method;
    所述处理器用于运行所述存储器中存储的计算机程序,以执行以下步骤:获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括一目标用户以及若干其他用户;根据所述用户-商品评分矩阵计算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体;获取所述相似用户群体所使用过的第一标签;将所述第一标签进行归类以得到每个所述第一标签所属的类簇;分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况;根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。The processor is configured to run a computer program stored in the memory to perform the following steps: obtaining a user-item rating matrix, where the user-item rating matrix includes all users and all users' ratings for all products, the All users include a target user and several other users; calculating the similarity between each other user and the target user according to the user-item rating matrix to obtain a similar user group of the target user; obtaining the similar user group used by The first tags that have passed; classify the first tags to obtain the clusters to which each of the first tags belong; analyze the first tags in each cluster to be used by users in the similar user group According to the situation that the first label of each cluster is used by the similar user group, the target user is recommended to use the label in the corresponding cluster.
  12. 根据权利要求11所述的计算机设备,其中,所述处理器在执行所述根据所述用户-商品评分矩阵计算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体的步骤时,具体执行以下步骤:根据所述用户-商品评分矩阵计算所述目标用户对应的目标用户向量及所述若干其他用户对应的若干比较用户向量;基于余弦相似性分别计算每一比较用户向量与所述目标用户向量的相似度;若所述相似度大于或等于阈值,将所述相似度对应的其他用户确认为相似用户,以得到所述相似用户群体。The computer device according to claim 11, wherein the processor executes the calculation of the similarity between each other user and the target user according to the user-item scoring matrix to obtain a similar user group of the target user Step, specifically performing the following steps: calculating a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-product scoring matrix; calculating each comparison user separately based on cosine similarity The similarity between the vector and the target user vector; if the similarity is greater than or equal to a threshold, other users corresponding to the similarity are confirmed as similar users to obtain the similar user group.
  13. 根据权利要求11所述的计算机设备,其中,所述处理器在执行所述分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况的步骤时, 具体执行以下步骤:分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数;The computer device according to claim 11, wherein when the processor executes the step of analyzing a situation where a first tag in each type of cluster is used by a user in the similar user group, the processor specifically executes the following Step: Calculate the total frequency of the first tag in each type of cluster used by the similar user group;
    所述处理器在执行所述根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签的步骤时,具体执行以下步骤:根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签。When the processor executes the step of recommending the tags in the corresponding cluster to the target user according to the situation that the first tag of each cluster is used by the similar user group, the processor specifically executes the following steps: The total frequency corresponding to the first tag of a class of clusters recommends the tag in the corresponding class of clusters to the target user.
  14. 根据权利要求13所述的计算机设备,其中,所述处理器在执行所述分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数的步骤时,具体执行以下步骤:根据每个相似用户对应的相似度及每个相似用户使用一第一标签的次数计算该一第一标签被所述相似用户群体所使用的频数;计算同一类簇中所有第一标签被所述相似用户群体所使用的频数之和,将所述和确认为对应类簇的第一标签被所述相似用户群体所使用的总频数。The computer device according to claim 13, wherein the processor specifically executes the following steps when performing the step of separately calculating the total number of first tags in each type of cluster used by the similar user group : Calculate the frequency with which a first tag is used by the similar user group according to the similarity corresponding to each similar user and the number of times each similar user uses a first tag; calculate all first tags in the same cluster The sum of frequencies used by the similar user group is described, and the sum is confirmed as the total number of frequencies used by the first tag of the corresponding class cluster by the similar user group.
  15. 根据权利要求13所述的计算机设备,其中,所述处理器在执行所述根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签的步骤时,具体执行以下步骤:获取总频数排位在前的预设数量的类簇所包含的所有标签;获取所述目标用户使用过的标签;依据所述目标用户使用过的标签,在所述所有标签中获取所述目标用户未使用过的标签;将所获取的未使用过的标签推荐给所述目标用户。The computer device according to claim 13, wherein when the processor executes the step of recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster, specifically The following steps are performed: obtaining all tags included in a preset number of clusters with a total frequency ranking; obtaining tags used by the target user; and among all tags according to the tags used by the target user Acquiring the unused tags of the target user; recommending the acquired unused tags to the target user.
  16. 一种计算机可读存储介质,所述计算机可读存储介质存储有一个或者一个以上计算机程序,所述一个或者一个以上计算机程序可被一个或者一个以上的处理器执行,以实现以下步骤:获取用户-商品评分矩阵,所述用户-商品评分矩阵包括所有用户以及所述所有用户对所有商品的评分,所述所有用户包括一目标用户以及若干其他用户;根据所述用户-商品评分矩阵计算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体;获取所述相似用户群体所使用过的第一标签;将所述第一标签进行归类以得到每个所述第一标签所属的类簇;分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况;根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签。A computer-readable storage medium stores one or more computer programs, and the one or more computer programs can be executed by one or more processors to implement the following steps: acquiring a user -Product rating matrix, the user-product rating matrix includes all users and the all user ratings for all products, the all users include a target user and several other users; each is calculated according to the user-product rating matrix Similarity between other users and the target user to obtain a similar user group of the target user; obtain a first tag used by the similar user group; classify the first tag to obtain each of the first The cluster to which a tag belongs; analyzes the situation where the first tag in each category cluster is used by users in the similar user group; according to the situation where the first tag in each category cluster is used by the similar user group Recommend tags in the corresponding cluster to the target user.
  17. 根据权利要求16所述的计算机可读存储介质,其中,在实现所述根据所述用户-商品评分矩阵计算每一其他用户与目标用户的相似度,以得到所述目标用户的相似用户群体的步骤时,具体实现以下步骤:根据所述用户-商品评分矩阵计算所述目标用户对应的目标用户向量及所述若干其他用户对应的若干比较用户向量;基于余弦相似性分别计算每一比较用户向量与所述目标用户向量 的相似度;若所述相似度大于或等于阈值,将所述相似度对应的其他用户确认为相似用户,以得到所述相似用户群体。The computer-readable storage medium according to claim 16, wherein in implementing the calculation of the similarity between each other user and the target user according to the user-item scoring matrix, to obtain a similar user group of the target user In the step, the following steps are specifically implemented: calculating a target user vector corresponding to the target user and several comparison user vectors corresponding to the other users according to the user-item scoring matrix; and calculating each comparison user vector separately based on cosine similarity Similarity with the target user vector; if the similarity is greater than or equal to a threshold, other users corresponding to the similarity are confirmed as similar users to obtain the similar user group.
  18. 根据权利要求16所述的计算机可读存储介质,其中,在实现所述分析每一类簇中的第一标签被所述相似用户群体中的用户所使用的情况的步骤时,具体实现以下步骤:分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数;The computer-readable storage medium according to claim 16, wherein when implementing the step of analyzing a situation where a first tag in each cluster is used by a user in the similar user group, the following steps are specifically implemented : Calculate the total frequency of the first tag in each type of cluster used by the similar user group separately;
    在实现所述根据每一类簇的第一标签被所述相似用户群体所使用的情况向所述目标用户推荐对应类簇中的标签的步骤时,具体实现以下步骤:根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签。When implementing the step of recommending a tag in a corresponding class cluster to the target user according to the situation that the first label of each class cluster is used by the similar user group, the following steps are specifically implemented: The total frequency corresponding to the first label recommends the label in the corresponding cluster to the target user.
  19. 根据权利要求18所述的计算机可读存储介质,其中,在实现所述分别计算每一类簇中的第一标签被所述相似用户群体所使用的总频数的步骤时,具体实现以下步骤:根据每个相似用户对应的相似度及每个相似用户使用一第一标签的次数计算该一第一标签被所述相似用户群体所使用的频数;计算同一类簇中所有第一标签被所述相似用户群体所使用的频数之和,将所述和确认为对应类簇的第一标签被所述相似用户群体所使用的总频数。The computer-readable storage medium according to claim 18, wherein when implementing the step of separately calculating the total frequency of the first tag in each type of cluster used by the similar user group, the following steps are specifically implemented: Calculate the frequency with which a first tag is used by the similar user group according to the similarity corresponding to each similar user and the number of times each similar user uses a first tag; calculate all first tags in the same cluster A sum of frequencies used by similar user groups, and the sum is confirmed as a total frequency used by the first tag of the corresponding class cluster by the similar user groups.
  20. 根据权利要求18所述的计算机可读存储介质,其中,在实现所述根据每一类簇的第一标签对应的总频数向所述目标用户推荐对应类簇中的标签的步骤时,具体实现以下步骤:获取总频数排位在前的预设数量的类簇所包含的所有标签;获取所述目标用户使用过的标签;依据所述目标用户使用过的标签,在所述所有标签中获取所述目标用户未使用过的标签;将所获取的未使用过的标签推荐给所述目标用户。The computer-readable storage medium according to claim 18, wherein when implementing the step of recommending the tags in the corresponding cluster to the target user according to the total frequency corresponding to the first tag of each cluster, the specific implementation is implemented The following steps: obtaining all tags included in a preset number of clusters with a total frequency ranking; obtaining tags used by the target user; obtaining among all tags according to the tags used by the target user The unused tags of the target user; recommending the obtained unused tags to the target user.
PCT/CN2018/108915 2018-08-09 2018-09-30 Tag recommendation method and apparatus, computer device, and computer-readable storage medium WO2020029412A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810902677.2A CN109165975B (en) 2018-08-09 2018-08-09 Label recommending method, device, computer equipment and storage medium
CN201810902677.2 2018-08-09

Publications (1)

Publication Number Publication Date
WO2020029412A1 true WO2020029412A1 (en) 2020-02-13

Family

ID=64895309

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/108915 WO2020029412A1 (en) 2018-08-09 2018-09-30 Tag recommendation method and apparatus, computer device, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN109165975B (en)
WO (1) WO2020029412A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112685656A (en) * 2020-12-22 2021-04-20 航天信息股份有限公司 Label recommendation method and electronic equipment
CN115879980A (en) * 2022-12-15 2023-03-31 中电金信软件有限公司 Method and device for passenger group circle selection and comparative analysis

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110135893A (en) * 2019-04-16 2019-08-16 深圳壹账通智能科技有限公司 Determination method, apparatus, computer equipment and the storage medium of potential user
CN111831894A (en) * 2019-04-23 2020-10-27 北京嘀嘀无限科技发展有限公司 Information matching method and device
CN110992137B (en) * 2019-11-27 2023-09-01 上海风秩科技有限公司 Product recommendation method and device, electronic equipment and storage medium
CN110929166B (en) * 2019-12-27 2023-10-20 咪咕文化科技有限公司 Content recommendation method, electronic equipment and storage medium
CN111861759B (en) * 2020-06-15 2022-12-23 北京百分点科技集团股份有限公司 Matching method and system for product and customer group
CN111768230A (en) * 2020-06-24 2020-10-13 中国平安财产保险股份有限公司 Label recommendation method and device for client portrait system and computer equipment
CN112182390B (en) * 2020-09-29 2024-02-09 中国平安人寿保险股份有限公司 Mail pushing method, device, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103198418A (en) * 2013-03-15 2013-07-10 北京亿赞普网络技术有限公司 Application recommendation method and application recommendation system
US20150186503A1 (en) * 2012-10-12 2015-07-02 Tencent Technology (Shenzhen) Company Limited Method, system, and computer readable medium for interest tag recommendation
CN108205682A (en) * 2016-12-19 2018-06-26 同济大学 It is a kind of for the fusion content of personalized recommendation and the collaborative filtering method of behavior
CN108334645A (en) * 2018-04-10 2018-07-27 华中科技大学 One kind feeding back newer activity recommendation method based on graph model

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103838756A (en) * 2012-11-23 2014-06-04 阿里巴巴集团控股有限公司 Method and device for determining pushed information
CN103678431B (en) * 2013-03-26 2018-01-02 南京邮电大学 A kind of recommendation method to be scored based on standard label and project
CN104217334A (en) * 2013-06-05 2014-12-17 北京京东尚科信息技术有限公司 Product information recommendation method, device and system
CN103412948B (en) * 2013-08-27 2017-10-24 北京交通大学 The Method of Commodity Recommendation and system of collaborative filtering based on cluster
CN104463637A (en) * 2014-12-23 2015-03-25 北京石油化工学院 Commodity recommendation method and device based on electronic business platform and server

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150186503A1 (en) * 2012-10-12 2015-07-02 Tencent Technology (Shenzhen) Company Limited Method, system, and computer readable medium for interest tag recommendation
CN103198418A (en) * 2013-03-15 2013-07-10 北京亿赞普网络技术有限公司 Application recommendation method and application recommendation system
CN108205682A (en) * 2016-12-19 2018-06-26 同济大学 It is a kind of for the fusion content of personalized recommendation and the collaborative filtering method of behavior
CN108334645A (en) * 2018-04-10 2018-07-27 华中科技大学 One kind feeding back newer activity recommendation method based on graph model

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112685656A (en) * 2020-12-22 2021-04-20 航天信息股份有限公司 Label recommendation method and electronic equipment
CN115879980A (en) * 2022-12-15 2023-03-31 中电金信软件有限公司 Method and device for passenger group circle selection and comparative analysis

Also Published As

Publication number Publication date
CN109165975A (en) 2019-01-08
CN109165975B (en) 2023-05-16

Similar Documents

Publication Publication Date Title
WO2020029412A1 (en) Tag recommendation method and apparatus, computer device, and computer-readable storage medium
Ning et al. A comprehensive survey of neighborhood-based recommendation methods
Marinho et al. Social tagging recommender systems
US10528907B2 (en) Automated categorization of products in a merchant catalog
US8140515B2 (en) Personalization engine for building a user profile
US20150278813A1 (en) Determining a temporary transaction limit
US20190317842A1 (en) Feature-Based Application Programming Interface Cognitive Comparative Benchmarking
US7809665B2 (en) Method and system for transitioning from a case-based classifier system to a rule-based classifier system
CN106844407B (en) Tag network generation method and system based on data set correlation
US8949237B2 (en) Detecting overlapping clusters
US20140059089A1 (en) Method and apparatus for structuring a network
US10678831B2 (en) Page journey determination from fingerprint information in web event journals
WO2021155691A1 (en) User portrait generating method and apparatus, storage medium, and device
US20190065587A1 (en) Page journey determination from web event journals
Zhou et al. Enhancing collaborative filtering with multi-label classification
JP7067884B2 (en) Classification device, classification method and classification program
Bordoloi et al. Keyword extraction using supervised cumulative TextRank
Borges et al. A survey on recommender systems for news data
US20220182363A1 (en) Private cloud user insight privacy
Stankova et al. Classification over bipartite graphs through projection
Krishnan et al. RFM-based customer analysis and product recommendation system
Ogunde et al. A recommender system for selecting potential industrial training organizations
CN111209403A (en) Data processing method, device, medium and electronic equipment
US11461676B2 (en) Machine learning-based recommendation engine for storage system usage within an enterprise
Priyanga et al. Mobile app usage pattern prediction using hierarchical flexi-ensemble clustering (HFEC) for mobile service rating

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18929557

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18929557

Country of ref document: EP

Kind code of ref document: A1