WO2020253369A1 - 生成兴趣标签的方法、装置、计算机设备和存储介质 - Google Patents
生成兴趣标签的方法、装置、计算机设备和存储介质 Download PDFInfo
- Publication number
- WO2020253369A1 WO2020253369A1 PCT/CN2020/086369 CN2020086369W WO2020253369A1 WO 2020253369 A1 WO2020253369 A1 WO 2020253369A1 CN 2020086369 W CN2020086369 W CN 2020086369W WO 2020253369 A1 WO2020253369 A1 WO 2020253369A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- user
- application
- application type
- sample
- preference value
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Definitions
- This application relates to the field of information processing technology, in particular to a method, device, computer equipment and storage medium for generating interest tags.
- differentiated services such as personalized recommendation and diversified marketing have been widely used in people's lives, and these differentiated services are inseparable from user portraits.
- the core job of user portrait is to generate labels for users.
- user behavior can be analyzed and predicted from a macro perspective, which helps to improve the accuracy of the company's marketing behavior for specific users.
- a method for generating interest tags comprising:
- the user usage record in the user usage record set includes the user ID and the application ID;
- the interest tag corresponding to the filtered user identification is determined.
- a device for generating interest tags comprising:
- the usage record acquisition module is used to acquire the user usage record set of the application within a specified time period, and calculate the preference value corresponding to each application ID corresponding to the user ID; the user usage record in the user usage record set includes the user ID And application ID;
- the classification threshold determining module is configured to determine the application type based on the application identifier, and determine the classification threshold of each application type according to the preference value corresponding to the user identifier corresponding to the application identifier under the same application type; There is a preset interest tag corresponding to the application type;
- the user identification screening module is configured to perform condition screening according to the classification threshold according to the user usage data set of each application type determined based on the user usage record set to filter out the user identification;
- the interest tag generation module is used to determine the interest tag corresponding to the screened user identification according to the preset interest tag corresponding to the application type of the user usage data set where the screened user identification is located.
- a computer device includes a memory and a processor, the memory stores a computer program, and the processor implements the steps of the above method for generating interest tags when the computer program is executed.
- a computer-readable storage medium has a computer program stored thereon, and when the computer program is executed by a processor, the steps of the above method for generating interest tags are realized.
- the above method, device, computer equipment, and storage medium for generating interest tags determine the preference value of each application identifier corresponding to the user identifier based on the user usage record set of the application acquired within a specified time period, so as to better characterize user usage The degree of preference for each application. Furthermore, by analyzing the overall distribution of the preference values corresponding to the user IDs under the same application type, the classification threshold of each application type is determined, and the overall preference value under the same application type is fully considered. The distribution situation provides a more accurate screening basis for subsequent screening of user identification. Furthermore, the user usage data set of each application type is filtered according to the corresponding classification threshold, so as to filter out qualified user IDs, which improves the accuracy of generating interest tags for each behavior type.
- Fig. 1 is an application scenario diagram of a method for generating interest tags in an embodiment
- FIG. 2 is a schematic flowchart of a method for generating interest tags in an embodiment
- Figure 3 is a structural block diagram of a device for generating interest tags in an embodiment
- Fig. 4 is an internal structure diagram of a computer device in an embodiment.
- the method for generating interest tags provided in this application can be applied to the application environment as shown in FIG. 1.
- the terminal 102 communicates with the server 104 through the network through the network.
- the server 104 obtains the user usage record set of the application within a specified time period, and calculates the preference value corresponding to each application ID corresponding to the user ID; wherein the user usage record set can be triggered by the terminal 102; and download according to the same application type
- the application identifier of corresponds to the preference value corresponding to the user identifier, and the classification threshold of each application type is determined respectively.
- the server 104 conditionally filters the user usage data set of the corresponding application type according to the obtained classification threshold to filter out the user identification; according to the application type corresponding to the filtered user identification, the server 104 uses the application type as The interest tag of the filtered user identification.
- the terminal 102 may be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices.
- the server 104 may be implemented as an independent server or a server cluster composed of multiple servers.
- a method for generating interest tags is provided. Taking the method applied to the server in FIG. 1 as an example for description, the method includes the following steps:
- Step S202 Obtain the user usage record set of the application within a specified time period, and calculate the preference value corresponding to each application ID corresponding to the user ID; the user usage record in the user usage record set includes the user ID and the application ID.
- the user usage record set includes each user usage record, and each user usage record includes a user ID, an application program ID, and a usage weight.
- User usage records contain a wealth of information, such as the similarity between users, the similarity between applications, and the degree of user preference for each application.
- the user identifier is a unique identifier that distinguishes each user, and may be a user ID (Identification).
- the application identifier is a unique identifier that distinguishes each application.
- the preference value represents the user's preference for using the application corresponding to the application identifier; the preference value is related to the number of users corresponding to the application identifier, the total number of users corresponding to the user usage record set, and the weight of use .
- the user triggers the terminal to generate a user usage record set of each application, and transmits the generated user usage record set to the server through the network, or the user usage record set can be directly stored in the terminal.
- the server may obtain a record set of user usage in a specified time period from each terminal, or may obtain a record set of user use in a specified time period from the server. After the server obtains the user usage record set of the application within the specified time period, it calculates the preference value of each application ID corresponding to the user ID according to the user usage record set.
- the server obtains the number of users corresponding to each application identifier and the total number of users corresponding to the user use record set based on each user use record in the user use record set; and obtains the corresponding user identifier and application identifier Use weight, and then calculate the preference value of the user ID corresponding to each application ID according to the proportion of the number of users in the total number of users and the usage weight.
- Step S204 Determine the application type based on the application identifier, and determine the classification threshold of each application type according to the preference value corresponding to the user identifier corresponding to the application identifier under the same application type, and the application type has a corresponding preset Interest tags.
- the application type refers to the category that distinguishes each application, such as the video type.
- the classification threshold refers to the classification judgment condition of the preference value in the application type to which it belongs. According to the classification threshold, it can be determined whether the user identifier corresponding to the preference value belongs to the application type to which the preference value belongs.
- the classification threshold characterizes the proportion of each user's usage behavior of the application in the overall usage behavior of the application type under the same application type.
- the server calculates the preference value corresponding to each application identifier corresponding to the user identifier based on the user usage record set, and determines the corresponding application type according to each application identifier, and there is a corresponding preset for each application type Interest tag; the preset interest tag can be consistent with the application type, or it can be an identifier that is consistent with the application type.
- the server determines the classification threshold of each application type according to the calculated preference value. The classification threshold can be used to determine whether the user identifier corresponding to the preference value belongs to the application type to which the preference value belongs.
- Step S206 According to the user usage data set of each application type determined based on the user usage record set, condition filtering is performed according to the classification threshold to filter out the user identification.
- the user usage data set includes user usage data sets corresponding to each application program type, and the user usage data set includes user IDs, application IDs, and preference values corresponding to each other.
- the server performs condition filtering according to the classification threshold corresponding to the application type of the user usage data set, so as to filter out the qualified user identification for the user usage data set .
- Step S208 Determine the interest tag corresponding to the filtered user identification according to the application type corresponding to the user usage data set where the filtered user identification is located.
- the interest tag refers to a tag that is different from the user's tendency to have a certain type of behavior; for example, a user often uses a video application, and the corresponding interest tag of the user is a video.
- the server obtains from the database the application type corresponding to the user usage data set where the user ID is located, that is, the interest tag of the user ID is the corresponding application Types of.
- the preference value of each application identifier corresponding to the user identifier is determined based on the user usage record set of the application acquired within a specified time period, which better characterizes the user's preference for using each application. Furthermore, by analyzing the overall distribution of the preference values corresponding to the user IDs under the same application type, the classification threshold of each application type is determined, and the overall preference value under the same application type is fully considered. The distribution situation provides a more accurate screening basis for subsequent screening of user identification. Furthermore, the user usage data set of each application type is filtered according to the corresponding classification threshold, so as to filter out qualified user IDs, which improves the accuracy of generating interest tags for each behavior type.
- the user usage records in the user usage record set also include usage weights; according to the user usage record set of the application within a specified time period, calculating the preference value corresponding to each application identifier corresponding to the user identifier includes The following steps: Obtain the number of users corresponding to each application ID and the total number of users corresponding to the user usage record set; Obtain the usage weight corresponding to the user ID and application ID; According to the proportion of the number of users to the total number of users and the usage weight Calculate the preference value corresponding to each application ID corresponding to the user ID.
- the usage weight characterizes the proportion of the usage degree of a specific application among various applications used by the user.
- the usage weight can be determined according to the installation information, usage times, usage duration, and power consumption of the application.
- the server obtains the number of users corresponding to each application ID and the total number of users corresponding to the user usage record set based on the obtained user usage record set; and obtains the corresponding usage weight from the database according to the user ID and application ID.
- the server calculates the preference value corresponding to each application identifier corresponding to the user identifier according to the acquired total number of users, number of users, and usage weight. That is, the server calculates the preference value corresponding to each application identifier corresponding to the user identifier according to the proportion of the total number of users and the number of users corresponding to the application identifier and the usage weight corresponding to the application identifier.
- the preference value is positively correlated with the usage weight corresponding to the application program identifier, and is positively correlated with the proportion of the number of users corresponding to the application program.
- the proportion of the number of users increases as the total number of users corresponding to the user usage record set increases, and decreases as the number of users corresponding to the application identifier increases.
- the preference value may be the product of the proportion of the number of users corresponding to the application identifier and the use weight corresponding to the application identifier; the proportion of the number of users may be the logarithmic value of the ratio of the total number of users to the number of users corresponding to the application identifier.
- the denominator of the variable parameter x in the function is 0, and 1 can also be added to the denominator of x.
- each application identifier based on the number of users corresponding to each application identifier, the total number of users corresponding to the user usage record set, and the usage weight corresponding to each application identifier corresponding to the user identifier, the calculation of each application identifier corresponds to The preference value corresponding to the user ID.
- determining the classification threshold of each application type according to the preference value corresponding to the user identifier corresponding to the application identifier under the same application type includes the following steps: based on the application identifier under the same application type Corresponding to the preference value corresponding to the user ID, sort the respective preference values of the same application type in ascending order to obtain the sorting result of the preference value; according to the sorting result of the preference value, calculate each corresponding to the same application type The quantile of the preference value; the classification threshold of each application type is determined according to the quantile.
- the value range of the quantile is greater than 0 and less than or equal to 1.
- the server based on the obtained user usage record set and the calculated preference value, under the same application type, the server respectively sorts the respective preference values of the same application type in ascending order to obtain each application The sort result of the preference value of the type. According to the obtained ranking results of each preference value, the server calculates the quantile of each preference value corresponding to the same application type; and determines the classification threshold corresponding to each application type according to the quantile, that is, the classification threshold.
- the value range can be between 0 and 1, and can be 1.
- the ranking result corresponding to each application type is obtained; further, the ranking result of each preference value corresponding to each application type is calculated according to the ranking result.
- Quantile determine the classification threshold of each behavior type according to the calculated quantiles. The overall distribution of quantiles of each application type is used to determine the classification threshold, and the overall distribution is fully considered, which provides a basis for the subsequent generation of interest tags.
- calculating the quantile of each preference value corresponding to each application type according to the sorting result of the preference value includes the following steps: according to the sorting result of the preference value, determining each application type The occurrence probability of each preference value in the corresponding ranking result; the cumulative probability of each preference value under each application type is determined according to the occurrence probability, and the quantile of each preference value under each application type is obtained.
- the occurrence probability refers to the probability of each preference value in the user usage data set corresponding to a certain behavior type.
- Cumulative probability refers to adding up the occurrence probabilities of all preference values that do not exceed the preference value in the user usage data set corresponding to a certain behavior type, and the result is the cumulative probability.
- the server separately calculates the occurrence probability of each preference value corresponding to each application type in the sorting result according to the obtained sorting result of the preference value corresponding to each application type. Based on the calculated occurrence probability, the server determines the cumulative probability of each preference value corresponding to each application type according to the occurrence probability, that is, the cumulative probability is the quantile of the corresponding preference value.
- the occurrence probability of each preference value under each application type in the corresponding ranking result is determined based on the ranking result of the preference value, and the cumulative probability of each preference value under each application type is further obtained according to the occurrence probability , So as to get the quantile of each preference value under each application type.
- the cumulative probability is used to calculate the quantile, which reflects the overall proportion of each application type in the overall situation, fully considers the relationship between the data, and provides a more accurate screening basis for subsequent screening of user identification.
- calculating the quantile of each preference value corresponding to each application type according to the ranking result of the preference value includes the following steps: obtaining the ranking of each preference value under each application type The ranking position in the result and the number of ranking users corresponding to the application type to which each application identifier belongs; the ranking position of each preference value under each application type in the ranking result is divided by the number of ranking users to obtain each application The quantile of each preference value corresponding to each type.
- the sort position refers to the sorting of each element in a data set according to a certain logic, and the position of each element in the data set.
- the number of sorted users refers to the total number of all corresponding elements in a data set.
- the server respectively obtains the ranking position and each preference value of each preference value corresponding to each application type in the preference value ranking result.
- the server After the server obtains the corresponding data, it divides the ranking bit of each preference value corresponding to each application type by the number of ranking users corresponding to the application type, that is, the calculated result is the preference value corresponding to each application type. Quantile.
- the data set includes preference values corresponding to each application identifier corresponding to the user identifier; each preference value is sorted in ascending order to obtain the preference value sorting result. If the preference value A in the data set is ranked 5 in the corresponding ranking result, and the number of ranked users of the preference value A in the application type is 10, then the quantile of the preference value is 5/10*100 %, that is, the quantile is 50%.
- the ranking result of the preference value is: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9; when the preference value is 6, the corresponding quantile is 70%.
- each preference value corresponding to each application type is determined based on the ranking position of each preference value in the ranking result and the number of ranking users corresponding to each application type.
- the quantile of the value By determining the quantile by the ranking position and the number of ranking users, the amount of calculation can be further reduced at the computer level, thereby increasing the calculation speed and the rate of generating interest tags.
- determining the classification threshold of each application type according to the quantile includes the following steps: according to the quantile, corresponding to each application type, respectively filtering out scores greater than or equal to the corresponding first preset threshold. Digits; corresponding to each application type, calculate the difference between adjacent quantiles based on the filtered quantile; obtain the quantile corresponding to each maximum difference calculated for each application type , Get the classification threshold of each application type.
- the preset threshold is a threshold value for judging the quantile set in advance, and the threshold can be stored in a database; the preset threshold is a threshold value for the quantile corresponding to each application type.
- the difference is the calculation result obtained by subtracting two data; it can be the result obtained by subtracting two adjacent quantiles.
- the server obtains the preset threshold value of the corresponding application type from the database for the quantile corresponding to each application type. According to the preset threshold, the quantiles greater than or equal to the preset threshold are filtered out. Corresponding to each application type, the server calculates the difference between two adjacent quantiles according to the filtered quantiles. The server obtains the two quantiles corresponding to the largest difference according to the calculated difference value for each application type, and uses the quantile with the lower rank as the classification threshold corresponding to the application type.
- the classification threshold of the preference value corresponding to each application type is determined based on the quantile, and the quantile with obvious distribution among the application types is selected as the classification threshold of the application type. Furthermore, making full use of the overall distribution characteristics of data of each application type provides a guarantee for the accuracy of interest tags.
- the method includes the following steps: obtaining user usage of known interest tags Record sample set; adjust the classification threshold according to the user's use of the record sample set; according to the user's use of the data set, and perform condition screening according to the adjusted classification threshold to filter out the user identification.
- the user usage record sample set includes various user usage record samples
- the user usage record set includes user usage data sets corresponding to each application type
- the user usage data sets include user IDs, application IDs and preference values corresponding to each other.
- the server obtains a sample set of user usage records of interest tags from a database or a terminal, and adjusts the classification threshold corresponding to each application type according to the obtained sample set of user usage records. Further, based on the user usage data set, the server conditionally filters each preference value corresponding to each application type according to the adjusted classification threshold, so as to filter out user IDs that meet the above-mentioned preference value filtering conditions.
- the classification threshold corresponding to each application type is adjusted based on the user's usage record sample set of known interest tags, so as to obtain the adjusted classification threshold.
- the classification threshold is tested by using a sample set of user records to improve the accuracy of interest tags.
- the user usage record sample in the user usage record sample set includes the sample user ID, interest tag, sample application type, sample application ID and sample usage weight; the classification threshold is adjusted according to the user usage record sample set, It includes the following steps: Determine the sample user usage data set of each sample application type according to the known interest label according to the user usage record sample set.
- the sample usage data set includes the corresponding sample user ID, sample application ID, interest label and sample preference Value; Based on the sample user usage data set of the known label of each sample application type, calculate the quantile of each sample preference value of each sample application type; According to the sample user usage data set of the known label, according to the classification threshold Perform conditional screening to filter out sample user IDs; determine the predicted interest label corresponding to the selected user ID according to the application type corresponding to the user usage data set where the sample user ID is screened; predict according to the sample user data set The recall rate of each sample application type calculated by the interest tag and the known corresponding interest tag, and the classification threshold is adjusted.
- the user usage record sample set includes various user usage record samples, and each user usage record sample includes sample user identification, interest tag, sample application type, sample application identification, and sample usage weight.
- the sample user ID is a unique ID that distinguishes each sample user.
- the sample application type is a type corresponding to each application of the sample user, the sample application type and the application type are corresponding, and the application type includes all sample application types.
- the sample application ID is a unique ID that distinguishes each application.
- the sample preference value represents the degree of preference of the sample user corresponding to the sample user identifier to use the sample application corresponding to the sample application identifier.
- the user usage record sample set includes sample user usage data sets corresponding to each sample application type; the sample user usage data set includes corresponding sample user IDs, sample application IDs, interest tags, and sample preference values.
- the interest tag refers to a tag that is different from the user's tendency to have a certain type of application type.
- the corresponding interest tag of the user may be a video.
- the predicted interest tag is the predicted interest tag generated by the interest tag generation model.
- the recall rate is the ratio of the number of users whose predicted interest label and known interest label are consistent with each sample application type for each sample application type to the total number of users of the sample application type. The closer the recall rate is to 1, the higher the consistency between the predicted interest label and the known interest label for the corresponding sample application type, and the more appropriate selection of the classification threshold for the sample application type.
- the server obtains a sample set of user usage records with interest tags from a database or terminal, and classifies the obtained sample set of user usage records according to known interest tags to obtain sample users corresponding to each sample application type. Use data sets. Based on the sample user usage data set corresponding to each sample application type obtained by classification, the server calculates the quantile of each sample preference value corresponding to each sample application type.
- the server searches the database for the corresponding classification threshold according to each sample application type, and filters the sample user usage data set according to the found classification threshold.
- the selected sample user identification is obtained.
- the filtering condition is: corresponding to each sample user's usage data set, the sample preference value is greater than or equal to the corresponding classification threshold.
- the server searches the database for the sample application type corresponding to the sample user usage data set where the sample user identifier is located according to the selected sample user identifier, that is, the predicted interest tag of the sample user identifier may correspond to the found sample application type. Based on the predicted interest label of the sample user’s use of the data set and the known corresponding interest label, corresponding to each type of sample application, the server determines whether the predicted interest label identified by each sample user is consistent with the known interest label, and uses the label Record the judgment result and store it in the server. When the judgment result is consistent, it can be marked as 1; otherwise, it can be marked as 0. For example, in a sample application type, the known interest tag identified by a sample user is a movie. If the predicted interest tag is also a movie, it is recorded as 1; if the predicted interest tag identified by the sample user is eating, then Recorded as 0.
- the server calculates the recall rate of each type of sample application; and then adjusts the corresponding classification threshold according to the recall rate of each type of sample application. If the recall rate does not meet the adjustment threshold, the classification threshold does not need to be adjusted; if the recall rate meets the adjustment threshold, the classification threshold is adjusted. Then determine the predicted label of the sample user's usage data set according to the adjusted classification threshold, and calculate the recall rate of each type of sample application type, until the recall rate of the user's usage record sample set does not meet the adjusted threshold range, stop Adjust the corresponding classification threshold; the adjustment threshold can be set as: the recall rate is lower than 95%. .
- the classification threshold is adjusted based on the user's use record sample set of known interest tags, and the classification threshold is adjusted according to the calculated recall rate of each application type until the search of each application type The rate does not meet the adjustment threshold.
- the classification threshold is tested by using a sample set of user records, and the accuracy of the interest label is verified by the recall rate, which further improves the accuracy of the interest label.
- steps in the flowchart of FIG. 2 are displayed in sequence as indicated by the arrows, these steps are not necessarily performed in sequence in the order indicated by the arrows. Unless specifically stated in this article, the execution of these steps is not strictly limited in order, and these steps can be executed in other orders. Moreover, at least some of the steps in FIG. 2 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but can be executed at different times. The execution of these sub-steps or stages The sequence is not necessarily performed sequentially, but may be performed alternately or alternately with at least a part of other steps or sub-steps or stages of other steps.
- an apparatus 300 for generating interest tags including: a usage record acquisition module 302, a classification threshold determination module 304, a screening user identification module 306, and an interest tag generation module 308, wherein :
- the usage record obtaining module 302 is used to obtain the user usage record set of the application within a specified time period, and calculate the preference value corresponding to each application ID corresponding to the user ID; the user usage record in the user usage record set includes the user ID and Application ID.
- the classification threshold determination module 304 is used to determine the application type based on the application identifier, and determine the classification threshold of each application type according to the preference value corresponding to the user identifier corresponding to the application identifier under the same application type; the application type There is a corresponding preset interest tag.
- the user identification filtering module 306 is used to filter the user identification according to the user usage data set of each application type determined based on the user usage record set, and perform condition filtering according to the classification threshold.
- the interest tag generation module 308 is configured to determine the interest tag corresponding to the filtered user identification according to the application type corresponding to the user usage data set where the screened user identification is located.
- the aforementioned usage record obtaining module includes: a data obtaining module and a preference value calculating module.
- the data acquisition module is used to obtain the number of users corresponding to each application ID and the total number of users corresponding to the user usage record set; to obtain the usage weight corresponding to the user ID and the application ID; the preference value calculation module is used to calculate the total number of users The proportion of the number to the number of users and the weight of use are calculated, and the preference value corresponding to each application identifier corresponding to the user identifier is calculated.
- the above-mentioned classification threshold determination module includes: a ranking module, a quantile acquisition module, and a classification threshold calculation module.
- the sorting module is used to sort the preference values corresponding to the same application types in ascending order based on the application identifiers under the same application type corresponding to the preference values corresponding to the user identifiers to obtain the sorting results of the preference values;
- the acquisition module is used to calculate the quantile of each preference value corresponding to the same application type according to the ranking result of the preference value; the classification threshold calculation module is used to determine the classification threshold of each application type according to the quantile.
- the aforementioned quantile calculation module includes: a probability calculation module and a cumulative probability calculation module.
- the probability calculation module is used to determine the occurrence probability of each preference value of each application type in the corresponding ranking result according to the ranking result of the preference value; the cumulative probability calculation module is used to determine the occurrence probability of each application type according to the occurrence probability
- the cumulative probability of each preference value is the quantile of each preference value under each application type.
- the above-mentioned quantile obtaining module includes: a ranking data obtaining module and a quantile calculating module.
- the ranking data acquisition module is used to acquire the ranking position of each preference value under each application type in the ranking result and the number of ranking users corresponding to the application type to which each application identifier belongs;
- the quantile calculation module is used to The ranking position of each preference value under each application type in the ranking result is divided by the number of ranking users to obtain the quantile of each preference value corresponding to each application type.
- the aforementioned classification threshold calculation module includes: a first screening module, a difference calculation module, and a second screening module.
- the first filtering module is used to filter out quantiles greater than or equal to the corresponding preset threshold according to the quantile and corresponding to each application type;
- the difference calculation module is used to correspond to each application type, Calculate the difference between adjacent quantiles according to the filtered quantile;
- the second filtering module is used to obtain the quantile corresponding to each largest difference calculated for each application type to obtain each application The classification threshold of the program type.
- the aforementioned screening user identification module includes: a usage record sample acquisition module, a classification threshold adjustment module, and a condition screening module.
- Use record sample acquisition module used to obtain user usage record sample sets with known interest tags
- classification threshold adjustment module used to adjust classification thresholds according to user usage record sample sets
- condition screening module used according to user usage data Set, and filter the conditions according to the adjusted classification threshold to filter out the user ID.
- the aforementioned classification threshold adjustment module includes: a sample user usage record set acquisition module, a sample user data set determination module, a sample quantile calculation module, a sample user identification screening module, a predicted interest label generation module, and recall rate Calculation module.
- the sample user usage record set acquisition module is used to adjust the classification threshold according to the user usage record sample set.
- the sample user data set determination module is used to determine each sample application according to the known interest label according to the user usage record sample set Types of sample user usage data sets.
- the sample usage data sets include corresponding sample user IDs, sample application IDs, interest labels, and sample preference values; sample quantile calculation module for the known labels based on each sample application type
- the sample user usage data set of calculates the quantile of each sample preference value of each sample application type
- the sample user identification filter module is used to filter the sample user usage data set according to the known label and according to the classification threshold.
- the predicted interest label generation module is used to determine the user identification corresponding to the selected user identification according to the preset interest label corresponding to the sample application type of the sample user usage data set where the selected sample user identification is located Predicted interest label; recall rate calculation module, used to calculate the recall rate of each type of sample application type based on the predicted interest label of the sample user data set and the known corresponding interest label, and adjust the classification threshold.
- the preference value of each application identifier corresponding to the user identifier is determined based on the user usage record set of the application acquired within a specified time period, which better characterizes the user's preference for using each application. Furthermore, by analyzing the overall distribution of the preference values corresponding to the user IDs under the same application type, the classification threshold of each application type is determined, and the overall preference value under the same application type is fully considered. The distribution situation provides a more accurate screening basis for subsequent screening of user identification. Furthermore, the user usage data set of each application type is filtered according to the corresponding classification threshold, so as to filter out qualified user IDs, which improves the accuracy of generating interest tags for each behavior type.
- the various modules in the above apparatus for generating interest tags may be implemented in whole or in part by software, hardware, and combinations thereof.
- the foregoing modules may be embedded in the form of hardware or independent of the processor in the computer device, or may be stored in the memory of the computer device in the form of software, so that the processor can call and execute the operations corresponding to the foregoing modules.
- a computer device is provided.
- the computer device may be a server, and its internal structure diagram may be as shown in FIG. 4.
- the computer equipment includes a processor, a memory, a network interface and a database connected through a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities.
- the memory of the computer device includes a non-volatile storage medium and an internal memory.
- the non-volatile storage medium stores an operating system, a computer program, and a database.
- the internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium.
- the database of the computer equipment is used to store user usage record sets, user usage data sets, and classification threshold data.
- the network interface of the computer device is used to communicate with an external terminal through a network connection.
- the computer program is executed by the processor to realize a method of generating interest tags.
- FIG. 4 is only a block diagram of part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.
- the specific computer device may Including more or fewer parts than shown in the figure, or combining some parts, or having a different arrangement of parts.
- a computer device including a memory and a processor, the memory stores a computer program, and the processor implements the following steps when executing the computer program: Obtain a set of user usage records of an application within a specified time period Calculate the preference value corresponding to the user ID for each application ID; user usage records in the user usage record set include the user ID and the application ID; determine the application type based on the application ID, and according to the applications under the same application type
- the program identifier corresponds to the preference value corresponding to the user identifier, and the classification threshold of each application type is determined respectively;
- the application type has a corresponding preset interest tag; the user usage data set of each application type determined based on the user usage record set , And perform conditional filtering according to the classification threshold to filter out the user identification; according to the preset interest tag corresponding to the application type of the user usage data set where the screened user identification is located, the interest tag corresponding to the filtered user identification is determined.
- the processor further implements the following steps when executing the computer program: acquiring the number of users corresponding to each application identifier and the total number of users corresponding to the user usage record set; acquiring the usage weight corresponding to the user identifier and the application identifier ; According to the proportion of the total number of users to the number of users and the usage weight, calculate the preference value corresponding to each application identifier corresponding to the user identifier.
- the processor further implements the following steps when executing the computer program: based on the preference value corresponding to the user identifier corresponding to the application identifier under the same application type, the preference values corresponding to the same application type are performed in ascending order Sort to obtain the sorting result of the preference value; According to the sorting result of the preference value, calculate the quantile of each preference value corresponding to the same application type; Determine the classification threshold of each application type according to the quantile.
- the processor further implements the following steps when executing the computer program: according to the ranking result of the preference value, determining the occurrence probability of each preference value under each application type in the corresponding ranking result; determining each application according to the occurrence probability The cumulative probability of each preference value under the program type is the quantile of each preference value under each application type.
- the processor further implements the following steps when executing the computer program: acquiring the ranking position of each preference value under each application type in the ranking result and the ranking user corresponding to the application type to which each application identifier belongs Divide the ranking position of each preference value under each application type in the ranking result by the number of ranking users to obtain the quantile of each preference value corresponding to each application type.
- the processor further implements the following steps when executing the computer program: according to the quantile, corresponding to each application program type, respectively filtering out the quantile greater than or equal to the corresponding preset threshold; corresponding to each application Program type, calculate the difference between adjacent quantiles according to the filtered quantile; get the quantile corresponding to each largest difference calculated for each application type, and get the classification of each application type Threshold.
- the processor further implements the following steps when executing the computer program: obtaining a sample set of user usage records with known interest tags; adjusting the classification threshold according to the sample set of user usage records; and according to the user usage data set
- the adjusted classification threshold is subjected to conditional filtering to filter out user identification.
- the processor further implements the following steps when executing the computer program: according to the sample set of user usage records, the sample user usage data set of each sample application type is determined according to the known interest tag, and the sample usage data set includes the corresponding sample User identification, sample application identification, interest label, and sample preference value; based on the sample user usage data set of the known label of each sample application type, calculate the quantile of each sample preference value of each sample application type; Sample user usage data sets with known labels are filtered according to the classification threshold to filter out the sample user ID; according to the preset interest label corresponding to the sample application type of the user usage data set where the selected sample user ID is located, determine The predicted interest label corresponding to the selected user identification; the recall rate of each type of sample application type calculated according to the predicted interest label of the sample user data set and the known corresponding interest label, and the classification threshold is adjusted.
- the preference value of each application identifier corresponding to the user identifier is determined based on the user usage record set of the application acquired within a specified time period, which better characterizes the user's preference for using each application. Furthermore, by analyzing the overall distribution of the preference values corresponding to the user IDs under the same application type, the classification threshold of each application type is determined, and the overall preference value under the same application type is fully considered. The distribution situation provides a more accurate screening basis for subsequent screening of user identification. Furthermore, the user usage data set of each application type is filtered according to the corresponding classification threshold, so as to filter out qualified user IDs, which improves the accuracy of generating interest tags for each behavior type.
- a computer-readable storage medium on which a computer program is stored.
- the following steps are implemented: Obtain a set of user usage records of the application within a specified time period, and calculate Each application ID corresponds to the preference value corresponding to the user ID; the user usage record in the user usage record set includes the user ID and the application ID; the application type is determined based on the application ID, and the corresponding application ID under the same application type Based on the preference value corresponding to the user identification, the classification threshold of each application type is determined respectively; the application type has a corresponding preset interest tag; according to the user usage data set of each application type determined based on the user usage record set, and according to The classification threshold is conditionally filtered to filter out the user identification; according to the preset interest label corresponding to the application type of the user usage data set where the screened user identification is located, the interest label corresponding to the screened user identification is determined.
- the following steps are implemented: obtain the number of users corresponding to each application identifier and the total number of users corresponding to the user usage record set; obtain the usage weight corresponding to the user identifier and the application identifier ; According to the proportion of the total number of users to the number of users and the usage weight, calculate the preference value corresponding to each application identifier corresponding to the user identifier.
- the following steps are implemented: based on the preference value corresponding to the user identifier corresponding to the application identifier under the same application type, the preference values corresponding to the same application type are performed in ascending order Sort to obtain the sorting result of the preference value; According to the sorting result of the preference value, calculate the quantile of each preference value corresponding to the same application type; Determine the classification threshold of each application type according to the quantile.
- the following steps are implemented: according to the ranking result of the preference value, determine the occurrence probability of each preference value under each application type in the corresponding ranking result; determine each application according to the occurrence probability
- the cumulative probability of each preference value under the program type is the quantile of each preference value under each application type.
- the following steps are implemented: obtaining the ranking position of each preference value under each application type in the ranking result and the ranking user corresponding to the application type to which each application identifier belongs Divide the ranking position of each preference value under each application type in the ranking result by the number of ranking users to obtain the quantile of each preference value corresponding to each application type.
- the following steps are implemented: according to the quantile, corresponding to each application type, the quantiles that are greater than or equal to the corresponding preset threshold are filtered out; corresponding to each application Program type, calculate the difference between adjacent quantiles according to the filtered quantile; get the quantile corresponding to each largest difference calculated for each application type, and get the classification of each application type Threshold.
- the following steps are implemented: obtain a sample set of user usage records with known interest tags; adjust the classification threshold according to the sample set of user usage records; The adjusted classification threshold is subjected to conditional filtering to filter out user identification.
- a sample user usage data set of each sample application type is determined according to known interest tags, and the sample usage data set includes corresponding samples User identification, sample application identification, interest label, and sample preference value; based on the sample user usage data set of the known label of each sample application type, calculate the quantile of each sample preference value of each sample application type; Sample user usage data sets with known labels are filtered according to the classification threshold to filter out the sample user ID; according to the preset interest label corresponding to the sample application type of the user usage data set where the selected sample user ID is located, determine The predicted interest label corresponding to the selected user identification; the recall rate of each type of sample application type calculated according to the predicted interest label of the sample user data set and the known corresponding interest label, and the classification threshold is adjusted.
- the preference value of each application identifier corresponding to the user identifier is determined based on the user usage record set of the application acquired within a specified time period, which better characterizes the user's preference for using each application. Furthermore, by analyzing the overall distribution of the preference values corresponding to the user IDs under the same application type, the classification threshold of each application type is determined, and the overall preference value under the same application type is fully considered. The distribution situation provides a more accurate screening basis for subsequent screening of user identification. Furthermore, the user usage data set of each application type is filtered according to the corresponding classification threshold, so as to filter out qualified user IDs, which improves the accuracy of generating interest tags for each behavior type.
- Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
- Volatile memory may include random access memory (RAM) or external cache memory.
- RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
- SRAM static RAM
- DRAM dynamic RAM
- SDRAM synchronous DRAM
- DDRSDRAM double data rate SDRAM
- ESDRAM enhanced SDRAM
- SLDRAM synchronous chain Channel
- memory bus Radbus direct RAM
- RDRAM direct memory bus dynamic RAM
- RDRAM memory bus dynamic RAM
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (20)
- 一种生成兴趣标签的方法,其中,所述方法包括:获取在指定时间段内应用程序的用户使用记录集,计算每个应用程序标识对应于用户标识所对应的偏好值;所述用户使用记录集中的用户使用记录包括用户标识和应用程序标识;基于应用程序标识确定应用程序类型,根据相同应用程序类型下的应用程序标识对应于用户标识所对应的所述偏好值,分别确定各应用程序类型的分类阈值;所述应用程序类型存在对应的预设兴趣标签;根据基于所述用户使用记录集确定的各应用程序类型的用户使用数据集,并按照所述分类阈值进行条件筛选,以筛选出用户标识;依照筛选出的用户标识所在用户使用数据集的应用程序类型所对应的预设兴趣标签,确定筛选出的用户标识所对应的兴趣标签。
- 根据权利要求1所述的方法,其中,所述用户使用记录集中的用户使用记录还包括使用权重;所述根据在指定时间段内应用程序的用户使用记录集,计算每个应用程序标识对应于用户标识所对应的偏好值包括:获取每个应用程序标识对应的用户数以及所述用户使用记录集对应的总用户数;获取与所述用户标识和所述应用程序标识对应的使用权重;根据所述总用户数与所述用户数的比重以及所述使用权重,计算每个应用程序标识对应于用户标识所对应的偏好值。
- 根据权利要求1所述的方法,其中,所述根据相同应用程序类型下的应用程序标识对应于用户标识所对应的所述偏好值,分别确定各应用程序类型的分类阈值包括:基于所述相同应用程序类型下的应用程序标识对应于用户标识所对应的偏好值,将相同应用程序类型各自对应的偏好值按升序进行排序,得到偏好值的排序结果;根据所述偏好值的排序结果,计算相同应用程序类型下各自对应的每个偏好值的分位数;依据所述分位数确定各应用程序类型的分类阈值。
- 根据权利要求3所述的方法,其中,所述根据所述偏好值的排序结果,计算各应用程序类型下各自对应的每个偏好值的分位数包括:根据所述偏好值的排序结果,确定各应用程序类型下的每个偏好值在相应排序结果中的出现概率;根据所述出现概率确定各应用程序类型下的每个偏好值的累积概率,得到各应用程序类型下的每个偏好值的分位数;或,获取各应用程序类型下的每个偏好值在所处排序结果中的排序位以及各应用程序标识所属应用程序类型对应的排序用户数;将各应用程序类型下的每个偏好值在所处排序结果中的排序位除以所述排序用户数,获得各应用程序类型各自对应的每个偏好值的分位数。
- 根据权利要求3所述的方法,其中,所述依据所述分位数确定各应用程序类型的分类阈值包括:依据所述分位数,对应于每个应用程序类型,分别筛选出大于或等于相应预设阈值的分位数;对应于每个应用程序类型,根据筛选出的分位数计算相邻的分位数的差值;获取对应各应用程序类型计算出的每个最大的差值所对应的分位数,得到各应用程序类型的分类阈值。
- 根据权利要求1所述的方法,其中,所述根据基于所述用户使用记录集确定的各应用程序类型的用户使用数据集,并按照所述分类阈值进行条件筛选,以筛选出用户标识包括:获取已知兴趣标签的用户使用记录样本集;根据所述用户使用记录样本集,对所述分类阈值进行调整;根据所述用户使用数据集,并按照所述调整后的分类阈值进行条件筛选,以筛选出用户标识。
- 根据权利要求6所述的方法,其中,所述用户使用记录样本集中用户使用记录样本包括样本用户标识、兴趣标签、样本应用程序类型、样本应用程序标识和样本使用权重;所述根据所述用户使用记录样本集,对所述分类阈值进行调整包括:根据所述用户使用记录样本集,按所述已知兴趣标签确定各样本应用程序类型的样本用户使用数据集,所述样本使用数据集包括对应的样本用户标识、样本应用程序标识、兴趣标签和样本偏好值;基于各样本应用程序类型的已知标签的样本用户使用数据集,计算各样本应用程序类型的每个样本偏好值的分位数;根据所述已知标签的样本用户使用数据集,按照所述分类阈值进行条件筛选,以筛选出样本用户标识;依照筛选出的样本用户标识所在样本用户使用数据集的样本应用程序类型所对应的预设兴趣标签,确定筛选出的用户标识所对应的预测兴趣标签;根据所述样本用户数据集的预测兴趣标签和已知的相应兴趣标签计算出的每类样本应用程序类型的查全率,调整所述分类阈值。
- 一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,其中,所述处理器执行所述计算机程序时实现如下步骤:获取在指定时间段内应用程序的用户使用记录集,计算每个应用程序标识对应于用户标识所对应的偏好值;所述用户使用记录集中的用户使用记录包括用户标识和应用程序标识;基于应用程序标识确定应用程序类型,根据相同应用程序类型下的应用程序标识对应于用户标识所对应的所述偏好值,分别确定各应用程序类型的分类阈值;所述应用程序类型存在对应的预设兴趣标签;根据基于所述用户使用记录集确定的各应用程序类型的用户使用数据集,并按照所述分类阈值进行条件筛选,以筛选出用户标识;依照筛选出的用户标识所在用户使用数据集的应用程序类型所对应的预设兴趣标签,确定筛选出的用户标识所对应的兴趣标签。
- 根据权利要求8所述的计算机设备,其中,所述用户使用记录集中的用户使用记录还包括使用权重;所述处理器执行所述计算机程序实现所述根据在指定时间段内应用程序的用户使用记录集,计算每个应用程序标识对应于用户标识所对应的偏好值,包括:获取每个应用程序标识对应的用户数以及所述用户使用记录集对应的总用户数;获取与所述用户标识和所述应用程序标识对应的使用权重;根据所述总用户数与所述用户数的比重以及所述使用权重,计算每个应用程序标识对应于用户标识所对应的偏好值。
- 根据权利要求8所述的计算机设备,其中,所述处理器执行所述计算机程序实现所述根据相同应用程序类型下的应用程序标识对应于用户标识所对应的所述偏好值,分别确定各应用程序类型的分类阈值,包括:基于所述相同应用程序类型下的应用程序标识对应于用户标识所对应的偏好值,将相同应用程序类型各自对应的偏好值按升序进行排序,得到偏好值的排序结果;根据所述偏好值的排序结果,计算相同应用程序类型下各自对应的每个偏好值的分位数;依据所述分位数确定各应用程序类型的分类阈值。
- 根据权利要求10所述的计算机设备,其中,所述处理器执行所述计算机程序实现所述根据所述偏好值的排序结果,计算各应用程序类型下各自对应的每个偏好值的分位数,包括:根据所述偏好值的排序结果,确定各应用程序类型下的每个偏好值在相应排序结果中的出现概率;根据所述出现概率确定各应用程序类型下的每个偏好值的累积概率,得到各应用程序类型下的每个偏好值的分位数;或,获取各应用程序类型下的每个偏好值在所处排序结果中的排序位以及各应用程序标识所属应用程序类型对应的排序用户数;将各应用程序类型下的每个偏好值在所处排序结果中的排序位除以所述排序用户数,获得各应用程序类型各自对应的每个偏好值的分位数。
- 根据权利要求10所述的计算机设备,其中,所述处理器执行所述计算机程序实现所述依据所述分位数确定各应用程序类型的分类阈值,包括:依据所述分位数,对应于每个应用程序类型,分别筛选出大于或等于相应预设阈值的分位数;对应于每个应用程序类型,根据筛选出的分位数计算相邻的分位数的差值;获取对应各应用程序类型计算出的每个最大的差值所对应的分位数,得到各应用程序类型的分类阈值。
- 根据权利要求8所述的计算机设备,其中,所述处理器执行所述计算机程序实现所述根据基于所述用户使用记录集确定的各应用程序类型的用户使用数据集,并按照所述分类阈值进行条件筛选,以筛选出用户标识,包括:获取已知兴趣标签的用户使用记录样本集;根据所述用户使用记录样本集,对所述分类阈值进行调整;根据所述用户使用数据集,并按照所述调整后的分类阈值进行条件筛选,以筛选出用户标识。
- 根据权利要求13所述的计算机设备,其中,所述用户使用记录样本集中用户使用记录样本包括样本用户标识、兴趣标签、样本应用程序类型、样本应用程序标识和样本使用权 重;所述处理器执行所述计算机程序实现所述根据所述用户使用记录样本集,对所述分类阈值进行调整,包括:根据所述用户使用记录样本集,按所述已知兴趣标签确定各样本应用程序类型的样本用户使用数据集,所述样本使用数据集包括对应的样本用户标识、样本应用程序标识、兴趣标签和样本偏好值;基于各样本应用程序类型的已知标签的样本用户使用数据集,计算各样本应用程序类型的每个样本偏好值的分位数;根据所述已知标签的样本用户使用数据集,按照所述分类阈值进行条件筛选,以筛选出样本用户标识;依照筛选出的样本用户标识所在样本用户使用数据集的样本应用程序类型所对应的预设兴趣标签,确定筛选出的用户标识所对应的预测兴趣标签;根据所述样本用户数据集的预测兴趣标签和已知的相应兴趣标签计算出的每类样本应用程序类型的查全率,调整所述分类阈值。
- 一种计算机可读存储介质,其上存储有计算机程序,其中,所述计算机程序被处理器执行时实现如下步骤:获取在指定时间段内应用程序的用户使用记录集,计算每个应用程序标识对应于用户标识所对应的偏好值;所述用户使用记录集中的用户使用记录包括用户标识和应用程序标识;基于应用程序标识确定应用程序类型,根据相同应用程序类型下的应用程序标识对应于用户标识所对应的所述偏好值,分别确定各应用程序类型的分类阈值;所述应用程序类型存在对应的预设兴趣标签;根据基于所述用户使用记录集确定的各应用程序类型的用户使用数据集,并按照所述分类阈值进行条件筛选,以筛选出用户标识;依照筛选出的用户标识所在用户使用数据集的应用程序类型所对应的预设兴趣标签,确定筛选出的用户标识所对应的兴趣标签。
- 根据权利要求15所述的计算机可读存储介质,其中,所述用户使用记录集中的用户使用记录还包括使用权重;所述计算机程序被处理器执行实现所述根据在指定时间段内应用程序的用户使用记录集,计算每个应用程序标识对应于用户标识所对应的偏好值,包括:获取每个应用程序标识对应的用户数以及所述用户使用记录集对应的总用户数;获取与所述用户标识和所述应用程序标识对应的使用权重;根据所述总用户数与所述用户数的比重以及所述使用权重,计算每个应用程序标识对应于用户标识所对应的偏好值。
- 根据权利要求15所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现所述根据相同应用程序类型下的应用程序标识对应于用户标识所对应的所述偏好值,分别确定各应用程序类型的分类阈值,包括:基于所述相同应用程序类型下的应用程序标识对应于用户标识所对应的偏好值,将相同应用程序类型各自对应的偏好值按升序进行排序,得到偏好值的排序结果;根据所述偏好值的排序结果,计算相同应用程序类型下各自对应的每个偏好值的分位数;依据所述分位数确定各应用程序类型的分类阈值。
- 根据权利要求17所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现所述根据所述偏好值的排序结果,计算各应用程序类型下各自对应的每个偏好值的分位数,包括:根据所述偏好值的排序结果,确定各应用程序类型下的每个偏好值在相应排序结果中的出现概率;根据所述出现概率确定各应用程序类型下的每个偏好值的累积概率,得到各应用程序类型下的每个偏好值的分位数;或,获取各应用程序类型下的每个偏好值在所处排序结果中的排序位以及各应用程序标识所属应用程序类型对应的排序用户数;将各应用程序类型下的每个偏好值在所处排序结果中的排序位除以所述排序用户数,获得各应用程序类型各自对应的每个偏好值的分位数。
- 根据权利要求17所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现所述依据所述分位数确定各应用程序类型的分类阈值,包括:依据所述分位数,对应于每个应用程序类型,分别筛选出大于或等于相应预设阈值的分位数;对应于每个应用程序类型,根据筛选出的分位数计算相邻的分位数的差值;获取对应各应用程序类型计算出的每个最大的差值所对应的分位数,得到各应用程序类型的分类阈值。
- 根据权利要求15所述的计算机可读存储介质,其中,所述计算机程序被处理器执行时实现所述根据基于所述用户使用记录集确定的各应用程序类型的用户使用数据集,并按照所述分类阈值进行条件筛选,以筛选出用户标识,包括:获取已知兴趣标签的用户使用记录样本集;根据所述用户使用记录样本集,对所述分类阈值进行调整;根据所述用户使用数据集,并按照所述调整后的分类阈值进行条件筛选,以筛选出用户标识。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910525807.XA CN110377821A (zh) | 2019-06-18 | 2019-06-18 | 生成兴趣标签的方法、装置、计算机设备和存储介质 |
CN201910525807.X | 2019-06-18 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020253369A1 true WO2020253369A1 (zh) | 2020-12-24 |
Family
ID=68249072
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/086369 WO2020253369A1 (zh) | 2019-06-18 | 2020-04-23 | 生成兴趣标签的方法、装置、计算机设备和存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN110377821A (zh) |
WO (1) | WO2020253369A1 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110377821A (zh) * | 2019-06-18 | 2019-10-25 | 深圳壹账通智能科技有限公司 | 生成兴趣标签的方法、装置、计算机设备和存储介质 |
CN111079023B (zh) * | 2019-12-30 | 2023-06-16 | Oppo广东移动通信有限公司 | 目标帐户的识别方法、装置、终端及存储介质 |
BR112022015834A2 (pt) | 2020-02-11 | 2022-09-27 | Citrix Systems Inc | Sistemas e métodos para acesso agilizado a aplicativos |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104700289A (zh) * | 2015-03-17 | 2015-06-10 | 中国联合网络通信集团有限公司 | 广告投放方法和装置 |
CN106503269A (zh) * | 2016-12-08 | 2017-03-15 | 广州优视网络科技有限公司 | 应用推荐的方法、装置及服务器 |
US20180189410A1 (en) * | 2015-06-18 | 2018-07-05 | International Business Machines Corporation | Identification of Target Audience for Content Delivery in Social Networks by Quantifying Semantic Relations and Crowdsourcing |
US20180349944A1 (en) * | 2017-05-31 | 2018-12-06 | Facebook, Inc. | Evaluating content publisher options against benchmark publisher |
US20190034535A1 (en) * | 2017-07-25 | 2019-01-31 | Yandex Europe Ag | Method and system for generating a user-personalization interest parameter for identifying personalized targeted content item |
CN110377821A (zh) * | 2019-06-18 | 2019-10-25 | 深圳壹账通智能科技有限公司 | 生成兴趣标签的方法、装置、计算机设备和存储介质 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8479103B2 (en) * | 2009-09-15 | 2013-07-02 | International Business Machines Corporation | Visualization of real-time social data informatics |
CN107908686B (zh) * | 2017-10-31 | 2020-01-14 | Oppo广东移动通信有限公司 | 信息推送方法、装置、服务器以及可读存储介质 |
-
2019
- 2019-06-18 CN CN201910525807.XA patent/CN110377821A/zh active Pending
-
2020
- 2020-04-23 WO PCT/CN2020/086369 patent/WO2020253369A1/zh active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104700289A (zh) * | 2015-03-17 | 2015-06-10 | 中国联合网络通信集团有限公司 | 广告投放方法和装置 |
US20180189410A1 (en) * | 2015-06-18 | 2018-07-05 | International Business Machines Corporation | Identification of Target Audience for Content Delivery in Social Networks by Quantifying Semantic Relations and Crowdsourcing |
CN106503269A (zh) * | 2016-12-08 | 2017-03-15 | 广州优视网络科技有限公司 | 应用推荐的方法、装置及服务器 |
US20180349944A1 (en) * | 2017-05-31 | 2018-12-06 | Facebook, Inc. | Evaluating content publisher options against benchmark publisher |
US20190034535A1 (en) * | 2017-07-25 | 2019-01-31 | Yandex Europe Ag | Method and system for generating a user-personalization interest parameter for identifying personalized targeted content item |
CN110377821A (zh) * | 2019-06-18 | 2019-10-25 | 深圳壹账通智能科技有限公司 | 生成兴趣标签的方法、装置、计算机设备和存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN110377821A (zh) | 2019-10-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020253369A1 (zh) | 生成兴趣标签的方法、装置、计算机设备和存储介质 | |
US9092549B2 (en) | Recommendation of search keywords based on indication of user intention | |
WO2021012790A1 (zh) | 页面数据生成方法、装置、计算机设备及存储介质 | |
US10152479B1 (en) | Selecting representative media items based on match information | |
WO2018157818A1 (zh) | 用户偏好的推测方法、装置、终端设备及存储介质 | |
WO2020143156A1 (zh) | 热点视频标注处理方法、装置、计算机设备及存储介质 | |
WO2022105129A1 (zh) | 内容数据推荐方法、装置、计算机设备及存储介质 | |
CN110457577B (zh) | 数据处理方法、装置、设备和计算机存储介质 | |
CN110555164B (zh) | 群体兴趣标签的生成方法、装置、计算机设备和存储介质 | |
CN110674144A (zh) | 用户画像生成方法、装置、计算机设备和存储介质 | |
WO2021179631A1 (zh) | 卷积神经网络模型压缩方法、装置、设备及存储介质 | |
CN111400126B (zh) | 网络服务异常数据检测方法、装置、设备和介质 | |
WO2023029356A1 (zh) | 基于句向量模型的句向量生成方法、装置及计算机设备 | |
CN112784168B (zh) | 信息推送模型训练方法以及装置、信息推送方法以及装置 | |
CN112104505B (zh) | 应用推荐方法、装置、服务器和计算机可读存储介质 | |
CN111209929A (zh) | 访问数据处理方法、装置、计算机设备及存储介质 | |
CN110598090B (zh) | 兴趣标签的生成方法、装置、计算机设备和存储介质 | |
CN113010795A (zh) | 用户动态画像生成方法、系统、存储介质及电子设备 | |
CN109697155B (zh) | It系统性能评估方法、装置、设备及可读存储介质 | |
CN107392220B (zh) | 数据流的聚类方法和装置 | |
CN110162535B (zh) | 用于执行个性化的搜索方法、装置、设备以及存储介质 | |
WO2019019387A1 (zh) | 信息推送建议生成方法、装置、计算机设备和存储介质 | |
WO2022213662A1 (zh) | 应用推荐方法、系统、终端以及存储介质 | |
CN112767027B (zh) | 一种基于业务感知的云成本预测方法和系统 | |
CN111353052B (zh) | 一种多媒体对象推荐方法、装置、电子设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20827747 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20827747 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 29/03/2022) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20827747 Country of ref document: EP Kind code of ref document: A1 |