WO2015158308A1 - 判断用户年龄段的方法及装置 - Google Patents

判断用户年龄段的方法及装置 Download PDF

Info

Publication number
WO2015158308A1
WO2015158308A1 PCT/CN2015/076905 CN2015076905W WO2015158308A1 WO 2015158308 A1 WO2015158308 A1 WO 2015158308A1 CN 2015076905 W CN2015076905 W CN 2015076905W WO 2015158308 A1 WO2015158308 A1 WO 2015158308A1
Authority
WO
WIPO (PCT)
Prior art keywords
age
data
user
users
consumption data
Prior art date
Application number
PCT/CN2015/076905
Other languages
English (en)
French (fr)
Inventor
李青锋
牟川
Original Assignee
北京京东尚科信息技术有限公司
北京京东世纪贸易有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京京东尚科信息技术有限公司, 北京京东世纪贸易有限公司 filed Critical 北京京东尚科信息技术有限公司
Priority to US15/303,947 priority Critical patent/US20170032398A1/en
Priority to AU2015246423A priority patent/AU2015246423A1/en
Publication of WO2015158308A1 publication Critical patent/WO2015158308A1/zh
Priority to AU2018203129A priority patent/AU2018203129A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0204Market segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0721Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment within a central processing unit [CPU]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • G06F11/0754Error or fault detection not based on redundancy by exceeding limits
    • G06F11/076Error or fault detection not based on redundancy by exceeding limits by exceeding a count or rate limit, e.g. word- or bit count limit
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/067Enterprise or organisation modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0254Targeted advertisements based on statistics

Definitions

  • the present invention relates to the field of Internet information analysis, and in particular to a method and apparatus for determining a user age range.
  • age is not required. Even if a small number of people fill in this information, some people are perfunctory and inaccurate, resulting in such important data missing in the database. serious.
  • age is an important information of a user is because there are very big differences in the living habits, attitudes and values of users of different ages. For e-commerce, shopping habits are very different. Therefore, by grasping the age of the user, it is possible to conduct targeted marketing for the user, thereby increasing user stickiness.
  • Tencent's relevant technicians estimate the user's age based on massive data.
  • the method includes: acquiring basic age data of a user, and assigning an initial weight value to the basic age data; obtaining an age of the user in different basic age data according to the initial weight value and age similarity of the user in different basic age data Weight; find basic age data as described in The age with the largest age weight, the age with the highest age weight is taken as the initial estimated age of the user.
  • Other prior art related to the present invention mainly include: Naive Bayesian algorithm technology, massive data processing technology, and python programming technology.
  • the existing solution is to segment the age of the user, that is to say the final age of all users.
  • the disadvantage of this is that the granularity is coarse and the age of the user cannot be expressed finely.
  • the object of the present invention is to more accurately determine the age range of a user by analyzing the user's consumption data, thereby achieving targeted marketing according to age group characteristics.
  • a method for determining a user age range based on user's consumption data comprising: acquiring a plurality of consumption data of a plurality of users; and based on the acquired plurality of consumption data Modeling to create a model that satisfies a particular condition, the modeling further comprising: dividing the consumption data into training data and test data; calculating a number of users of the training data in a plurality of predefined age segments, calculating Calculating the number of each of the three-level categories of the plurality of predetermined age groups, and calculating, based on the number of users and the number of the third-level categories, each tuple of the test data belongs to the a probability of each of a plurality of predefined age ranges; selecting an age segment to which the maximum probability of the probabilities belongs as an age segment to which the user of the corresponding tuple belongs; comparing the plurality of predefined age segments with the selected The error between the age groups, the prediction error rate is obtained
  • dividing the consumption data into training data and test data further comprises: segmenting the consumption data according to the plurality of predetermined age groups; and removing the amount of the third-level category in the consumption data is less than a predetermined number The amount of consumption data.
  • the ratio of the number of the training data and the test data is 7:3.
  • the predetermined threshold is 0.7.
  • the method further comprises selectively providing the user with an advertisement, suggestion, report, notification, message, media, or any combination thereof based on the selected age group.
  • an apparatus for determining a user age range based on user's consumption data comprising: an input module for acquiring a plurality of consumptions of a plurality of users Data; a modeling module for modeling based on the acquired plurality of consumption data to create a model that satisfies a specific condition, the modeling module further comprising: a calculation module configured to Dividing the consumption data into training data and test data; calculating a number of users of the training data in a plurality of predefined age groups; calculating the training data at each of the plurality of predetermined age groups a quantity of the category; and calculating, based on the number of users and the number of the third-level categories, a probability that each tuple of the test data belongs to each of the plurality of predefined age ranges;
  • the selection module is configured to select an age segment to which the maximum probability of the probabilities belongs as an age segment to which the user of the corresponding tuple belongs; a comparison module, the comparison module
  • the modeling module is further configured to: segment the consumption data in accordance with the plurality of predetermined age segments; and remove the amount of third-level categories in the consumption data that is less than a predetermined number of consumption data.
  • the ratio of the number of the training data and the test data is 7:3.
  • the predetermined threshold is 0.7.
  • the apparatus further comprises a presentation module for selectively providing advertisements, suggestions, reports, notifications, messages, media, or any combination thereof to the user based on the selected age group.
  • the age range of the user can be accurately and automatically determined.
  • FIG. 1 illustrates a view of an apparatus 100 for determining a user's age range, in accordance with an embodiment of the present invention.
  • FIG. 2 illustrates a schematic diagram of a scheme 200 for determining a user's age range in accordance with the present invention.
  • FIG. 3 illustrates a flow diagram of a method 300 for estimating a user age segment based on user's consumption data, in accordance with an embodiment of the present invention.
  • a method and apparatus for determining a user's age range is disclosed in accordance with an embodiment of the present invention.
  • numerous specific details are set forth However, it is apparent to those skilled in the art that the embodiments of the invention may be practiced without these specific details.
  • the applications and services provided to the user often depend on the age of the user as an important factor in providing an effective service.
  • users of different ages may be interested in different services.
  • ads, content, apps, and the like are typically designed for a specific age audience. For example, college students usually belong to the group of household consumption, while adults usually belong to the group of household consumption. Therefore, knowing the age range of users can help to provide users with Service.
  • relevant advertisements, content, and applications can be pushed to the user in relation to age, so that the user device does not bear the burden of a large amount of other information irrelevant to the age range of the user.
  • some services require users to be at a certain age, and product information about children of different ages needs to be targeted at consumers with children of the appropriate age group.
  • the age range of the user can be determined by considering multiple aspects of the user. For example, the user's consumption data during a particular time period may reflect the user's age range. For example, a family with a child has a different spending habit than a single person or family without a child, and a family with children of different ages also has a difference. Therefore, analyzing the user's consumption data can estimate the user's age range.
  • analysis can be performed for consumption data of a user in a specific time period such as the most recent year.
  • the specific time period is selected as the consumption data of the most recent year because the user's age will also increase with the passage of time, and the consumption characteristics of the most recent year reflect the behavioral habits of the current age, along with the user. As the age grows, the user's spending habits will change accordingly, so the annual behavior can be a true reflection of consumer behavior and characteristics during that age.
  • time units can be used to more accurately reflect trends or changes in consumption characteristics of a particular age group, for example, three months and six months.
  • the e-commerce can set a plurality of predefined age groups in the system, each age group including a specific age range.
  • the age range can also be customized by the user. For example, you can divide the age range into the following five segments:
  • Paragraph 1 15-18 years old: groups without spending power
  • FIG. 1 illustrates a device for determining a user's age range according to an embodiment of the present invention. 100 view.
  • the apparatus 100 includes an input module 101, a modeling module 103, an application module 105, a presentation module 107, and a controller 109.
  • the functions of these modules can be combined in one or more components or performed by other components having equivalent functions.
  • the input module 101 is used to input consumption data of the user during a particular time period.
  • the modeling module 103 is used to model the consumption data to create a model that meets certain conditions.
  • the application module 105 is for estimating the age range of the user based on the model created at the modeling module 103.
  • the presentation module 107 is for selectively providing advertisements, suggestions, reports, notifications, messages, media, or any combination thereof to the user based on the estimated age group.
  • the controller 109 is used to monitor tasks, including tasks performed by the input module 101, the modeling module 103, the application module 105, and the rendering module 107.
  • the modeling module 103 further includes a calculation module 111, a selection module 113, and a comparison module 115.
  • the calculation module 111 may generate training data and test data based on the input data, calculate a number of users of the training data in a plurality of predefined age groups, and calculate the training data for each of the plurality of predetermined age groups The number of grades. Then, the calculation module 111 calculates a probability that each tuple of the test data belongs to each of the plurality of predefined age segments based on the number of users and the number of the third-level categories.
  • the selection module 113 selects an age segment to which the maximum probability of the probabilities belongs as the age segment to which the user of the corresponding tuple belongs.
  • the comparison module 115 is configured to compare the error between the known age segment and the selected age segment in the test data to obtain a prediction accuracy rate.
  • the modeling module 103 outputs a model whose prediction accuracy rate is greater than or equal to a predetermined threshold, preferably outputs a model with a prediction accuracy rate greater than or equal to 0.7.
  • the application module 105 calculates the age range of the user using the model output by the modeling module 103, and outputs the calculation result to the presentation module 107.
  • modeling module 103 determines that the user age segment is related to the naive Bayesian algorithm.
  • the naive Bayesian algorithm is a probabilistic classification algorithm based on very simple points. The class idea, for the given item to be classified, solves the probability of occurrence of each category under the condition of occurrence of this item, and the maximum considers which category the item to be classified belongs to. For example, if the probability of occurrence of a particular user at multiple ages set by the e-commerce is determined, the age segment in which the greatest probability occurs is the age segment to which the particular user belongs.
  • Xk has two cases of classification attribute and continuous attribute. In this model, it is a classification attribute. If the classification attribute, then
  • Ci) (the value of the attribute Ak is the number of tuples of xk in Ci) / (the number of tuples of Ci in D).
  • Classification usually involves two steps: the establishment of the model and the application of the model.
  • model a data set that has been identified for a category.
  • the data set used to build the model is called the training set, and the single tuple in the training set is called the training sample.
  • Each tuple in the training set belongs to a certain category, and the categories are represented by class labels.
  • Learning models are provided in the form of classification rules or mathematical formulas. In practice, the number of samples that are known to be classified According to the training set, after learning the training set, the rules about classification are obtained, so that the new data is classified.
  • test set refers to a set of independent samples that have not been used in designing the identification and classification system to evaluate the ability of the model to predict, etc., in order to validate the model.
  • Figure 2 illustrates a schematic diagram of a scheme 200 for determining a user's age range in accordance with the present invention.
  • the scheme for judging the user age segment mainly includes two parts, a model creation and a model application, wherein the model creation includes: the modeling data is split into training data and test data (the ratio is 7:3), and the training data is passed through Nao Subei.
  • the Bayesian algorithm generates a Bayesian model, and the test data evaluates the quality of the model through the Bayesian model. By continuously adjusting the features and classification labels, a better model is finally obtained.
  • the model application includes, for example, predicting all user data that conforms to the characteristics of the model through the model and finally obtaining a large amount of user age data.
  • the final data characteristics the user's third-level category consumption data in the most recent year, the specific modeling data can be shown in Table 1.
  • the method and steps for entering a data set are:
  • the input data format is shown in Table 2:
  • Input data format is shown in Table 4.
  • the output data format is shown in Table 5.
  • the data is divided into training data and test data in a ratio of 7:3. Model the training data and evaluate the model with the test data.
  • the user age range is estimated according to the following steps.
  • a tuple in the test data to belong to the category with the highest probability of each category as the category to which the user of the tuple belongs. Specifically, the age group corresponding to the maximum probability that each person in D_Test belongs to each age group is selected as the age group to which the user belongs.
  • the evaluation model can be performed, for example, according to the following criteria: (1) prediction accuracy; (2) model creation speed and speed of use; (3) robustness; (4) adaptation of the model to data with noise or vacancy values Ability; (5) scalability; (6) when data increases The adaptability of the model; (7) interpretability, that is, the degree of comprehensibility of the model.
  • the prediction accuracy is above 70%; the algorithm is very efficient, and the prediction of 30 million users can be completed within 5 minutes.
  • the e-commerce can selectively provide advertisements, suggestions, reports, notifications, messages, media, or any combination thereof to the user.
  • FIG. 3 illustrates a flow diagram of a method 300 for estimating a user age segment based on user's consumption data, in accordance with an embodiment of the present invention.
  • method 300 begins at step 301.
  • the input module 101 acquires a plurality of consumption data of a plurality of users.
  • the calculation module 111 generates training data and test data.
  • the calculation module 111 calculates the number of users of the training data in a plurality of predefined age segments.
  • the calculation module 105 calculates the number of each of the three-level categories of the training data in the plurality of predetermined age groups, and then, in step 311, the calculation module 105 is based on the number of users and the third-level category.
  • the selection module 113 selects the age segment to which the greatest probability among the probabilities belongs as the age segment to which the user of the corresponding tuple belongs.
  • the comparison module 115 compares the error between the known age segment and the selected age segment in the test data to obtain a prediction accuracy rate and outputs a model with a prediction accuracy rate greater than a particular threshold.
  • the application module 105 calculates the age range of the user using the model output by the modeling module 103 and outputs the calculation result to the presentation module 107.
  • the presentation module 107 selectively presents content, such as advertisements, suggestions, reports, notifications, messages, media, or any combination thereof, to the user based on the selected age group.
  • the method 300 ends at step 321.
  • the e-commerce can determine the age range of the registered user in a more accurate and simple manner, for example, the prediction accuracy rate is 70%. Therefore, e-commerce such as Jingdong Mall can customize the service, content, communication (for example, marketing, advertising) and the like to be more effectively associated with the user according to the age of the user, thereby enabling Strong support for marketing. At the same time, for users accessing these e-commerce websites, the user experience is significantly improved and convenient and personalized services are provided.

Abstract

一种判断用户年龄段的方法及装置。该方法包括:获取多个用户的多个消费数据(303);基于所获取的多个消费数据来建模以创建满足特定条件的模型,建模进一步包括:将消费数据划分成训练数据和测试数据(305);计算训练数据在多个预定义的年龄段中的用户数量(307),计算训练数据在多个预定的年龄段的每个三级品类的数量(309),并且基于用户数量和三级品类的数量,计算测试数据的每个元组属于多个预定义的年龄段中的每个的概率(311);选择概率中的最大概率所属的年龄段作为对应元组的用户所属的年龄段(313);比较多个预定义的年龄段与所选择的年龄段之间的误差,得到预测误差率,并输出预测误差率大于或等于预定阈值的模型(315);以及利用所输出的模型来计算用户的年龄段(317)。

Description

判断用户年龄段的方法及装置 技术领域
本发明涉及互联网信息分析领域,并且具体地涉及判断用户年龄段的方法及装置。
背景技术
近些年来互联网快速发展,给人们带来了极大的便利和好处,人们可以通过互联网来进行娱乐、购物、交友等活动。网站也通过用户的注册信息向用户提供更加舒适,针对性强的服务,但是由于网络的虚拟性,许多用户不愿过多透露个人信息。
为了提高用户的注册时间效率,年龄并不是必填项,即使很少的一部分人填了此项信息,仍然有一部分人是敷衍了事,并不准确,从而造成如此重要的数据在数据库中缺失严重。年龄之所以是一个用户的重要信息是因为不同年龄用户的生活习惯、人生态度、价值观有非常大的区别,对于电商来说就是购物习惯差别很大。因此掌握用户的年龄,就可以对用户进行定向营销,从而提高用户粘性。
由于珍贵的用户年龄信息十分有限,而且存在一定误差,所以一部分人通过互联网行业数据和经验将用户的年龄进行过滤,从而得到相对准确的年龄数据,这种方法只能获取一部分用户的年龄,只是庞大用户群的冰山一角。
腾讯公司相关技术人员基于海量数据对用户年龄进行了估算。方法包括:获取用户的基本年龄数据,为所述基本年龄数据赋初始权值;根据所述初始权值以及用户在不同基本年龄数据中的年龄相似度,获取用户在不同基本年龄数据中的年龄权值;查找基本年龄数据中所述 年龄权值最大的年龄,将年龄权值最大的年龄作为用户的初步估算年龄。其他与本发明相关的现有技术主要有:朴素贝叶斯算法技术,海量数据处理技术,python编程技术。
现有的方案是将用户的年龄分段,也就是说最终得到的是所有用户的年龄段。这样的缺点的是粒度较粗,不能精细地表达用户的年龄。
因此,需要一种可以更精确地判断用户年龄的技术方案。
发明内容
本发明的目的是通过分析用户的消费数据来更精确地确定用户的年龄段,从而实现根据年龄段特征的定向营销。
根据本发明的一个实施例,提供了一种用于基于用户的消费数据来判断用户年龄段的方法,所述方法包括:获取多个用户的多个消费数据;基于所获取的多个消费数据来建模以创建满足特定条件的模型,所述建模进一步包括:将所述消费数据划分成训练数据和测试数据;计算所述训练数据在多个预定义的年龄段中的用户数量,计算所述训练数据在所述多个预定的年龄段的每个三级品类的数量,并且基于所述用户数量和所述三级品类的数量,计算所述测试数据的每个元组属于所述多个预定义的年龄段中的每个的概率;选择所述概率中的最大概率所属的年龄段作为对应元组的用户所属的年龄段;比较所述多个预定义的年龄段与所选择的年龄段之间的误差,得到预测误差率,并输出所述预测误差率大于或等于预定阈值的模型;以及利用所输出的模型来计算用户的年龄段。
优选地,将所述消费数据划分成训练数据和测试数据进一步包括:将所述消费数据按照所述多个预定的年龄段进行分段;以及去掉所述消费数据中三级品类的数量小于预定数目的消费数据。
优选地,所述训练数据和所述测试数据的数目比例是7∶3。
优选地,所述预定阈值是0.7。
优选地,所述方法进一步包括:基于所选择的年龄段,向所述用户选择性地提供广告、建议、报告、通知、消息、媒体或其任何组合。
根据本发明的另一个实施例,提供了一种用于基于用户的消费数据来判断用户年龄段的装置,所述装置包括:输入模块,所述输入模块用于获取多个用户的多个消费数据;建模模块,所述建模模块用于基于所获取的多个消费数据来建模以创建满足特定条件的模型,所述建模模块进一步包括:计算模块,所述计算模块被配置成将所述消费数据划分成训练数据和测试数据;计算所述训练数据在多个预定义的年龄段中的用户数量;计算所述训练数据在所述多个预定的年龄段的每个三级品类的数量;以及基于所述用户数量和所述三级品类的数量,计算所述测试数据的每个元组属于所述多个预定义的年龄段中的每个的概率;选择模块,所述选择模块被配置成选择所述概率中的最大概率所属的年龄段作为对应元组的用户所属的年龄段;比较模块,所述比较模块被配置成比较所述多个预定义的年龄段与所选择的年龄段之间的误差,得到预测误差率,并输出所述预测误差率大于或等于预定阈值的模型;以及应用模块,所述应用模块用于利用所输出的模型来计算用户的年龄段。
优选地,所述建模模块进一步被配置成:将所述消费数据按照所述多个预定的年龄段进行分段;以及去掉所述消费数据中三级品类的数量小于预定数目的消费数据。
优选地,所述训练数据和所述测试数据的数目比例是7∶3。
优选地,所述预定阈值是0.7。
优选地,所述装置进一步包括呈现模块,所述呈现模块用于基于所选择的年龄段,向所述用户选择性地提供广告、建议、报告、通知、消息、媒体或其任何组合。
根据本发明的判断用户年龄段的方案,可以精确和自动地确定用户的年龄段。根据本公开和附图的下面的详细描述,对本领域的普通技术人员来说其它的目的、特征、以及优点将是显而易见的。
附图说明
附图图示了本发明的实施例,并与说明书一起用于解释本发明的原理。在附图中:
图1图示了根据本发明的实施例的用于判断用户年龄段的装置100的视图。
图2图示了根据本发明的用于判断用户年龄段的方案200的示意图。
图3图示了根据本发明的实施例的用于基于用户的消费数据来估计用户年龄段的方法300的流程图。
具体实施方式
根据本发明的实施例公开了一种用于确定用户年龄段的方法和装置。在以下描述中,为了说明的目的,阐述了多个具体细节以提供对本发明的实施例的全面理解。然而,对于本领域人员显而易见的是,本发明的实施例可以在没有这些具体细节的情况下实现。
如上所述,用于提供给用户的应用和服务经常取决于用户的年龄,作为提供有效服务的重要因素。也就是说,不同年龄的用户可能对不同的服务感兴趣。例如,广告、内容和应用等通常为特定年龄的受众设计。例如,大学生通常属于本位消费的群体,而成年人通常属于家庭消费的群体。因此,获知用户的年龄范围可以行助于向用户提供定 制服务。而且,可以与年龄相关的向用户推送相关广告、内容和应用,从而使用户设备不承受对于用户的年龄范围无关的大量其他信息的负担。另外,一些服务需要用户处于某个年龄,关于不同年龄的儿童的产品信息需要针对具有相应年龄段儿童的消费者。
通过考虑用户的多个方面可以确定用户的年龄段。例如,用户在特定时间段期间的消费数据可以反映该用户的年龄段。例如,具有孩子的家庭与不具有孩子的单身人士或家庭具有不同的消费习惯,并且具有处于不同年龄段孩子的家庭也具有差异。因此,分析用户的消费数据可以估计用户的年龄段。
例如,可以针对用户在诸如最近一年的特定时间段中的消费数据进行分析。所述特定时间段被选择为最近一年的消费数据是因为随着时间的推移,用户的年龄也是会随着增加,最近一年的消费特征反应的是当前的年龄的行为习惯,随着用户年龄的增长,用户的消费习惯也会相应地变化,因此以年为单位可以真实反映该年龄段期间的消费行为和特性。当然,为了更精确反映特定年龄段消费特性的趋势或变化,也可以使用其他时间单位,例如,三个月、六个月。
例如,根据互联网的用户特征和电商的实际情况,电商可以在系统中设置多个预定义的年龄段,每个年龄段包括特定的年龄范围。替代地,也可以由用户自定义年龄段。例如,可以将年龄段划分成以下5段:
第1段:15-18岁:没有消费能力的群体
第2段:19-25岁:未婚,处于一种本位消费的群体
第3段:26-35岁:小孩上幼儿园的消费群体
第4段:36-45岁:小孩上小学、初中、高中的消费群体
第5段:46-55岁:孩子上大学的消费群体
图1图示了根据本发明的实施例的用于判断用户年龄段的装置 100的视图。在图1中,装置100包括输入模块101、建模模块103、应用模块105、呈现模块107和控制器109。本领域技术人员理解,这些模块的功能可以组合在一个或多个组件中或由具有等同功能的其他组件执行。
在这个实施例中,输入模块101用于输入用户在特定时间段期间的消费数据。建模模块103用于对消费数据进行建模以创建满足特定条件的模型。应用模块105用于基于在建模模块103所创建的模型来估计用户的年龄段。呈现模块107用于基于所估计的年龄段,向用户选择性地提供广告、建议、报告、通知、消息、媒体或其任何组合。控制器109用于监视任务,包括由输入模块101、建模模块103、应用模块105和呈现模块107执行的任务。
建模模块103进一步包括计算模块111、选择模块113和比较模块115。计算模块111可以基于输入数据来生成训练数据和测试数据,计算训练数据在多个预定义的年龄段中的用户数量,并且计算所述训练数据在所述多个预定的年龄段的每个三级品类的数量。然后,计算模块111基于所述用户数量和所述三级品类的数量,计算所述测试数据的每个元组属于所述多个预定义的年龄段中的每个的概率。选择模块113选择所述概率中的最大概率所属的年龄段作为对应元组的用户所属的年龄段。比较模块115用于比较测试数据中的已知年龄段和所选择的年龄段之间的误差,从而得到预测正确率。建模模块103输出预测正确率大于或等于预定阈值的模型,优选地输出预测正确率大于或等于0.7的模型。
应用模块105利用由建模模块103输出的模型来计算用户的年龄段,并将计算结果输出给呈现模块107。
根据本发明的实施例,建模模块103判断用户年龄段涉及朴素贝叶斯算法。朴素贝叶斯算法是一种概率分类算法,它基于很简单的分 类思想,对于给出的待分类项,求解在此项出现的条件下各个类别出现的概率,哪个最大就认为该待分类项属于哪个类别。例如,如果确定特定用户在电商设定的多个年龄段出现的概率,则最大概率出现的年龄段就是该特定用户所属的年龄段。
朴素贝叶斯算法具体解释如下:
(1)设D是训练元组和相关联的类标号的集合。照例,每个元组用一个n维属性向量X={x1,x2...,xn}表示,描述由n个属性A1,A2,...,An对元组的n个测量。
(2)假定有m个类C1,C2,...Cm。给定元组X,分类法将预测X属于具有最高后验概率(条件X下)的类。也就是说,朴素贝叶斯分类法预测X属于类Ci,当且仅当
P(Ci|X)>P(Cj|X)j>=1且j<=m,j!=i。根据贝叶斯定理
P(Ci|X)=P(X|Ci)*P(Ci)/P(X)
(3)由于P(X)对于所有类为常数,只需要P(X|Ci)*P(Ci)最大化即可。
(4)P(Ci)=|Ci,D|/|D|,其中|Ci,D|是D中Ci类的训练元组数,|D|是D中所有元组数。
Figure PCTCN2015076905-appb-000001
Xk有分类属性和连续属性两种情况,在本模型中是分类属性,如果分类属性,则
P(xk|Ci)=(属性Ak的值为xk在Ci中的元组数)/(D中Ci的元组数)。
分类通常包括两个步骤:模型的建立和模型的应用。
首先,对一个类别已经确定的数据集建立模型。用于建立模型的数据集被称为训练集,训练集中的单个元组称为训练样本。训练集中的每一个元组都属于一个确定的类别,类别用类标号表示。学习模型用分类规则或数学公式的形式提供。在实践中,将已知分类的样本数 据作为训练集,经过对训练集的学习得到关于分类的规律,从而对新数据进行分类。
其次,使用创建的模型将类别未知的元组归入到某个或某几个类中。使用模型进行分类需要评估分类模型的预测准确率。评估的方法很多,通常使用创建的模型在一个测试集进行预测,并将结果和实际值进行比较,得出预测准确率,测试集独立于训练集。如在此使用的“测试集”指的是用于评估模型的预测等能力的在设计识别和分类系统时没有用过的独立样本集,以便验证模型。
例如,图2图示了根据本发明的用于判断用户年龄段的方案200的示意图。该用于判断用户年龄段的方案主要包括两部分,模型创建和模型应用,其中模型创建包括:建模数据拆分成训练数据和测试数据(数目比例为7∶3),训练数据通过朴素贝叶斯算法生成贝叶斯模型,测试数据通过贝叶斯模型评估模型的质量,通过对特征和分类标签的不断调整,最终得到比较优良的模型。模型应用包括:例如将所有符合模型特点的用户数据通过模型进行预测最后得到了大量的用户年龄段数据。最终确定的数据特征:用户在最近一年的三级品类消费数据,具体建模数据可以由下表1所示。
用户id 年龄段 三级品类1 三级品类2 三级品类3 三级品类4
表1年龄模型建模数据
具体实现方案
1.数据集的输入
在一个实施例中,输入数据集的方法和步骤是:
1)将同一个用户的消费商品的三级品类转化成一行,以适应算法的输入格式,如下所示:
输入数据格式如表2:
Figure PCTCN2015076905-appb-000002
表2年龄模型建模源数据
输出数据格式如表3:
字段 用户帐号 生日 三级品类1 三级品类2
eg: Fengguoying 1985/9/24 4833 655
表3年龄模型建模数据(将同一个人的三级品类放到一行)
2)将建模数据按照电商设定的多个预定的年龄段进行分段,同时去掉购买商品的三级品类数量小于特定数目(在该实施例中,是4个)的用户购买数据,以便减少估计误差。
输入数据格式如表4
字段 用户帐号 生日 三级品类1 三级品类2
eg: fengguoying 1985/9/24 4833 655
表4年龄模型建模数据(将同一个人的三级品类放到一行)
输出数据格式如表5
字段 年龄段 三级品类1 三级品类2
eg: 3 4833 655
表5年龄模型建模数据(生日转成年龄,同时分段)
2.训练集和测试集
从选取的数据集中,以7∶3的数目比例将数据分成训练数据和测试数据。用训练数据进行建模,并且用测试数据对模型进行评估。
3.年龄段判断
根据本发明的实施例,基于训练数据和测试数据,根据以下步骤来估计用户年龄段。
(1)计算训练数据各个年龄段类别的用户数量。具体地,计算D_Train各个年龄段的用户数量|Ci|。
(2)计算训练数据各个类别的每个三级品类的数量。具体地,计算D_Train各个年龄段每个三级品类的数量|xk/Ci|。
(3)根据前两步得出和数据计算测试数据每个元组属于各个年龄段的概率。具体地,根据前两步的先验概率得出D_Test每个人属于各个年龄段的概率P(X|Ci)=P(x1|Ci)*P(x2|Ci)*…*P(xn|Ci)。
(4)选择测试数据中某元组属于各类概率最大的年龄段类别作为该元组的用户所属的类别。具体地,选择D_Test中每个人属于各个年龄段的最大概率所对应的年龄段作为该用户所属的年龄段。X属于Cj,当且仅当P(X/Cj)=max(P(X/Ci))i=1,2...6。
(5)对比测试数据中的已知年龄段和所选择的年龄段之间的误差。对比D_Test中的每个已知年龄段和所选择的年龄段之间的误差,得到预测正确的用户D_Test_Correct,
得到预测正确率=|D_Test_Correct|/|D_Test|
(6)重复以上步骤,计算所有用户的年龄段。具体地,如果正确率>=0.7,则利用该模型来计算用户的年龄段,否则停止;根据模型计算所有用户D_All的年龄段,方法同第(3)(4)步骤。
另外,评估模型例如可以根据以下标准来进行:(1)预测准确率;(2)模型的创建速度和使用速度;(3)强壮性;(4)模型对具有噪声或空缺值的数据的适应能力;(5)伸缩性;(6)数据大量增加时 候模型的适应能力;(7)可解释性,即对模型的可理解程度。例如,根据本发明的技术方案,预测准确率在70%以上;算法十分高效,可以5分钟之内完成3000万用户的预测。
基于所计算的用户的年龄段,电商可以向该用户选择性地提供广告、建议、报告、通知、消息、媒体或其任何组合。
图3图示了根据本发明的实施例的用于基于用户的消费数据来估计用户年龄段的方法300的流程图。
如图3所示,方法300在步骤301开始。在步骤303,输入模块101获取多个用户的多个消费数据。在步骤305,计算模块111生成训练数据和测试数据。在步骤307,计算模块111计算所述训练数据在多个预定义的年龄段中的用户数量。在步骤309,计算模块105计算所述训练数据在所述多个预定的年龄段的每个三级品类的数量,然后,在步骤311,计算模块105基于所述用户数量和所述三级品类的数量,计算所述测试数据的每个元组属于所述多个预定义的年龄段中的每个的概率。在步骤313,选择模块113选择所述概率中的最大概率所属的年龄段作为对应元组的用户所属的年龄段。在步骤315,比较模块115比较测试数据中的已知年龄段和所选择的年龄段之间的误差,从而得到预测正确率,并输出预测正确率大于特定阈值的模型。在步骤317,应用模块105利用由建模模块103输出的模型来计算用户的年龄段,并将计算结果输出给呈现模块107。这样,在步骤319,呈现模块107基于所选择的年龄段,向所述用户选择性地呈现内容,诸如广告、建议、报告、通知、消息、媒体或其任何组合。方法300在步骤321结束。
根据本发明的实施例的用于判断用户年龄段的方案,可以使电商以更精确简单的方式确定注册用户的年龄段,例如,预测准确率达70%。因此,诸如京东商城的电商根据用户的年龄段使定制服务、内容、通信(例如,营销、广告)等更行效地与用户相关联,从而能够进行定 向营销提供了强有力的支持。同时对于访问这些电商网站的用户来说,显著提升了用户的体验度并提供便捷的个性化服务。
上述实施例仅是本发明的优选实施例,并不用于限制本发明。对本领域技术人员显而易见的是,在不脱离本发明精神和范围的情况下,可以对本发明的实施例进行各种修改和改变。因此,本发明意在涵盖落入如权利要求所限定的本发明的范围之内的所有的修改或变型。

Claims (10)

  1. 一种用于基于用户的消费数据来判断用户年龄段的方法,所述方法包括:
    获取多个用户的多个消费数据;
    基于所获取的多个消费数据来建模以创建满足特定条件的模型,所述建模进一步包括:
    将所述消费数据划分成训练数据和测试数据;计算所述训练数据在多个预定义的年龄段中的用户数量,计算所述训练数据在所述多个预定的年龄段的每个三级品类的数量,并且基于所述用户数量和所述三级品类的数量,计算所述测试数据的每个元组属于所述多个预定义的年龄段中的每个的概率;
    选择所述概率中的最大概率所属的年龄段作为对应元组的用户所属的年龄段;
    比较所述多个预定义的年龄段与所选择的年龄段之间的误差,得到预测误差率,并输出所述预测误差率大于或等于预定阈值的模型;以及
    利用所输出的模型来计算用户的年龄段。
  2. 根据权利要求1所述的方法,将所述消费数据划分成训练数据和测试数据进一步包括:
    将所述消费数据按照所述多个预定的年龄段进行分段;以及
    去掉所述消费数据中三级品类的数量小于预定数目的消费数据。
  3. 根据权利要求1或2所述的方法,其中,所述训练数据和所述测试数据的数目比例是7∶3。
  4. 根据权利要求1所述的方法,其中,所述预定阈值是0.7。
  5. 根据权利要求1所述的方法,进一步包括:
    基于所选择的年龄段,向所述用户选择性地提供广告、建议、报告、通知、消息、媒体或其任何组合。
  6. 一种用于基于用户的消费数据来判断用户年龄段的装置,所述装置包括:
    输入模块,所述输入模块用于获取多个用户的多个消费数据;
    建模模块,所述建模模块用于基于所获取的多个消费数据来建模以创建满足特定条件的模型,所述建模模块进一步包括:
    计算模块,所述计算模块被配置成将所述消费数据划分成训练数据和测试数据;计算所述训练数据在多个预定义的年龄段中的用户数量;计算所述训练数据在所述多个预定的年龄段的每个三级品类的数量;以及基于所述用户数量和所述三级品类的数量,计算所述测试数据的每个元组属于所述多个预定义的年龄段中的每个的概率;
    选择模块,所述选择模块被配置成选择所述概率中的最大概率所属的年龄段作为对应元组的用户所属的年龄段;
    比较模块,所述比较模块被配置成比较所述多个预定义的年龄段与所选择的年龄段之间的误差,得到预测误差率,并输出所述预测误差率大于或等于预定阈值的模型;以及
    应用模块,所述应用模块用于利用所输出的模型来计算用户的年龄段。
  7. 根据权利要求6所述的装置,所述计算模块进一步被配置成:
    将所述消费数据按照所述多个预定的年龄段进行分段;以及
    去掉所述消费数据中三级品类的数量小于预定数目的消费数据。
  8. 根据权利要求6或7所述的装置,其中,所述训练数据和所述测试数据的数目比例是7∶3。
  9. 根据权利要求6所述的装置,其中,所述预定阈值是0.7。
  10. 根据权利要求6所述的装置,进一步包括:
    呈现模块,所述呈现模块用于基于所选择的年龄段,向所述用户选择性地提供广告、建议、报告、通知、消息、媒体或其任何组合。
PCT/CN2015/076905 2014-04-18 2015-04-17 判断用户年龄段的方法及装置 WO2015158308A1 (zh)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US15/303,947 US20170032398A1 (en) 2014-04-18 2015-04-17 Method and apparatus for judging age brackets of users
AU2015246423A AU2015246423A1 (en) 2014-04-18 2015-04-17 Method and apparatus for judging age brackets of users
AU2018203129A AU2018203129A1 (en) 2014-04-18 2018-05-04 Method and apparatus for judging age brackets of users

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410158028.8 2014-04-18
CN201410158028.8A CN103927675B (zh) 2014-04-18 2014-04-18 判断用户年龄段的方法及装置

Publications (1)

Publication Number Publication Date
WO2015158308A1 true WO2015158308A1 (zh) 2015-10-22

Family

ID=51145889

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/076905 WO2015158308A1 (zh) 2014-04-18 2015-04-17 判断用户年龄段的方法及装置

Country Status (4)

Country Link
US (1) US20170032398A1 (zh)
CN (1) CN103927675B (zh)
AU (2) AU2015246423A1 (zh)
WO (1) WO2015158308A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106126597A (zh) * 2016-06-20 2016-11-16 乐视控股(北京)有限公司 用户属性预测方法及装置

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103927675B (zh) * 2014-04-18 2017-07-11 北京京东尚科信息技术有限公司 判断用户年龄段的方法及装置
CN104410710B (zh) * 2014-12-15 2018-04-03 北京国双科技有限公司 数据推送方法和装置
CN104992060A (zh) * 2015-06-25 2015-10-21 腾讯科技(深圳)有限公司 用户年龄估计方法及装置
KR20170033549A (ko) * 2015-09-17 2017-03-27 삼성전자주식회사 디스플레이 장치, 그 제어 방법 및 컴퓨터 판독가능 기록 매체
CN105931066A (zh) * 2015-09-24 2016-09-07 中国银联股份有限公司 一种交易数据处理方法及装置
CN107239456B (zh) * 2016-03-28 2020-10-30 创新先进技术有限公司 年龄段识别方法及装置
CN108022116B (zh) * 2016-11-01 2021-06-29 北京京东尚科信息技术有限公司 对用户建模的方法、系统及终端设备
CN106503863A (zh) * 2016-11-10 2017-03-15 北京红马传媒文化发展有限公司 基于决策树模型的年龄特征的预测方法、系统及终端
US10929772B1 (en) * 2016-12-20 2021-02-23 Facebook, Inc. Systems and methods for machine learning based age bracket determinations
CN106651057B (zh) * 2017-01-03 2020-04-10 有米科技股份有限公司 一种基于安装包序列表的移动端用户年龄预测方法
CN108510336B (zh) * 2017-02-23 2021-11-12 北京京东尚科信息技术有限公司 用于确定用户数据信息的方法、装置、电子设备和存储介质
CN108470285B (zh) * 2017-02-23 2021-11-12 北京京东尚科信息技术有限公司 用于获取用户数据信息的方法、装置、电子设备和存储介质
CN107103366B (zh) * 2017-04-24 2020-06-30 北京京东尚科信息技术有限公司 用于生成用户的年龄信息的方法和装置
CN107316205A (zh) * 2017-05-27 2017-11-03 银联智惠信息服务(上海)有限公司 识别持卡人属性的方法、装置、计算机可读介质及系统
CN108335131B (zh) * 2018-01-19 2022-06-03 北京奇艺世纪科技有限公司 一种预估用户年龄段的方法、装置及电子设备
CN108985173B (zh) * 2018-06-19 2022-04-05 奕通信息科技(上海)股份有限公司 面向有标记噪声表观年龄数据库的深度网络迁移学习方法
CN110796506A (zh) * 2018-08-03 2020-02-14 北京京东尚科信息技术有限公司 一种异常订单判定方法和装置
CN109376927A (zh) * 2018-10-24 2019-02-22 阿里巴巴集团控股有限公司 一种用户年龄预测方法、装置及设备
CN109614544B (zh) * 2018-10-30 2023-11-03 北京奇虎科技有限公司 一种用户的个人信息的预测方法及装置
CN111125181B (zh) * 2018-10-31 2023-03-14 北京国双科技有限公司 获得年龄占比的方法和装置、机器可读存储介质及处理器
KR102537781B1 (ko) 2018-11-13 2023-05-30 삼성전자주식회사 전자 장치 및 이의 제어 방법
KR102224089B1 (ko) * 2019-01-16 2021-03-08 주식회사 카카오 음악 연령에 기초하여 음악 컨텐츠를 추천하는 방법 및 장치
CN109636491A (zh) * 2019-01-25 2019-04-16 西窗科技(苏州)有限公司 一种搜索引擎广告关键词投放的优化方法及装置
US20200293590A1 (en) * 2019-03-17 2020-09-17 Kirill Rebrov Computer-implemented Method and System for Age Classification of First Names
CN111324509B (zh) * 2020-02-18 2023-07-11 广东小天才科技有限公司 一种对应用沉迷的识别方法和装置
US11924219B1 (en) * 2023-10-11 2024-03-05 KYC AVC UK Ltd. Age assurance during an interactive query workflow

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101359995A (zh) * 2008-09-28 2009-02-04 腾讯科技(深圳)有限公司 提供在线服务的方法和装置
US20130035912A1 (en) * 2011-08-04 2013-02-07 Margines Edward Y Systems and methods of processing personality information
CN103309990A (zh) * 2013-06-18 2013-09-18 上海晶樵网络信息技术有限公司 基于互联网用户公开信息的用户多维度分析与监测方法
CN103927675A (zh) * 2014-04-18 2014-07-16 北京京东尚科信息技术有限公司 判断用户年龄段的方法及装置

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6430539B1 (en) * 1999-05-06 2002-08-06 Hnc Software Predictive modeling of consumer financial behavior
US7594189B1 (en) * 2005-04-21 2009-09-22 Amazon Technologies, Inc. Systems and methods for statistically selecting content items to be used in a dynamically-generated display
US7672912B2 (en) * 2006-10-26 2010-03-02 Microsoft Corporation Classifying knowledge aging in emails using Naïve Bayes Classifier
US9996844B2 (en) * 2008-09-30 2018-06-12 Excalibur Ip, Llc Age-targeted online marketing using inferred age range information
US8352319B2 (en) * 2009-03-10 2013-01-08 Google Inc. Generating user profiles
US20140236708A1 (en) * 2010-02-01 2014-08-21 Nevallco, Llc Methods and apparatus for a predictive advertising engine
US20120310729A1 (en) * 2010-03-16 2012-12-06 Dalto John H Targeted learning in online advertising auction exchanges
US8655695B1 (en) * 2010-05-07 2014-02-18 Aol Advertising Inc. Systems and methods for generating expanded user segments
US20120030020A1 (en) * 2010-08-02 2012-02-02 International Business Machines Corporation Collaborative filtering on spare datasets with matrix factorizations
US9092797B2 (en) * 2010-09-22 2015-07-28 The Nielsen Company (Us), Llc Methods and apparatus to analyze and adjust demographic information
US20120130805A1 (en) * 2010-11-18 2012-05-24 Google Inc. Selecting media advertisements for presentation based on their predicted playtimes
US8738549B2 (en) * 2010-12-21 2014-05-27 International Business Machines Corporation Predictive modeling
CN102663026B (zh) * 2012-03-22 2015-09-23 浙江盘石信息技术股份有限公司 一种定向投放网络广告的实现方法
WO2014093621A2 (en) * 2012-12-15 2014-06-19 Thomson Licensing Proposing objects to a user to efficiently discover demographics from item ratings
US11308503B2 (en) * 2013-03-15 2022-04-19 Tunein, Inc. System and method for providing crowd sourced metrics for network content broadcasters
CN103577195B (zh) * 2013-11-14 2016-10-12 中国联合网络通信集团有限公司 一种软件需求分析量化方法及系统
US20150161633A1 (en) * 2013-12-06 2015-06-11 Asurion, Llc Trend identification and reporting
US10115121B2 (en) * 2013-12-11 2018-10-30 Adobe Systems Incorporated Visitor session classification based on clickstreams

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101359995A (zh) * 2008-09-28 2009-02-04 腾讯科技(深圳)有限公司 提供在线服务的方法和装置
US20130035912A1 (en) * 2011-08-04 2013-02-07 Margines Edward Y Systems and methods of processing personality information
CN103309990A (zh) * 2013-06-18 2013-09-18 上海晶樵网络信息技术有限公司 基于互联网用户公开信息的用户多维度分析与监测方法
CN103927675A (zh) * 2014-04-18 2014-07-16 北京京东尚科信息技术有限公司 判断用户年龄段的方法及装置

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106126597A (zh) * 2016-06-20 2016-11-16 乐视控股(北京)有限公司 用户属性预测方法及装置

Also Published As

Publication number Publication date
CN103927675A (zh) 2014-07-16
CN103927675B (zh) 2017-07-11
AU2015246423A1 (en) 2016-11-03
US20170032398A1 (en) 2017-02-02
AU2018203129A1 (en) 2018-05-24

Similar Documents

Publication Publication Date Title
WO2015158308A1 (zh) 判断用户年龄段的方法及装置
Eslami et al. Understanding consumer engagement in social media: The role of product lifecycle
US11373233B2 (en) Item recommendations using convolutions on weighted graphs
KR101871747B1 (ko) 유사성향 기반 사용자-관광상품 추천 방법 및 시스템
Bifet Mining big data in real time
US11699095B2 (en) Cross-domain recommender systems using domain separation networks and autoencoders
CN104620267A (zh) 用于推断用户人口统计的方法和装置
JPWO2017159403A1 (ja) 予測システム、方法およびプログラム
JP6311851B2 (ja) 共クラスタリングシステム、方法およびプログラム
KR101639656B1 (ko) 광고 제공 방법 및 서버 장치
KR20150113644A (ko) 협업필터링 기반의 상품 추천 시스템에서 추천 기법을 선택하는 방법
Lee et al. Development of interval-valued fuzzy GRA with SERVPERF based on subjective and objective weights for evaluation of airline service quality: A case study of Korea low-cost carriers
TWI634499B (zh) 資料分析方法、系統及非暫態電腦可讀取媒體
US20200098031A1 (en) Product recommending apparatus and non-transitory computer readable medium
Lee et al. Determining user needs through abnormality detection and heterogeneous embedding of usage sequence
Son et al. Topic diversity of online consumer reviews and its effect on review helpfulness
Alamsyah et al. A Comparison of Indonesia’ s E-Commerce Sentiment Analysis for Marketing Intelligence Effort (case study of Bukalapak, Tokopedia and Elevenia)
Wang et al. A preliminary analysis of relationships between traveller characteristics and hotel review ratings
JP6306951B2 (ja) ソーシャルネットワークサービスにおけるノイズ投稿の分類方法およびシステム
JP5735034B2 (ja) プロフィール推定モデル学習装置、プロフィール推定装置、方法、及びプログラム
Lee et al. Abnormal usage sequence detection for identification of user needs via recurrent neural network semantic variational autoencoder
US20170185907A1 (en) Method of probabilistic inference using open statistics
KR20200142871A (ko) 명시적 및 내연적 평가 정보를 이용한 아이템 추천 방법 및 장치
CN110162714A (zh) 内容推送方法、装置、计算设备和计算机可读存储介质
KR102332348B1 (ko) 다중 상태 파급 모델을 이용한 영향력 평가 방법 및 장치

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15779830

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 15303947

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: IDP00201606972

Country of ref document: ID

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2015246423

Country of ref document: AU

Date of ref document: 20150417

Kind code of ref document: A

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 08.02.2017)

122 Ep: pct application non-entry in european phase

Ref document number: 15779830

Country of ref document: EP

Kind code of ref document: A1