WO2017041226A1 - Commodity information pushing method - Google Patents

Commodity information pushing method Download PDF

Info

Publication number
WO2017041226A1
WO2017041226A1 PCT/CN2015/089130 CN2015089130W WO2017041226A1 WO 2017041226 A1 WO2017041226 A1 WO 2017041226A1 CN 2015089130 W CN2015089130 W CN 2015089130W WO 2017041226 A1 WO2017041226 A1 WO 2017041226A1
Authority
WO
WIPO (PCT)
Prior art keywords
candidate
item set
information
frequent
item
Prior art date
Application number
PCT/CN2015/089130
Other languages
French (fr)
Chinese (zh)
Inventor
罗辉
李灵
李光煌
谭和华
Original Assignee
深圳市赛亿科技开发有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市赛亿科技开发有限公司 filed Critical 深圳市赛亿科技开发有限公司
Priority to CN201580001597.4A priority Critical patent/CN107251075A/en
Priority to PCT/CN2015/089130 priority patent/WO2017041226A1/en
Publication of WO2017041226A1 publication Critical patent/WO2017041226A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the present invention relates to the field of information processing, and in particular, to a method for pushing commodity information.
  • the technical problem to be solved by the present invention is to provide a method for accurately pushing peripheral goods related to purchased products.
  • a method for pushing product information includes the following steps:
  • Receiving a request for a customer to purchase an item the request for purchasing the item at least including category information of the item, wherein one category of the item includes at least one specific item;
  • the information of the third party purchase commodity is combined into at least one candidate item set according to a preset method, wherein the candidate item set whose support degree is greater than or equal to the preset support degree is a frequent item set;
  • the method for determining the candidate item set and finding the frequent item set and the longest frequent item set includes the following steps:
  • each candidate item set of the layer is an item, and traversing the layer to calculate the support degree of all candidate item sets;
  • the candidate set of the new layer is generated by the frequent set of items found in the previous time, and the support of all candidate sets of the new layer is calculated to generate a new frequent set of items, and so on;
  • the generation of the frequent item set is ended, and the last frequent item set containing the most types of goods is the longest frequent item set.
  • the obtained information of the third party purchase commodity is preprocessed.
  • the e-commerce pushes related products to the mobile phone number or mailbox registered by the consumer.
  • pre-processing the obtained information of the third-party purchased product includes: unifying the data format and eliminating duplicate data.
  • the process of unifying the data format includes: data type conversion, attribute construction, data discretization, and data standardization.
  • the commodity information pushing method of the present invention uses an association algorithm to analyze consumer product information to provide more accurate product pushing or advertising. For the manufacturer, it is able to sell a certain product to the customer, push the related products or peripheral parts and consumables to the customer, broaden the sales channel, and improve the marketing efficiency. For the customer, according to the new product advertisements released by the merchant and the information pushed by the merchant, more product information is obtained, which facilitates the purchase of the customer and broadens the purchase channel.
  • FIG. 2 is a flow chart of finding a frequent item set and a longest frequent item set according to the present invention.
  • the commodity information pushing method includes the following steps:
  • step S1 in order to quickly and accurately locate the consumer's shopping demand, when the merchant enters the network platform provided by the e-commerce, the e-commerce classifies the merchant's merchandise, and the common merchandise categories such as clothing, 3C, and daily necessities. After the merchants to be settled are verified by the e-commerce, their product information will be published to the corresponding information release section according to the classification of the e-commerce. After the consumers purchase the goods on the online platform provided by the e-commerce, the relevant product information (including the product category, product name, price, purchase time, etc.) is stored in the back-end server of the e-commerce. For example, if the consumer A currently purchases the apparel product, the product information of the remaining consumers (ie, third parties) who also have the purchase record of the apparel product may be selected.
  • the relevant product information including the product category, product name, price, purchase time, etc.
  • the product information may also be selected according to the personal information (including age, gender, etc.) of the consumer A. For example, if the consumer A is a female, the product information of the remaining female consumers is selected.
  • the commodity information obtained in step S1 can also be preprocessed.
  • some items are very time-sensitive, and a time T can be defined. If the purchase time of the item has exceeded the time T today, the item can be deleted from the information of the third-party purchased item obtained in step S1.
  • the data discretization method may be: discretizing the attributes of the continuous values into a plurality of intervals.
  • the price of a commodity is continuous, but consumers tend to choose within a range, not an independent and exact value. Therefore, for the convenience of analysis, the interval can be divided according to the price segment, and each interval is represented by a numerical value.
  • the numbers 1, 2, 3, 4, ... can be used to represent: 0 to 30, 31 to 60, 60 to 90, 90 to 120, ... different price ranges.
  • S2 defines support, confidence, and association rules.
  • the two thresholds of support and confidence are two concepts describing association rules.
  • Support supports the importance of association rules in the database.
  • Confidence measures the credibility of association rules.
  • Support (X, Y) ⁇ T1 means that among all the purchase records selected, at least the purchase of T1 (available percentage) is presented as two items of X and Y being purchased by the same consumer.
  • Confidence(X,Y) ⁇ T2 means that the purchase of at least T2 (available percentage) is presented as the purchase of X by the same consumer.
  • the purchase record of the clothing product in which the consumer A currently purchases the clothing product and the third party purchases the information stored in the background server is as shown in Table 1, wherein one serial number represents a consumer, A, B. , C, D, E, F each represents different commodities of clothing:
  • the information of the third-party purchased product is combined into at least one candidate item set according to a preset method, wherein the candidate item set whose support degree is greater than or equal to the preset support degree is a frequent item set; and the frequent item including the most product type is selected. Set as the longest frequent item set.
  • the preset support degree is the defined minimum support degree. As shown in Figure 2, the following methods can be used:
  • the project set shown in Table 2 is the first candidate project set, and the count and support of each project set are as follows:
  • the minimum support degree threshold F1 is set to 60%, wherein the candidate item set ⁇ B ⁇ , ⁇ F ⁇ support degree ⁇ F1, candidate item set ⁇ A ⁇ , ⁇ C ⁇ , ⁇ D ⁇ , ⁇ E ⁇ ⁇ F1.
  • the minimum support count value is at least 3. Therefore, it can also be judged based on the minimum support count value.
  • the candidate item set has a minimum support degree threshold requirement, the candidate item set is extracted as a frequent item set.
  • the candidate item sets ⁇ A ⁇ , ⁇ C ⁇ , ⁇ D ⁇ , and ⁇ E ⁇ are frequent item sets.
  • step S34 it can be seen from step S33 that the situation of the second layer candidate item set is as shown in Table 3:
  • the candidate project sets ⁇ A, D ⁇ and ⁇ C, D ⁇ do not satisfy the minimum support, while the candidate project sets ⁇ A, C ⁇ , ⁇ A, E ⁇ , ⁇ C, E ⁇ , ⁇ D, E ⁇ are frequent items. set.
  • the situation of the third-level candidate project set is shown in Table 4:
  • step S35 it is known from step S34 that the longest frequent item set is ⁇ A, C, E ⁇ .
  • multiple candidate association rules can be obtained, as follows:
  • the support degree of the above association rule satisfies the minimum support degree threshold requirement.
  • the processing reliability is the defined minimum confidence threshold. If the minimum confidence threshold F2 is 0.75, the candidate association rules of step S5 all meet the minimum confidence requirement. The association rule defined by step S2 knows that the candidate shown in step S5 is known. The association rules are all associated rules that meet the requirements.
  • the product information pushing method of the invention analyzes the consumer's product information by using an association algorithm to provide more accurate product pushing or advertising. For the manufacturer, it is able to sell a certain product to the customer, push the related products or peripheral parts and consumables to the customer, broaden the sales channel, and improve the marketing efficiency. For the customer, according to the new product advertisements released by the merchant and the information pushed by the merchant, more product information is obtained, which facilitates the purchase of the customer and broadens the purchase channel.

Abstract

A commodity information pushing method, comprising the following steps: receiving and responding to a request of a consumer for purchasing a commodity (S1); defining a support degree, a confidence degree and an association rule (S2); finding out a frequent item set and a longest frequent item set (S3); forming a plurality of candidate association rules (S4); respectively calculating the confidence degrees of the candidate association rules (S5); generating an association rule (S6); and pushing corresponding commodity information to the consumer according to the association rule (S7). According to the present invention, an association algorithm is used to analyse information about a commodity of a consumer, so that more accurate commodity pushing or advertisement publication can be provided.

Description

一种商品信息推送方法Product information pushing method 技术领域Technical field
本发明涉及信息处理领域,尤其涉及一种商品信息推送方法。The present invention relates to the field of information processing, and in particular, to a method for pushing commodity information.
背景技术Background technique
随着电子商务技术的发展,能精确定位消费者需求的商品推送技术成为研究热点。现有技术中,通过统计消费者们购买某类型商品的次数或者商品价格区间分布情况来向消费者们推送商品的方法已经不能满足商家的需求,能够记录消费者们购买的商品情况、分析消费者们购买的习惯,然后智能推送与消费者购买的商品相关的周边商品(包括零配件)的方法才是目前广大商家的需求。例如,消费者们购买手机,商品推送系统就能推送充电宝、耳机;消费者们购买开发板,商品推送系统就能推送下载器、杜邦线。由此可见,能精确推送与购买商品相关的周边商品的方法不但有利于提升商家销售额,也提升了消费者们的购买体验。With the development of e-commerce technology, commodity push technology that can accurately locate consumer demand has become a research hotspot. In the prior art, by counting the number of times a consumer purchases a certain type of product or the distribution of a commodity price interval, the method of pushing the product to the consumer can no longer satisfy the demand of the merchant, and can record the situation of the commodity purchased by the consumer and analyze the consumption. The habit of buying, and then intelligently pushing the surrounding products (including spare parts) related to the goods purchased by consumers is the demand of the majority of businesses. For example, when consumers buy mobile phones, the product push system can push charging treasures and headphones; consumers buy development boards, and the product push system can push downloaders and DuPont lines. It can be seen that the method of accurately pushing the surrounding goods related to the purchase of goods not only helps to enhance the sales of the merchants, but also enhances the purchasing experience of the consumers.
发明内容Summary of the invention
本发明所要解决的技术问题在于提供一种能精确推送与已购买商品相关的周边商品的方法。The technical problem to be solved by the present invention is to provide a method for accurately pushing peripheral goods related to purchased products.
一种商品信息推送方法,包括有以下步骤:A method for pushing product information includes the following steps:
接收到客户购买商品的请求,所述购买商品的请求至少包括商品的类别信息,其中,所述商品的一个类别包括至少一种具体商品;Receiving a request for a customer to purchase an item, the request for purchasing the item at least including category information of the item, wherein one category of the item includes at least one specific item;
响应客户请求,根据所述客户购买商品的类别信息获取至少一个第三方购买商品的信息,其中,所述第三方购买商品的信息至少包含该客户购买商品的类别信息;Responding to the client request, obtaining, according to the category information of the purchased product of the customer, information of at least one third-party purchased product, wherein the information of the third-party purchased product includes at least category information of the purchased product of the customer;
定义支持度、置信度和关联规则;Define support, confidence, and association rules;
所述第三方购买商品的信息根据预设的方法组合成至少一个候选项目集,其中支持度大于或等于预设支持度的候选项目集为频繁项目集;The information of the third party purchase commodity is combined into at least one candidate item set according to a preset method, wherein the candidate item set whose support degree is greater than or equal to the preset support degree is a frequent item set;
选择包含商品种类最多的频繁项目集为最长频繁项目集; Select the frequent itemsets that contain the most types of products as the longest frequent itemsets;
将构成所述最长频繁项目集的所有商品任意组合以形成多个候选关联规则;All the commodities constituting the longest frequent item set are arbitrarily combined to form a plurality of candidate association rules;
分别计算所述候选关联规则的置信度;Calculating the confidence of the candidate association rule separately;
选择候选关联规则的置信度满足大于或等于预设的置信度的候选关联规则作为规定的关联规则;以及Selecting a candidate association rule whose candidate association rule satisfies a preset confidence value greater than or equal to a preset confidence degree as a specified association rule;
根据所述规定的关联规则向消费者推送相应的商品信息。Pushing corresponding product information to the consumer according to the prescribed association rule.
进一步地,确定候选项目集,找出所述频繁项目集及最长频繁项目集的方法包括以下步骤:Further, the method for determining the candidate item set and finding the frequent item set and the longest frequent item set includes the following steps:
确定第一层候选项目集,该层的每个候选项目集为一种商品,遍历该层计算所有候选项目集的支持度;Determining a first layer candidate item set, each candidate item set of the layer is an item, and traversing the layer to calculate the support degree of all candidate item sets;
将每个候选项目集的支持度与预设支持度进行比较;Compare the support of each candidate item set with the preset support level;
若候选项目集的支持度大于或等于预设支持度,则提取该候选项目集作为频繁项目集;If the support degree of the candidate item set is greater than or equal to the preset support degree, extracting the candidate item set as a frequent item set;
新一层的候选项目集由前一次发现的频繁项目集产生,并计算新一层所有候选项目集的支持度以产生新的频繁项目集,依次类推;The candidate set of the new layer is generated by the frequent set of items found in the previous time, and the support of all candidate sets of the new layer is calculated to generate a new frequent set of items, and so on;
若候选项目集的支持度小于预设支持度,则结束频繁项目集的产生,最后得到的包含商品种类最多的频繁项目集为最长频繁项目集。If the support degree of the candidate item set is less than the preset support degree, the generation of the frequent item set is ended, and the last frequent item set containing the most types of goods is the longest frequent item set.
进一步地,对获取的所述第三方购买商品的信息进行预处理。Further, the obtained information of the third party purchase commodity is preprocessed.
进一步地,电商向消费者所登记的手机号或者邮箱推送相关的商品。Further, the e-commerce pushes related products to the mobile phone number or mailbox registered by the consumer.
进一步地,对获取的所述第三方购买商品的信息进行预处理的内容包括:进行数据格式的统一化、消除重复数据。Further, pre-processing the obtained information of the third-party purchased product includes: unifying the data format and eliminating duplicate data.
进一步地,对数据格式的统一化过程包括:数据类型转换、属性构造、数据离散化、数据标准化。Further, the process of unifying the data format includes: data type conversion, attribute construction, data discretization, and data standardization.
本发明与现有技术相比的有益效果是:本发明的商品信息推送方法,运用关联算法,分析消费者的商品信息,以提供更精准的商品推送或者广告发布。对厂商而言,能够在向顾客销售某一产品的同时,向顾客推送与其相关的产品或者周边零配件、耗材,拓宽了销售渠道,提高了营销效率。对顾客而言,能够根据商家发布的新品广告和商家推送的信息,获知更多的产品信息,方便了顾客的购买,拓宽了购买渠道。 The beneficial effects of the present invention compared with the prior art are: the commodity information pushing method of the present invention uses an association algorithm to analyze consumer product information to provide more accurate product pushing or advertising. For the manufacturer, it is able to sell a certain product to the customer, push the related products or peripheral parts and consumables to the customer, broaden the sales channel, and improve the marketing efficiency. For the customer, according to the new product advertisements released by the merchant and the information pushed by the merchant, more product information is obtained, which facilitates the purchase of the customer and broadens the purchase channel.
附图说明DRAWINGS
图1为本发明在一些实施例中的流程图。1 is a flow chart of some embodiments of the present invention.
图2为本发明找出频繁项目集及最长频繁项目集的流程图。2 is a flow chart of finding a frequent item set and a longest frequent item set according to the present invention.
具体实施方式detailed description
为了更充分理解本发明的技术内容,下面结合具体实施例对本发明的技术方案作进一步介绍和说明。In order to more fully understand the technical content of the present invention, the technical solutions of the present invention will be further described and illustrated below in conjunction with specific embodiments.
如图1所示,在一些实施例中,商品信息推送方法,包括有以下步骤:As shown in FIG. 1, in some embodiments, the commodity information pushing method includes the following steps:
S1,接收到客户购买商品的请求,所述购买商品的请求至少包括商品的类别信息,其中,所述商品的一个类别包括至少一种具体商品;响应客户请求,根据所述客户购买商品的类别信息获取至少一个第三方购买商品的信息,其中,所述第三方购买商品的信息至少包含该客户购买商品的类别信息。S1. Receiving, by the customer, a request for purchasing a commodity, where the request for purchasing the commodity includes at least category information of the commodity, wherein one category of the commodity includes at least one specific commodity; and responding to the customer request, according to the category of the customer purchasing the commodity The information acquires information of at least one third party purchase item, wherein the information of the third party purchase item includes at least category information of the item purchased by the customer.
对于步骤S1,为了快速、准确定位消费者的购物需求,当商家入驻电商提供的网络平台时,电商会对商家的商品进行分类,常见的商品类别如服装类、3C类、日用品类等。拟入驻的商家通过电商的验证后,其商品信息会按照电商的分类方式发布到相应的信息发布版块中。消费者们在电商提供的网络平台上购买商品后,相关的商品信息(包括商品类别、商品名称、价格、购买时间等)便被储存至电商的后台服务器内。例如,如果消费者甲当前购买了服装类产品,则可以选取同样有服装类产品购买记录的其余消费者(即第三方)的商品信息。For step S1, in order to quickly and accurately locate the consumer's shopping demand, when the merchant enters the network platform provided by the e-commerce, the e-commerce classifies the merchant's merchandise, and the common merchandise categories such as clothing, 3C, and daily necessities. After the merchants to be settled are verified by the e-commerce, their product information will be published to the corresponding information release section according to the classification of the e-commerce. After the consumers purchase the goods on the online platform provided by the e-commerce, the relevant product information (including the product category, product name, price, purchase time, etc.) is stored in the back-end server of the e-commerce. For example, if the consumer A currently purchases the apparel product, the product information of the remaining consumers (ie, third parties) who also have the purchase record of the apparel product may be selected.
还可根据消费者甲的个人信息(包括年龄、性别等)来选取商品信息,例如消费者甲为女性,则选取其余女性消费者的商品信息。The product information may also be selected according to the personal information (including age, gender, etc.) of the consumer A. For example, if the consumer A is a female, the product information of the remaining female consumers is selected.
为了减少无关数据量,以提高商品推送的效率和准确度,还可以对步骤S1得到的商品信息进行预处理。In order to reduce the amount of irrelevant data, in order to improve the efficiency and accuracy of commodity push, the commodity information obtained in step S1 can also be preprocessed.
例如,有些商品的时效性很强,可以定义一个时间T,如果该商品的购买时间距离今日已超过时间T,则可将该商品从步骤S1得到的第三方购买商品的信息中删除。For example, some items are very time-sensitive, and a time T can be defined. If the purchase time of the item has exceeded the time T today, the item can be deleted from the information of the third-party purchased item obtained in step S1.
在一些实施例中,存在将多个电商的后台服务器的商品信息合并的情况,由于商品信息来源不同,对同一商品的定义可能不一样,比如某种零件的的底座, 有的用“基底”表示,有的用“基座”表示,有的用“底座”表示,但实质上是一种商品,所以需要统一定义。因此,还需要进行数据格式的统一化、消除重复数据等过程。本领域技术人员可以理解的,对数据格式的统一化过程包括:数据类型转换、属性构造、数据离散化、数据标准化。In some embodiments, there is a case where product information of a plurality of e-commerce back-end servers is merged. Since the source of the product information is different, the definition of the same item may be different, such as the base of a certain part, Some are represented by "base", some are represented by "base", and some are represented by "base", but they are essentially a commodity, so they need to be uniformly defined. Therefore, it is also necessary to unify the data format and eliminate duplicate data. Those skilled in the art can understand that the process of unifying the data format includes: data type conversion, attribute construction, data discretization, and data standardization.
其中,数据离散化的方法可以为:将连续取值的属性离散成若干个区间。比如,商品的价格是连续的,但消费者往往在一个区间段内进行选择,并非是某个独立确切的值。故为了分析的方便,可根据价格段划分区间,每个区间用一个数值表示。例如,可以分别用数字1、2、3、4......来代表:0~30、31~60、60~90、90~120......这些不同的价格区间。The data discretization method may be: discretizing the attributes of the continuous values into a plurality of intervals. For example, the price of a commodity is continuous, but consumers tend to choose within a range, not an independent and exact value. Therefore, for the convenience of analysis, the interval can be divided according to the price segment, and each interval is represented by a numerical value. For example, the numbers 1, 2, 3, 4, ... can be used to represent: 0 to 30, 31 to 60, 60 to 90, 90 to 120, ... different price ranges.
S2,定义支持度、置信度和关联规则。S2, defines support, confidence, and association rules.
支持度和置信度两个阈值是描述关联规则的两个概念,支持度(support)反映的是关联规则在数据库中的重要性,置信度(confidence)衡量关联规则的可信程度。Support(X,Y)≥T1表示:在所选取的所有购买记录中,至少有T1(可用百分数表示)的购买呈现为X与Y这两种商品被同一消费者购买。Confidence(X,Y)≥T2表示:至少有T2(可用百分数表示)的购买呈现为在购买了X情况下,同一消费者又购买了商品Y。The two thresholds of support and confidence are two concepts describing association rules. Support supports the importance of association rules in the database. Confidence measures the credibility of association rules. Support (X, Y) ≥ T1 means that among all the purchase records selected, at least the purchase of T1 (available percentage) is presented as two items of X and Y being purchased by the same consumer. Confidence(X,Y) ≥ T2 means that the purchase of at least T2 (available percentage) is presented as the purchase of X by the same consumer.
设定最小支持度(min-Support)=F1,最小置信度(min-Confidence)=F2。其中F1和F2的取值范围为[0,1]。如果经过挖掘找到的关联规则{X,Y}同时满足最小支持度、最小置信度条件,则表示可接受{X,Y}的关联规则,关联规则便产生了。可描述为:Support(X,Y)≥F1 and Confidence(X,Y)≥F2。Set minimum support (min-Support) = F1, minimum confidence (min-Confidence) = F2. The range of F1 and F2 is [0, 1]. If the association rule {X, Y} found by mining finds both the minimum support and the minimum confidence condition, it means that the association rule of {X, Y} is acceptable, and the association rule is generated. It can be described as: Support (X, Y) ≥ F1 and Confidence (X, Y) ≥ F2.
例如,消费者甲当前购买了服装类产品,后台服务器中所存储的第三方购买商品的信息的服装类产品的购买记录如表1所示,其中,一个序号代表一位消费者,A、B、C、D、E、F各代表服装类的不同商品:For example, the purchase record of the clothing product in which the consumer A currently purchases the clothing product and the third party purchases the information stored in the background server is as shown in Table 1, wherein one serial number represents a consumer, A, B. , C, D, E, F each represents different commodities of clothing:
表1Table 1
序号Serial number 商品commodity
11 {A,C}{A, C}
22 {A,D,E,F}{A, D, E, F}
33 {B,C,D,E}{B, C, D, E}
44 {A,C,D,E}{A, C, D, E}
55 {A,B,C,E}{A, B, C, E}
根据表1,消费者们购买D商品在包含{C,E}两种商品中的支持度和置信度分别为:According to Table 1, the degree of support and confidence of consumers purchasing D products in products containing {C, E} are:
支持度Support=2/5=0.4(在所选取的所有购买记录中,至少有0.4的购买呈现为同一消费者购买了D、C、E三种商品。)Support Support=2/5=0.4 (At least 0.4 of the purchases in the selected purchases are presented as the same consumer who purchased D, C, and E.)
置信度Confidence=2/3≈0.67(在所选取的所有购买记录中,至少有0.67的购买呈现为在购买了D的情况下,同一消费者又购买了C、E两种商品。)Confidence=2/3≈0.67 (At least 0.67 of the purchases selected are presented as the purchase of D, the same consumer purchased both C and E.)
S3,所述第三方购买商品的信息根据预设的方法组合成至少一个候选项目集,其中支持度大于或等于预设支持度的候选项目集为频繁项目集;选择包含商品种类最多的频繁项目集为最长频繁项目集。S3. The information of the third-party purchased product is combined into at least one candidate item set according to a preset method, wherein the candidate item set whose support degree is greater than or equal to the preset support degree is a frequent item set; and the frequent item including the most product type is selected. Set as the longest frequent item set.
此处预设支持度为定义的最小支持度,结合图2所示,具体可以采用如下方法:Here, the preset support degree is the defined minimum support degree. As shown in Figure 2, the following methods can be used:
S31,确定第一层候选项目集(K=1),该层的每个候选项目集为一种商品,遍历该层计算所有候选项目集的支持度。S31. Determine a first layer candidate item set (K=1), and each candidate item set of the layer is a commodity, and traverse the layer to calculate the support degree of all candidate item sets.
表2所示的项目集为第一层候选项目集,各项目集的计数及支持度情况如下:The project set shown in Table 2 is the first candidate project set, and the count and support of each project set are as follows:
表2Table 2
候选项目集Candidate set 计数count 支持度Support
{A}{A} 44 80%80%
{B}{B} 22 40%40%
{C}{C} 44 80%80%
{D}{D} 33 60%60%
{E}{E} 44 80%80%
{F}{F} 11 20%20%
S32,将每个候选项目集的支持度与最小支持度阈值进行比较。S32. Compare the support degree of each candidate item set with a minimum support degree threshold.
假设最小支持度阈值F1设为60%,其中,候选项目集{B}、{F}的支持度<F1,候选项目集{A}、{C}、{D}、{E}≥F1。It is assumed that the minimum support degree threshold F1 is set to 60%, wherein the candidate item set {B}, {F} support degree <F1, candidate item set {A}, {C}, {D}, {E} ≥ F1.
本领域相关技术人员可以理解的,若要最小支持度阈值F1满足要求,则最小支持度计数值至少为3。因此,也可以根据最小支持度计数值来判断。It will be understood by those skilled in the art that if the minimum support threshold F1 satisfies the requirement, the minimum support count value is at least 3. Therefore, it can also be judged based on the minimum support count value.
S33,若候选项目集中有满足最小支持度阈值要求的,则提取该候选项目集作为频繁项目集。S33. If the candidate item set has a minimum support degree threshold requirement, the candidate item set is extracted as a frequent item set.
根据步骤S32可知,候选项目集{A}、{C}、{D}、{E}为频繁项目集。According to step S32, the candidate item sets {A}, {C}, {D}, and {E} are frequent item sets.
S34,新一层的候选项目集由前一次发现的频繁项目集产生,并计算新一层(K=K+1)所有候选项目集的支持度以生产新的频繁项目集,依次类推。S34, the candidate item set of the new layer is generated by the frequent item set found last time, and the support degree of all candidate item sets of the new layer (K=K+1) is calculated to produce a new frequent item set, and so on.
针对步骤S34,由步骤S33可知,第二层候选项目集的情况如表3所示:For step S34, it can be seen from step S33 that the situation of the second layer candidate item set is as shown in Table 3:
表3table 3
候选项目集Candidate set 计数count
{A,C}{A, C} 33
{A,D}{A,D} 22
{A,E}{A,E} 33
{C,D}{C,D} 22
{C,E}{C,E} 33
{D,E}{D,E} 33
候选项目集{A,D}和{C,D}不满足最小支持度,而候选项目集{A,C}、{A,E}、{C,E}、{D,E}为频繁项目集。依此方法类推,继续下一步,第三层候选项目集的情况如表4所示:The candidate project sets {A, D} and {C, D} do not satisfy the minimum support, while the candidate project sets {A, C}, {A, E}, {C, E}, {D, E} are frequent items. set. In this analogy, continue to the next step, the situation of the third-level candidate project set is shown in Table 4:
表4Table 4
候选项目集Candidate set 计数count
{A,C,E}{A, C, E} 33
由表4可知,{A,C,E}为频繁项目集依此方法类推,继续下一步:As can be seen from Table 4, {A, C, E} is a frequent item set. By analogy with this method, continue to the next step:
表5table 5
候选项目集Candidate set 计数count
{A,C,D,E}{A, C, D, E} 11
由于候选项目集{A,C,D,E}的计数值为1,不满足最小支持度阈值的要求,不是频繁项目集。Since the count value of the candidate item set {A, C, D, E} is 1, the requirement of the minimum support degree threshold is not satisfied, and it is not a frequent item set.
S35,若候选项目集的支持度小于最小支持度阈值要求的,则结束频繁项目集的产生,最后得到的包含商品种类最多的频繁项目集为最长频繁项目集。S35. If the support degree of the candidate item set is less than the minimum support degree threshold, the generation of the frequent item set is ended, and the frequently obtained frequent item set including the most common item type is the longest frequent item set.
针对步骤S35,由步骤S34可知,最长频繁项目集为{A,C,E}。For step S35, it is known from step S34 that the longest frequent item set is {A, C, E}.
S4,将构成上述最长频繁项目集的所有商品任意组合以形成多个候选关联规则。S4, arbitrarily combine all the commodities constituting the longest frequent item set described above to form a plurality of candidate association rules.
根据S3得到的最长频繁项目集{A,C,E},可得到多个候选关联规则,如下所示:According to the longest frequent items set {A, C, E} obtained by S3, multiple candidate association rules can be obtained, as follows:
(1)规则{A,C}→{E}(1) Rules {A, C}→{E}
(2)规则{A,E}→{C}(2) Rules {A, E}→{C}
(3)规则{C,E}→{A}(3) Rules {C, E}→{A}
(4)规则{A}→{C,E}(4) Rules {A}→{C,E}
(5)规则{C}→{A,E}(5) Rules {C}→{A,E}
(6)规则{E}→{A,C}(6) Rules {E}→{A,C}
由于上述候选关联规则由最长频繁项目集的所有商品任意组合形成,因此上述关联规则的支持度都满足最小支持度阈值要求。Since the candidate association rule is formed by any combination of all the commodities of the longest frequent item set, the support degree of the above association rule satisfies the minimum support degree threshold requirement.
S5,分别计算上述候选关联规则的置信度。S5, respectively calculating the confidence of the candidate association rule.
(1)规则{A,C}→{E}的置信度Confidence==3/3=1(1) Confidence of the rule {A, C}→{E} Confidence==3/3=1
(2)规则{A,E}→{C}的置信度Confidence==3/3=1(2) Confidence of the rule {A, E}→{C} Confidence==3/3=1
(3)规则{C,E}→{A}的置信度Confidence==3/3=1(3) Confidence of the rule {C, E}→{A} Confidence==3/3=1
(4)规则{A}→{C,E}的置信度Confidence==3/4=0.75 (4) Confidence of rule {A}→{C,E} Confidence==3/4=0.75
(5)规则{C}→{A,E}的置信度Confidence==3/4=0.75(5) Confidence of rule {C}→{A, E} Confidence==3/4=0.75
(6)规则{E}→{A,C}的置信度Confidence==3/4=0.75(6) Confidence of rule {E}→{A, C} Confidence==3/4=0.75
S6,关联规则的产生:选择候选关联规则的置信度满足大于或等于预设的置信度的候选关联规则作为规定的关联规则S6. Generating an association rule: selecting a candidate association rule whose candidate association rule satisfies a predetermined confidence greater than or equal to a predetermined confidence rule as a specified association rule
此处置信度为定义的最小置信度阈值,假设最小置信度阈值F2为0.75,则步骤S5的候选关联规则均满足最小置信度要求,由步骤S2定义的关联规则可知,步骤S5示出的候选关联规则均为满足规定的关联规则。The processing reliability is the defined minimum confidence threshold. If the minimum confidence threshold F2 is 0.75, the candidate association rules of step S5 all meet the minimum confidence requirement. The association rule defined by step S2 knows that the candidate shown in step S5 is known. The association rules are all associated rules that meet the requirements.
S7,根据上述规定的关联规则向消费者们推送相应的商品信息。S7, pushing corresponding product information to consumers according to the association rule specified above.
(1)规则{A,C}→{E},代表若消费者已购买A、C商品,则应向其推送E商品。(1) The rules {A, C} → {E}, if the consumer has purchased the A and C goods, they should push the E product to them.
(2)规则{A,E}→{C},代表若消费者已购买A、E商品,则应向其推送C商品。(2) The rules {A, E} → {C}, if the consumer has purchased the A, E goods, they should push the C goods to them.
(3)规则{C,E}→{A},代表若消费者已购买C、E商品,则应向其推送A商品。(3) The rule {C, E}→{A} means that if the consumer has purchased the C or E goods, they should push the A product to them.
(4)规则{A}→{C,E},代表若消费者已购买A商品,则应向其推送C、E商品。(4) The rule {A}→{C,E} means that if the consumer has purchased the A product, the C and E products should be pushed to it.
(5)规则{C}→{A,E},代表若消费者已购买C商品,则应向其推送A、E商品。(5) The rule {C}→{A, E} means that if the consumer has purchased the C product, the A and E products should be pushed to it.
(6)规则{E}→{A,C},代表若消费者已购买E商品,则应向其推送A、C商品。(6) The rule {E}→{A,C} means that if the consumer has purchased the E product, the A and C products should be pushed to it.
本发明的商品信息推送方法,运用关联算法,分析消费者的商品信息,以提供更精准的商品推送或者广告发布。对厂商而言,能够在向顾客销售某一产品的同时,向顾客推送与其相关的产品或者周边零配件、耗材,拓宽了销售渠道,提高了营销效率。对顾客而言,能够根据商家发布的新品广告和商家推送的信息,获知更多的产品信息,方便了顾客的购买,拓宽了购买渠道。The product information pushing method of the invention analyzes the consumer's product information by using an association algorithm to provide more accurate product pushing or advertising. For the manufacturer, it is able to sell a certain product to the customer, push the related products or peripheral parts and consumables to the customer, broaden the sales channel, and improve the marketing efficiency. For the customer, according to the new product advertisements released by the merchant and the information pushed by the merchant, more product information is obtained, which facilitates the purchase of the customer and broadens the purchase channel.
以上陈述仅以实施例来进一步说明本发明的技术内容,以便于读者更容易理解,但不代表本发明的实施方式仅限于此,任何依本发明所做的技术延伸或再创造,均受本发明的保护。 The above description is only to clarify the technical content of the present invention by way of examples, so that the reader can understand the present invention more easily, but the embodiment of the present invention is not limited thereto, and any technology extension or re-creation according to the present invention is subject to the present invention. Protection of the invention.

Claims (6)

  1. 一种商品信息推送方法,其特征在于,包括有以下步骤:A commodity information pushing method, comprising the following steps:
    接收到客户购买商品的请求,所述购买商品的请求至少包括商品的类别信息,其中,所述商品的一个类别包括至少一种具体商品;Receiving a request for a customer to purchase an item, the request for purchasing the item at least including category information of the item, wherein one category of the item includes at least one specific item;
    响应客户请求,根据所述客户购买商品的类别信息获取至少一个第三方购买商品的信息,其中,所述第三方购买商品的信息至少包含该客户购买商品的类别信息;Responding to the client request, obtaining, according to the category information of the purchased product of the customer, information of at least one third-party purchased product, wherein the information of the third-party purchased product includes at least category information of the purchased product of the customer;
    定义支持度、置信度和关联规则;Define support, confidence, and association rules;
    所述第三方购买商品的信息根据预设的方法组合成至少一个候选项目集,其中支持度大于或等于预设支持度的候选项目集为频繁项目集;The information of the third party purchase commodity is combined into at least one candidate item set according to a preset method, wherein the candidate item set whose support degree is greater than or equal to the preset support degree is a frequent item set;
    选择包含商品种类最多的频繁项目集为最长频繁项目集;Select the frequent itemsets that contain the most types of products as the longest frequent itemsets;
    将构成所述最长频繁项目集的所有商品任意组合以形成多个候选关联规则;All the commodities constituting the longest frequent item set are arbitrarily combined to form a plurality of candidate association rules;
    分别计算所述候选关联规则的置信度;Calculating the confidence of the candidate association rule separately;
    选择候选关联规则的置信度满足大于或等于预设的置信度的候选关联规则作为规定的关联规则;以及Selecting a candidate association rule whose candidate association rule satisfies a preset confidence value greater than or equal to a preset confidence degree as a specified association rule;
    根据所述规定的关联规则向消费者推送相应的商品信息。Pushing corresponding product information to the consumer according to the prescribed association rule.
  2. 如权利要求1所述的商品信息推送方法,其特征在于,确定所述候选项目集,找出所述频繁项目集及最长频繁项目集的方法包括以下步骤:The article information pushing method according to claim 1, wherein the method of determining the candidate item set and finding the frequent item set and the longest frequent item set comprises the following steps:
    确定第一层候选项目集,该层的每个候选项目集为一种商品,遍历该层计算所有候选项目集的支持度;Determining a first layer candidate item set, each candidate item set of the layer is an item, and traversing the layer to calculate the support degree of all candidate item sets;
    将每个候选项目集的支持度与预设支持度进行比较;Compare the support of each candidate item set with the preset support level;
    若候选项目集的支持度大于或等于预设支持度,则提取该候选项目集作为频繁项目集;If the support degree of the candidate item set is greater than or equal to the preset support degree, extracting the candidate item set as a frequent item set;
    新一层的候选项目集由前一次发现的频繁项目集产生,并计算新一层所有候选项目集的支持度以产生新的频繁项目集,依次类推;The candidate set of the new layer is generated by the frequent set of items found in the previous time, and the support of all candidate sets of the new layer is calculated to generate a new frequent set of items, and so on;
    若候选项目集的支持度小于预设支持度,则结束频繁项目集的产生,最后得到的包含商品种类最多的频繁项目集为最长频繁项目集。 If the support degree of the candidate item set is less than the preset support degree, the generation of the frequent item set is ended, and the last frequent item set containing the most types of goods is the longest frequent item set.
  3. 如权利要求1所述的商品信息推送方法,其特征在于,对获取的所述第三方购买商品的信息进行预处理。The article information pushing method according to claim 1, wherein the acquired information of the third party purchased product is preprocessed.
  4. 如权利要求1所述的商品信息推送方法,其特征在于,电商向消费者所登记的手机号或者邮箱推送相关的商品。The product information pushing method according to claim 1, wherein the e-commerce pushes the related product to the mobile phone number or the mailbox registered by the consumer.
  5. 如权利要求3所述的商品信息推送方法,其特征在于,对获取的所述第三方购买商品的信息进行预处理的内容包括:进行数据格式的统一化、消除重复数据。The product information pushing method according to claim 3, wherein the pre-processing of the acquired information of the third-party purchased product comprises: unifying the data format and eliminating duplicate data.
  6. 如权利要求5所述的商品信息推送方法,其特征在于,对数据格式的统一化过程包括:数据类型转换、属性构造、数据离散化、数据标准化。 The commodity information pushing method according to claim 5, wherein the process of unifying the data format comprises: data type conversion, attribute construction, data discretization, and data standardization.
PCT/CN2015/089130 2015-09-08 2015-09-08 Commodity information pushing method WO2017041226A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201580001597.4A CN107251075A (en) 2015-09-08 2015-09-08 A kind of merchandise news method for pushing
PCT/CN2015/089130 WO2017041226A1 (en) 2015-09-08 2015-09-08 Commodity information pushing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/089130 WO2017041226A1 (en) 2015-09-08 2015-09-08 Commodity information pushing method

Publications (1)

Publication Number Publication Date
WO2017041226A1 true WO2017041226A1 (en) 2017-03-16

Family

ID=58240485

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/089130 WO2017041226A1 (en) 2015-09-08 2015-09-08 Commodity information pushing method

Country Status (2)

Country Link
CN (1) CN107251075A (en)
WO (1) WO2017041226A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110162694A (en) * 2019-04-02 2019-08-23 莫毓昌 A kind of recommender system and method based on pairs of correlation rule
CN111275523A (en) * 2020-01-17 2020-06-12 青梧桐有限责任公司 Method and system for calculating recommended amount based on irregular data
CN111815339A (en) * 2019-04-12 2020-10-23 中国移动通信集团河南有限公司 Marketing information pushing method and equipment
CN112163882A (en) * 2020-09-24 2021-01-01 山西财经大学 Fresh and fresh convenience store commodity associated display system and display method thereof
CN112818088A (en) * 2021-02-23 2021-05-18 平安科技(深圳)有限公司 Commodity search data processing method, commodity search data processing device, commodity search equipment and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111429174B (en) * 2020-03-13 2022-06-03 苏州浪潮智能科技有限公司 Commodity recommendation method, device, equipment and medium based on video analysis

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101493925A (en) * 2009-03-09 2009-07-29 浙江工商大学 Retail industry dime ticket generating method by employing increment type excavation
US7698170B1 (en) * 2004-08-05 2010-04-13 Versata Development Group, Inc. Retail recommendation domain model
CN103514267A (en) * 2013-09-04 2014-01-15 快传(上海)广告有限公司 Gateway correlation information obtaining method and system
CN103744904A (en) * 2013-12-25 2014-04-23 北京京东尚科信息技术有限公司 Method and device for providing information
CN104699755A (en) * 2015-01-07 2015-06-10 中国电子科技集团公司第三十研究所 Intelligent multi-target comprehensive identification method based on data mining

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7698170B1 (en) * 2004-08-05 2010-04-13 Versata Development Group, Inc. Retail recommendation domain model
CN101493925A (en) * 2009-03-09 2009-07-29 浙江工商大学 Retail industry dime ticket generating method by employing increment type excavation
CN103514267A (en) * 2013-09-04 2014-01-15 快传(上海)广告有限公司 Gateway correlation information obtaining method and system
CN103744904A (en) * 2013-12-25 2014-04-23 北京京东尚科信息技术有限公司 Method and device for providing information
CN104699755A (en) * 2015-01-07 2015-06-10 中国电子科技集团公司第三十研究所 Intelligent multi-target comprehensive identification method based on data mining

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110162694A (en) * 2019-04-02 2019-08-23 莫毓昌 A kind of recommender system and method based on pairs of correlation rule
CN110162694B (en) * 2019-04-02 2021-01-05 莫毓昌 Recommendation system and method based on paired association rules
CN111815339A (en) * 2019-04-12 2020-10-23 中国移动通信集团河南有限公司 Marketing information pushing method and equipment
CN111815339B (en) * 2019-04-12 2023-12-26 中国移动通信集团河南有限公司 Marketing information pushing method and device
CN111275523A (en) * 2020-01-17 2020-06-12 青梧桐有限责任公司 Method and system for calculating recommended amount based on irregular data
CN112163882A (en) * 2020-09-24 2021-01-01 山西财经大学 Fresh and fresh convenience store commodity associated display system and display method thereof
CN112818088A (en) * 2021-02-23 2021-05-18 平安科技(深圳)有限公司 Commodity search data processing method, commodity search data processing device, commodity search equipment and storage medium
WO2022178932A1 (en) * 2021-02-23 2022-09-01 平安科技(深圳)有限公司 Commodity search data processing method, apparatus and device, and storage medium
CN112818088B (en) * 2021-02-23 2023-09-29 平安科技(深圳)有限公司 Commodity search data processing method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN107251075A (en) 2017-10-13

Similar Documents

Publication Publication Date Title
WO2017041226A1 (en) Commodity information pushing method
AU2019253767B2 (en) System architecture for customer genome construction and analysis
US10282748B2 (en) System and method for measuring advertising effectiveness
TWI533245B (en) Product sale preditiction system, product sale preditiction method and non-transitory computer readable storage medium thereof
CN107332910B (en) Information pushing method and device
Cebollada et al. Online category pricing at a multichannel grocery retailer
CN107563857B (en) Customer purchasing behavior analysis method based on logic reasoning negative association rule pruning technology
US20130166372A1 (en) Utilizing real-time metrics to normalize an advertisement based on consumer reaction
US11762819B2 (en) Clustering model analysis for big data environments
CN104615721B (en) For the method and system based on return of goods related information Recommendations
JP2017220155A (en) Customer management in electronic commerce site
Chung et al. The effect of perceived value on customer loyalty in a low-priced cosmetic brand of south korea: The moderating effect of gender
JP2018045505A (en) Determination device, determination method, and determination program
GB2591677A (en) System and method for providing recommendations based on consumer location
US10810626B2 (en) Automated lists
US20200090110A1 (en) System and method for cognitive and preemptive fashion inventory management &amp; order fulfilment
JP6143930B1 (en) Marketing support method, program, computer storage medium, and marketing support system
KR101479379B1 (en) Method of sharing customer rate of merchandise based on social network
Mahmud et al. Factors influencing customers’ attitude towards online shopping: Evidence from Dhaka city
KR101567217B1 (en) Method and system for recommendation with collaborative filtering using social network analysis
Lin Big Data Analysis of Personalized Recommendation in E-Commerce
WO2015118587A1 (en) Matching system, matching method, and non-transitory computer-readable medium on which program is stored
CN110610402A (en) Commodity pushing method and computer storage medium
Serwah et al. Customer analytics for online retailers using weighted k-means and RFM analysis
Purwanto et al. Clustering of potencial customer in digital trade transaction

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15903333

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15903333

Country of ref document: EP

Kind code of ref document: A1