CN111538901B - Article recommendation method and device, server and storage medium - Google Patents
Article recommendation method and device, server and storage medium Download PDFInfo
- Publication number
- CN111538901B CN111538901B CN202010294127.4A CN202010294127A CN111538901B CN 111538901 B CN111538901 B CN 111538901B CN 202010294127 A CN202010294127 A CN 202010294127A CN 111538901 B CN111538901 B CN 111538901B
- Authority
- CN
- China
- Prior art keywords
- preset
- target
- item
- items
- exposure
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 42
- 238000012216 screening Methods 0.000 claims abstract description 23
- 230000015654 memory Effects 0.000 claims description 14
- 230000006399 behavior Effects 0.000 claims description 8
- 230000000694 effects Effects 0.000 abstract description 7
- 238000012360 testing method Methods 0.000 abstract description 7
- 238000001914 filtration Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 230000003203 everyday effect Effects 0.000 description 4
- 101000827703 Homo sapiens Polyphosphoinositide phosphatase Proteins 0.000 description 3
- 102100023591 Polyphosphoinositide phosphatase Human genes 0.000 description 3
- 238000004904 shortening Methods 0.000 description 3
- 101001121408 Homo sapiens L-amino-acid oxidase Proteins 0.000 description 2
- 102100026388 L-amino-acid oxidase Human genes 0.000 description 2
- 101100012902 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FIG2 gene Proteins 0.000 description 2
- 101100233916 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR5 gene Proteins 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000013500 data storage Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000009191 jumping Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0631—Item recommendations
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Accounting & Taxation (AREA)
- Finance (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Development Economics (AREA)
- Economics (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明实施例公开了一种物品推荐方法、装置、服务器及储存介质,该方法包括:根据预设筛选规则对预设物品集合中的预设物品进行筛选,选出目标物品;若所述目标物品的推荐次数小于第一曝光阈值,向根据所述目标物品的预设属性搜索到的预设用户推荐所述目标物品;若所述目标物品的推荐次数大于所述第一曝光阈值,根据所述目标物品的点击率在预设分组中进行排序召回;若所述目标物品的推荐次数大于第二曝光阈值,将所述目标物品与所述预设物品集合中的其他所述预设物品一并按照预设策略进行排序召回。本实施例的技术方案,缩短新物品的迭代周期,且快速试探出新物品的效果质量,能够满足资讯行业推荐系统对物品提出的高时效性要求。
The embodiment of the present invention discloses an item recommendation method, device, server and storage medium. The method includes: screening the preset items in the preset item set according to the preset screening rules to select the target item; if the target The recommended times of the item is less than the first exposure threshold, recommending the target item to the preset user searched according to the preset attribute of the target item; if the recommended times of the target item is greater than the first exposure threshold, according to the The click rate of the target item is sorted and recalled in the preset grouping; if the number of recommendations of the target item is greater than the second exposure threshold, the target item is combined with other preset items in the preset item set And sort and recall according to the preset strategy. The technical solution of this embodiment shortens the iteration cycle of new items, and quickly tests out the effect quality of new items, which can meet the high timeliness requirements for items put forward by the recommendation system of the information industry.
Description
技术领域Technical Field
本发明实施例涉及人工智能技术,尤其涉及一种物品推荐方法、装置、服务器及储存介质。Embodiments of the present invention relate to artificial intelligence technology, and more particularly to an item recommendation method, device, server and storage medium.
背景技术Background Art
随着传播技术的飞快发展以及数据量的剧增,用户对资讯行业提出了更高的时效性和个性化的要求。With the rapid development of communication technology and the dramatic increase in data volume, users have placed higher demands on timeliness and personalization in the information industry.
现有技术中新物品的迭代方式主要有如下两种:其一为通过物品自身的内容与老物品建立某种关联,然后以该关联关系进行曝光。那么新物品的进一步曝光很依赖于老物品的曝光程度和点击情况。若老物品被用户点击少,那么其排序则相对靠后,那么新物品的曝光周期会被进一步延长,曝光次数也受到其相关老物品的推荐情况的影响。其二为通过模型预测用户对物品的评分,从而进行排序召回。但一般而言,用户行为数据量较大,计算周期长,有一定的滞后性,使得物品的迭代周期延长。因此新物品的迭代周期相对较长。There are two main ways to iterate new items in the prior art: one is to establish a certain association with old items through the content of the item itself, and then expose them based on the association. Then the further exposure of the new item depends on the exposure and click situation of the old item. If the old item is clicked less by the user, then its ranking will be relatively backward, then the exposure cycle of the new item will be further extended, and the number of exposures will also be affected by the recommendation of its related old items. The second is to predict the user's rating of the item through a model, and then perform sorting and recall. But generally speaking, the amount of user behavior data is large, the calculation cycle is long, and there is a certain lag, which prolongs the iteration cycle of the item. Therefore, the iteration cycle of new items is relatively long.
与此同时,老物品由于得到充分曝光,其物品质量优劣性可以根据用户的反馈而得到;而新物品在短时间无法得到曝光,其质量无法得知。现有技术中按照统一排序召回方法,新物品一般处于劣势,那么质量可能优质的新物品推荐给用户的概率大大降低,新物品无法曝光。At the same time, since old items are fully exposed, their quality can be determined based on user feedback; however, new items cannot be exposed in a short period of time, and their quality cannot be known. In the prior art, new items are generally at a disadvantage according to the unified sorting and recall method, so the probability of recommending new items with high quality to users is greatly reduced, and new items cannot be exposed.
发明内容Summary of the invention
本发明实施例提供一种物品推荐方法、装置、服务器及储存介质,以实现缩短新物品的迭代周期,快速试探出新物品的效果质量。The embodiments of the present invention provide an item recommendation method, device, server and storage medium to shorten the iteration cycle of new items and quickly test the effect quality of new items.
第一方面,本发明实施例提供了一种物品推荐方法,包括:In a first aspect, an embodiment of the present invention provides an item recommendation method, comprising:
根据预设筛选规则对预设物品集合中的预设物品进行筛选,选出目标物品;Filter the preset items in the preset item set according to the preset filtering rules to select the target item;
若所述目标物品的推荐次数小于第一曝光阈值,向根据所述目标物品的预设属性搜索到的预设用户推荐所述目标物品;If the number of recommendations of the target item is less than a first exposure threshold, recommending the target item to a preset user searched according to a preset attribute of the target item;
若所述目标物品的推荐次数大于所述第一曝光阈值,根据所述目标物品的点击率在预设分组中进行排序召回;If the number of recommendations of the target item is greater than the first exposure threshold, sort and recall the target item in the preset grouping according to the click rate of the target item;
若所述目标物品的推荐次数大于第二曝光阈值,将所述目标物品与所述预设物品集合中的其他所述预设物品一并按照预设策略进行排序召回。If the number of recommendations of the target item is greater than the second exposure threshold, the target item and other preset items in the preset item set are sorted and recalled according to a preset strategy.
第二方面,本发明实施例还提供了一种物品推荐装置,包括:In a second aspect, an embodiment of the present invention further provides an item recommendation device, comprising:
目标物品筛选模块,用于根据预设筛选规则对预设物品集合中的预设物品进行筛选,选出目标物品;A target item screening module is used to screen preset items in a preset item set according to preset screening rules to select target items;
目标物品推荐模块,用于若所述目标物品的推荐次数小于第一曝光阈值,向根据所述目标物品的预设属性搜索到的预设用户推荐所述目标物品;a target item recommendation module, configured to recommend the target item to a preset user searched according to a preset attribute of the target item if the number of recommendations of the target item is less than a first exposure threshold;
第一排序召回模块,用于若所述目标物品的推荐次数大于所述第一曝光阈值,根据所述目标物品在所属类目的点击率进行排序召回;A first sorting and recalling module, configured to sort and recall the target item according to the click rate of the target item in the category to which it belongs if the number of recommendations of the target item is greater than the first exposure threshold;
第二排序召回模块,用于若所述目标物品的推荐次数大于第二曝光阈值,将所述目标物品与所述预设物品集合中的其他预设物品一并按照预设策略进行排序召回。The second sorting and recalling module is used to sort and recall the target object and other preset objects in the preset object set according to a preset strategy if the number of recommendations of the target object is greater than a second exposure threshold.
第三方面,本发明实施例还提供了一种服务器,所述服务器包括:In a third aspect, an embodiment of the present invention further provides a server, the server comprising:
一个或多个处理器;one or more processors;
存储器,用于存储一个或多个程序,a memory for storing one or more programs,
当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如本发明任意实施例所提供的物品推荐方法。When the one or more programs are executed by the one or more processors, the one or more processors implement the item recommendation method provided by any embodiment of the present invention.
第四方面,本发明实施例还提供了一种包含计算机可执行指令的存储介质,所述计算机可执行指令在由计算机处理器执行时用于执行如本发明任意实施例所提供物品推荐方法。In a fourth aspect, an embodiment of the present invention further provides a storage medium comprising computer executable instructions, which, when executed by a computer processor, are used to execute the item recommendation method provided by any embodiment of the present invention.
本发明实施例通过筛选目标物品进行推荐,并根据曝光量选择对应的排序召回策略,解决新物品的迭代周期相对较长,新物品在短时间无法得到曝光,而导致质量无法得知的问题,实现缩短新物品的迭代周期,快速试探出新物品的质量的效果。The embodiment of the present invention recommends by screening target items and selects a corresponding sorting and recall strategy according to the exposure amount, thereby solving the problem that the iteration cycle of new items is relatively long and new items cannot be exposed in a short time, resulting in the inability to know the quality. This achieves the effect of shortening the iteration cycle of new items and quickly testing the quality of new items.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是本发明实施例一中的一种物品推荐方法的流程图;FIG1 is a flow chart of an item recommendation method in Embodiment 1 of the present invention;
图2是本发明实施例二中的一种物品推荐方法的流程图;FIG2 is a flow chart of an item recommendation method in Embodiment 2 of the present invention;
图3是本发明实施例二中的一种物品推荐方法的流程图;FIG3 is a flow chart of an item recommendation method in Embodiment 2 of the present invention;
图4是本发明实施例三中的一种物品推荐装置的结构示意图;FIG4 is a schematic diagram of the structure of an item recommendation device in Embodiment 3 of the present invention;
图5是本发明实施例四中的一种服务器的结构示意图。FIG5 is a schematic diagram of the structure of a server in Embodiment 4 of the present invention.
具体实施方式DETAILED DESCRIPTION
下面结合附图和实施例对本发明作进一步的详细说明。可以理解的是,此处所描述的具体实施例仅仅用于解释本发明,而非对本发明的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与本发明相关的部分而非全部结构。The present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It is to be understood that the specific embodiments described herein are only used to explain the present invention, rather than to limit the present invention. It should also be noted that, for ease of description, only parts related to the present invention, rather than all structures, are shown in the accompanying drawings.
实施例一Embodiment 1
图1为本发明实施例一提供的一种物品推荐方法的流程图,本实施例可适用于新物品冷启动推荐的情况,该方法可以由物品推荐装置来执行,该装置可以由硬件和/或软件来实现,该方法具体包括如下步骤:FIG1 is a flow chart of an item recommendation method provided by Embodiment 1 of the present invention. This embodiment is applicable to the case of cold-start recommendation of new items. The method may be executed by an item recommendation device, which may be implemented by hardware and/or software. The method specifically includes the following steps:
步骤110、根据预设筛选规则对预设物品集合中的预设物品进行筛选,选出目标物品;Step 110: Filter the preset items in the preset item set according to the preset filtering rule to select the target item;
其中,从数据库中获取存储的预设物品集合,预设物品集合中存储有预设物品,这些预设物品是需要通过推荐引擎推荐给用户的,并且从数据库中获取预设物品的相关信息,值得注意的是,以预设物品的某个维度进行筛选,从而判断预设物品是否为新物品,则该维度信息不可或缺,示例性的,维度为面世时间差,也就是当前时间与物品的面世时间的差值。筛选的目标是从预设物品集中找到新物品,可选的,根据预设筛选规则对预设物品集合中的预设物品进行筛选,选出目标物品,包括:将面世时间差小于第一阈值的预设物品确定为预选目标物品;将面世时间差大于第二阈值,且推荐次数超过预设曝光量阈值的预选目标物品剔除;其中,第一阈值大于第二阈值;将剩余的预选目标物品确定为目标物品。第一阈值是区分新物品和老物品的界限时间,对于面世时间差大于等于第一阈值的预设物品则认为属于老物品,示例性的,第一阈值一般设置为1天,根据时效性要求自行设置。另外,对于预选目标物品范围中面世时间差大于第二阈值的,需要判断其曝光程度的高低,如果其得到较多曝光,也就是推荐次数超过预设曝光量阈值,就需要将其剔除,不将其选为目标物品进行推荐。示例性的,第二阈值可以设置为7200秒,一般可以选用其他推荐模块的最长运行时间。该预设曝光量阈值主要通过分析,一般可以取其他推荐逻辑的平均曝光量值。上述预设筛选规则筛选出目标物品,其余的预设物品则采用与老物品一样的预设策略统一进行排序召回。Among them, a preset item set stored in a database is obtained, and the preset item set stores preset items, which need to be recommended to users through a recommendation engine, and relevant information of the preset items is obtained from the database. It is worth noting that the preset items are screened by a certain dimension to determine whether the preset items are new items, and the dimension information is indispensable. Exemplarily, the dimension is the time difference between the appearance, that is, the difference between the current time and the appearance time of the item. The goal of screening is to find new items from the preset item set. Optionally, the preset items in the preset item set are screened according to the preset screening rules to select the target items, including: determining the preset items whose appearance time difference is less than the first threshold as pre-selected target items; removing the pre-selected target items whose appearance time difference is greater than the second threshold and whose recommendation times exceed the preset exposure threshold; wherein the first threshold is greater than the second threshold; and determining the remaining pre-selected target items as target items. The first threshold is the boundary time for distinguishing new items from old items. For preset items whose appearance time difference is greater than or equal to the first threshold, they are considered to be old items. Exemplarily, the first threshold is generally set to 1 day, which can be set according to the timeliness requirements. In addition, for the pre-selected target items whose release time difference is greater than the second threshold, it is necessary to determine the degree of exposure. If they receive more exposure, that is, the number of recommendations exceeds the preset exposure threshold, they need to be eliminated and not selected as target items for recommendation. Exemplarily, the second threshold can be set to 7200 seconds, and the longest running time of other recommendation modules can generally be selected. The preset exposure threshold is mainly obtained through analysis, and generally the average exposure value of other recommendation logics can be taken. The above preset screening rules filter out the target items, and the remaining preset items are uniformly sorted and recalled using the same preset strategy as the old items.
步骤120、若目标物品的推荐次数小于第一曝光阈值,向根据目标物品的预设属性搜索到的预设用户推荐目标物品;Step 120: If the number of recommendations of the target item is less than the first exposure threshold, recommend the target item to a preset user searched according to the preset attributes of the target item;
其中,第一曝光阈值为试探曝光阈值,对于达到该试探曝光阈值的目标物品可以认为是已经向足够多用户推荐过,而对于推荐次数小于第一曝光阈值的目标物品,需要将其进行强制推荐以增加曝光量,这样也可以根据用户的反馈确定新物品的质量效果。预设属性可以是物品的类目和/或标签等信息,获取用户最近有点击行为的物品的类目和/或标签等信息,将这部分信息与需曝光的新物品的类目和/或标签信息进行匹配,对于点击了相匹配的物品的用户,即为预设用户。强制一定数量的预设用户推荐目标物品。Among them, the first exposure threshold is a trial exposure threshold. For target items that reach the trial exposure threshold, it can be considered that they have been recommended to enough users. For target items that are recommended less than the first exposure threshold, they need to be forcibly recommended to increase exposure. In this way, the quality effect of new items can also be determined based on user feedback. The preset attributes can be information such as the category and/or label of the item. The category and/or label information of the items that the user has recently clicked is obtained, and this part of information is matched with the category and/or label information of the new item to be exposed. The user who clicks the matching item is the preset user. A certain number of preset users are forced to recommend the target item.
步骤130、若目标物品的推荐次数大于第一曝光阈值,根据目标物品的点击率在预设分组中进行排序召回;Step 130: If the number of recommendations of the target item is greater than the first exposure threshold, sort and recall the target item in the preset grouping according to the click rate of the target item;
其中,对于推荐次数已经大于第一曝光阈值的目标物品,收集这些目标物品的点击行为,计算目标物品的点击率,由于这些目标物品属于新物品,所以按照目标物品的类目和/或标签分组,这样目标物品会处于预设分组下,根据在同一预设分组的目标物品的点击率从大到小,对目标物品进行排序召回。Among them, for target items whose recommendation times are greater than the first exposure threshold, the click behaviors of these target items are collected and the click-through rates of the target items are calculated. Since these target items are new items, they are grouped according to the categories and/or tags of the target items. In this way, the target items will be in a preset group. The target items are sorted and recalled from large to small according to the click-through rates of the target items in the same preset group.
步骤140、若目标物品的推荐次数大于第二曝光阈值,将目标物品与预设物品集合中的其他预设物品一并按照预设策略进行排序召回。Step 140: If the number of recommendations of the target item is greater than the second exposure threshold, the target item and other preset items in the preset item set are sorted and recalled according to a preset strategy.
其中,对于推荐次数已经大于第二曝光阈值的目标物品,认为已经进入老物品的范围,对于这样的目标物品,可以采用与老物品一样的预设策略进行排序召回。示例性的,老物品采用协同过滤或点击通过率(Click-Through-Rate,CTR)排序等策略进行排序召回,如此实现了新物品与老物品召回排序方式上的区别处理和最终统一。Among them, for the target items whose recommendation times are greater than the second exposure threshold, it is considered that they have entered the range of old items. For such target items, the same preset strategy as the old items can be used for sorting and recall. Exemplarily, the old items are sorted and recalled using strategies such as collaborative filtering or click-through rate (CTR) sorting, so that the new items and the old items are processed differently and finally unified in the sorting method of recall.
本实施例的技术方案,通过筛选目标物品进行推荐,并根据曝光量选择对应的排序召回策略,解决新物品的迭代周期相对较长,新物品在短时间无法得到曝光,而导致质量无法得知的问题,实现缩短新物品的迭代周期,快速试探出新物品的质量的效果。The technical solution of this embodiment, by screening target items for recommendation and selecting corresponding sorting and recall strategies according to the exposure amount, solves the problem that the iteration cycle of new items is relatively long and new items cannot be exposed in a short time, resulting in the inability to know the quality, thereby shortening the iteration cycle of new items and quickly testing the quality of new items.
实施例二Embodiment 2
图2为本发明实施例二提供的一种物品推荐方法的流程图,本实施例的技术方案在上述技术方案的基础上进一步细化,该方法具体包括:FIG2 is a flow chart of an item recommendation method provided by Embodiment 2 of the present invention. The technical solution of this embodiment is further refined on the basis of the above technical solution. The method specifically includes:
步骤210、根据预设筛选规则对预设物品集合中的预设物品进行筛选,选出目标物品;Step 210: Filter the preset items in the preset item set according to the preset filtering rule to select the target item;
步骤220、根据第一曝光阈值确定公式计算第一曝光阈值。Step 220: Determine a formula according to the first exposure threshold A first exposure threshold is calculated.
其中,all_pv表示预设物品每天曝光量总和,ucb_ratio表示试探性曝光的占比,item_ucb表示每天新物品的数量,item_ucb_ratio表示曝光新物品的占比。试探性曝光占比一般在5%-10%之间,本实施例中选取5%。Among them, all_pv represents the total exposure of preset items every day, ucb_ratio represents the proportion of tentative exposure, item_ucb represents the number of new items every day, and item_ucb_ratio represents the proportion of new items exposed. The proportion of tentative exposure is generally between 5% and 10%, and 5% is selected in this embodiment.
步骤230、若所述目标物品的推荐次数小于所述第一曝光阈值,获取预设时间内用户点击行为的对象物品;Step 230: If the number of recommendations of the target item is less than the first exposure threshold, obtain the target item of the user's click behavior within a preset time;
步骤240、将对象物品与目标物品在预设属性方面进行匹配;
步骤250、将相匹配的对象物品对应的用户,确定为预设用户;Step 250: Determine the user corresponding to the matched object item as the preset user;
步骤260、向预设数量的预设用户推荐目标物品。Step 260: Recommend the target item to a preset number of preset users.
其中,物品的预设属性可以选择类目,将用户最近点击的物品的类目与新物品的类目进行匹配,推荐给匹配成功的预设用户,一共推荐给预设数量的预设用户。根据用户类目偏好,推荐出相同类目的新物品,这样更容易快速试探出新物品的类目效果。Among them, the preset attributes of the items can select categories, match the category of the item that the user recently clicked with the category of the new item, and recommend it to the preset users who have successfully matched, and recommend it to a preset number of preset users in total. According to the user's category preference, new items of the same category are recommended, which makes it easier to quickly test the category effect of the new item.
可选的,在向预设数量的预设用户推荐目标物品之前,还包括:Optionally, before recommending the target item to a preset number of preset users, the method further includes:
根据预设数量确定公式计算预设数量,其中,ucb_max_pv表示第一曝光阈值,user_cnt表示预设用户数量。Determine the formula based on the preset quantity Calculate the preset number, where ucb_max_pv represents the first exposure threshold, and user_cnt represents the preset number of users.
步骤270、若目标物品的推荐次数大于第一曝光阈值,根据目标物品的点击率在预设分组中进行排序召回;Step 270: If the number of recommendations of the target item is greater than the first exposure threshold, sort and recall the target item in the preset grouping according to the click rate of the target item;
其中,可选的,获取针对目标物品的点击行为,根据点击率确定公式计算目标物品的点击率,其中,ctri表示预设物品i的点击率,pvi表示预设物品i的曝光量,cki表示预设物品i的点击量;Optionally, obtain the click behavior for the target item and determine the formula based on the click rate Calculate the click rate of the target item, where ctri represents the click rate of preset item i, pvi represents the exposure of preset item i, and cki represents the click volume of preset item i;
在预设分组中,按照目标物品的点击率从大到小进行排序召回。In the preset grouping, the target items are sorted and recalled from high to low according to their click rate.
步骤280、若目标物品的推荐次数大于第二曝光阈值,将目标物品与预设物品集合中的其他预设物品一并按照预设策略进行排序召回。Step 280: If the number of recommendations of the target item is greater than the second exposure threshold, the target item and other preset items in the preset item set are sorted and recalled according to a preset strategy.
图3示出了本发明实施例提供的一种物品推荐方法的流程图,该流程图中示出了解决资讯行业中推荐系统的物品冷启动问题方法的较为合适实现方式的流程图,该实现方式的步骤如下:FIG3 shows a flow chart of an item recommendation method provided by an embodiment of the present invention, and the flow chart shows a flow chart of a more suitable implementation method of a method for solving the item cold start problem of a recommendation system in the information industry. The steps of the implementation method are as follows:
1.给出所有物品集合,该实例中新物品的筛选条件主要以时间维度展开的,因此必须获取物品的面世时间(item_produce_time)。1. Given a collection of all items, the screening criteria for new items in this example are mainly based on the time dimension, so the item's release time (item_produce_time) must be obtained.
2.根据筛选条件筛选出符合新物品冷启动的物品:2. Filter out items that meet the new item cold start criteria:
1)若物品的时间差(time_diff,该值为当前时间-物品面世时间)超过一定阈值(max_time_diff,该值一般设置为1天,根据时效性要求自行设置),则该物品不属于新物品,选用常规排序召回方法,跳转至步骤9;1) If the time difference of the item (time_diff, which is the current time minus the item release time) exceeds a certain threshold (max_time_diff, which is generally set to 1 day and can be set according to timeliness requirements), the item is not a new item, and the conventional sorting and recall method is used, jumping to step 9;
2)若物品的时间差(time_diff)超过一个阈值(choice_time_diff=7200,一般选用其他推荐模块的最长运行时间),且推荐次数超过曝光量阈值(pv_max),虽然该物品属于新物品,但是它得到了足够的曝光,可以与老物品采取统计方式进行排序召回,跳转至步骤9;2) If the time difference (time_diff) of the item exceeds a threshold (choice_time_diff = 7200, generally the longest running time of other recommendation modules is selected), and the number of recommendations exceeds the exposure threshold (pv_max), although the item is a new item, it has received enough exposure and can be sorted and recalled with old items in a statistical manner, and jump to step 9;
3)余下新物品符合新物品冷启动的条件,进行步骤3。3) The remaining new items meet the conditions for new item cold start, and proceed to step 3.
3.获取新物品的类目、标签等信息(至少一个维度上的信息,尽量选取准备率较高的维度),本案例上采取了类目,原因为通过类目可以快速定位推荐的用户群,且在该案例中类目信息比较准确。3. Obtain the category, label and other information of the new item (information on at least one dimension, try to select the dimension with a higher preparation rate). In this case, categories are used because categories can be used to quickly locate the recommended user group, and in this case, category information is relatively accurate.
4.若物品的推荐数量达到试探曝光阈值(ucb_max_pv,其中试探性曝光占比一般在5%-10%之间,本案例选取了5%),进行步骤7,否则进行步骤5。4. If the number of recommended items reaches the tentative exposure threshold (ucb_max_pv, where the tentative exposure ratio is generally between 5% and 10%, and 5% is selected in this case), proceed to step 7, otherwise proceed to step 5.
5.将用户最近点击的物品的类目与新物品的类目进行匹配,推荐给用户,一共推荐给ucb_num个用户。该系统根据用户类目偏好,推荐出相同类目的新物品,这样更容易快速试探出新物品的类目效果。5. Match the category of the item that the user recently clicked with the category of the new item and recommend it to the user, a total of ucb_num users. The system recommends new items of the same category based on the user's category preferences, which makes it easier to quickly test the category effect of the new item.
6.若新物品的推荐数量达到试探性曝光阈值(ucb_max_pv),则跳转到步骤7,否则跳转至步骤5。6. If the recommended number of new items reaches the tentative exposure threshold (ucb_max_pv), jump to step 7, otherwise jump to step 5.
7.收集新物品的点击行为,计算新物品的点击率(ctr),在同一类目下按照点击率(从大到小)进行排序召回。7. Collect the click behavior of new items, calculate the click rate (ctr) of new items, and sort and recall them according to the click rate (from large to small) in the same category.
8.若新物品的推荐数量达到曝光量阈值(pv_max),进行步骤9,否则进行步骤7。8. If the recommended quantity of the new item reaches the exposure threshold (pv_max), proceed to step 9, otherwise proceed to step 7.
9.和老物品采用统一的计算方式进行排序召回.9. Use the same calculation method as old items for sorting and recall.
因为新物品得到足够的曝光,演变成老物品,不再具备冷启动物品的相关特征,至此完成了一个冷启动物品到老物品的产品周期。Because the new item gets enough exposure and turns into an old item, it no longer has the relevant characteristics of a cold start item, thus completing a product cycle from a cold start item to an old item.
实施例三Embodiment 3
图4为本发明实施例四提供的一种物品推荐装置的结构示意图,如图4所示,物品推荐装置包括:FIG4 is a schematic diagram of the structure of an item recommendation device provided in Embodiment 4 of the present invention. As shown in FIG4 , the item recommendation device includes:
目标物品筛选模块410,用于根据预设筛选规则对预设物品集合中的预设物品进行筛选,选出目标物品;The target
目标物品推荐模块420,用于若目标物品的推荐次数小于第一曝光阈值,向根据目标物品的预设属性搜索到的预设用户推荐目标物品;A target
第一排序召回模块430,用于若目标物品的推荐次数大于第一曝光阈值,根据目标物品在所属类目的点击率进行排序召回;A first sorting and recalling
第二排序召回模块440,用于若目标物品的推荐次数大于第二曝光阈值,将目标物品与预设物品集合中的其他预设物品一并按照预设策略进行排序召回。The second sorting and recalling
本实施例的技术方案,通过筛选目标物品进行推荐,并根据曝光量选择对应的排序召回策略,解决新物品的迭代周期相对较长,新物品在短时间无法得到曝光,而导致质量无法得知的问题,实现缩短新物品的迭代周期,快速试探出新物品的质量的效果。The technical solution of this embodiment, by screening target items for recommendation and selecting corresponding sorting and recall strategies according to the exposure amount, solves the problem that the iteration cycle of new items is relatively long and new items cannot be exposed in a short time, resulting in the inability to know the quality. It achieves the effect of shortening the iteration cycle of new items and quickly testing the quality of new items.
可选的,目标物品筛选模块410,包括:Optionally, the target
预选目标物品确定单元,用于将面世时间差小于第一阈值的预设物品确定为预选目标物品;A pre-selected target item determination unit, configured to determine preset items whose release time difference is less than a first threshold as pre-selected target items;
剔除单元,用于将面世时间差大于第二阈值,且推荐次数超过预设曝光量阈值的预选目标物品剔除;其中,第一阈值大于第二阈值;A rejection unit, used to reject pre-selected target items whose release time difference is greater than a second threshold and whose recommendation times exceed a preset exposure threshold; wherein the first threshold is greater than the second threshold;
目标物品确定单元,用于将剩余的预选目标物品确定为目标物品。The target item determination unit is used to determine the remaining pre-selected target items as target items.
可选的,目标物品推荐模块420,具体用于:Optionally, the target
若目标物品的推荐次数小于第一曝光阈值,获取预设时间内用户点击行为的对象物品;If the number of recommendations of the target item is less than the first exposure threshold, the target item of the user's click behavior within a preset time is obtained;
将对象物品与目标物品在预设属性方面进行匹配;Matching the object item with the target item in terms of preset attributes;
将相匹配的对象物品对应的用户,确定为预设用户;Determine the user corresponding to the matched object item as the preset user;
向预设数量的预设用户推荐目标物品。Recommend target items to a preset number of preset users.
可选的,物品推荐装置,还包括:Optionally, the item recommendation device further includes:
预设数量确定模块,用于在向预设数量的预设用户推荐目标物品之前,根据预设数量确定公式计算预设数量,其中,ucb_max_pv表示第一曝光阈值,user_cnt表示预设用户数量。A preset quantity determination module is used to determine the formula according to the preset quantity before recommending the target item to the preset number of preset users. Calculate the preset number, where ucb_max_pv represents the first exposure threshold, and user_cnt represents the preset number of users.
可选的,物品推荐装置,还包括:Optionally, the item recommendation device further includes:
第一曝光阈值确定模块,用于在若目标物品的推荐次数小于第一曝光阈值,向根据目标物品的预设属性搜索到的预设用户推荐目标物品之前,根据第一曝光阈值确定公式计算第一曝光阈值,其中,all_pv表示预设物品每天曝光量总和,ucb_ratio表示试探性曝光的占比,item_ucb表示每天新物品的数量,item_ucb_ratio表示曝光新物品的占比。The first exposure threshold determination module is used to determine the formula according to the first exposure threshold before recommending the target item to the preset user searched according to the preset attribute of the target item if the number of recommendation times of the target item is less than the first exposure threshold. Calculate the first exposure threshold, where all_pv represents the total exposure of preset items every day, ucb_ratio represents the proportion of tentative exposure, item_ucb represents the number of new items every day, and item_ucb_ratio represents the proportion of new items exposed.
可选的,第一排序召回模块430,具体用于:Optionally, the first sorting and recalling
获取针对目标物品的点击行为,根据点击率确定公式计算目标物品的点击率,其中,ctri表示预设物品i的点击率,pvi表示预设物品i的曝光量,cki表示预设物品i的点击量;Get the click behavior for the target item and determine the formula based on the click rate Calculate the click rate of the target item, where ctri represents the click rate of preset item i, pvi represents the exposure of preset item i, and cki represents the click volume of preset item i;
在预设分组中,按照目标物品的点击率从大到小进行排序召回。In the preset grouping, the target items are sorted and recalled from high to low according to their click rate.
本发明实施例所提供的物品推荐装置可执行本发明任意实施例所提供的物品推荐方法,具备执行方法相应的功能模块和有益效果。The item recommendation device provided in the embodiment of the present invention can execute the item recommendation method provided in any embodiment of the present invention, and has the corresponding functional modules and beneficial effects of the execution method.
实施例四Embodiment 4
图5为本发明实施例四提供的一种服务器的结构示意图,如图5所示,该服务器包括处理器510和存储器520;服务器中处理器510的数量可以是一个或多个,图5中以一个处理器510为例;服务器中的处理器510和存储器520、可以通过总线或其他方式连接,图5中以通过总线连接为例。Figure 5 is a schematic diagram of the structure of a server provided in Embodiment 4 of the present invention. As shown in Figure 5, the server includes a
存储器520作为一种计算机可读存储介质,可用于存储软件程序、计算机可执行程序以及模块,如本发明实施例中的物品推荐方法对应的程序指令/模块(例如,物品推荐装置中的目标物品筛选模块410、目标物品推荐模块420、第一排序召回模块430和第二排序召回模块440)。处理器510通过运行存储在存储器520中的软件程序、指令以及模块,从而执行服务器的各种功能应用以及数据处理,即实现上述的物品推荐方法。The
存储器520可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序;存储数据区可存储根据终端的使用所创建的数据等。此外,存储器520可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实例中,存储器520可进一步包括相对于处理器510远程设置的存储器,这些远程存储器可以通过网络连接至服务器。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The
实施例五Embodiment 5
本发明实施例五还提供一种包含计算机可执行指令的存储介质,所述计算机可执行指令在由计算机处理器执行时用于执行一种物品推荐方法,包括:Embodiment 5 of the present invention further provides a storage medium containing computer executable instructions, wherein the computer executable instructions, when executed by a computer processor, are used to perform an item recommendation method, including:
根据预设筛选规则对预设物品集合中的预设物品进行筛选,选出目标物品;Filter the preset items in the preset item set according to the preset filtering rules to select the target item;
若所述目标物品的推荐次数小于第一曝光阈值,向根据所述目标物品的预设属性搜索到的预设用户推荐所述目标物品;If the number of recommendations of the target item is less than a first exposure threshold, recommending the target item to a preset user searched according to a preset attribute of the target item;
若所述目标物品的推荐次数大于所述第一曝光阈值,根据所述目标物品的点击率在预设分组中进行排序召回;If the number of recommendations of the target item is greater than the first exposure threshold, sort and recall the target item in the preset grouping according to the click rate of the target item;
当所述目标物品的推荐次数大于第二曝光阈值,将所述目标物品与所述预设物品集合中的其他所述预设物品一并按照预设策略进行排序召回。When the number of recommendations of the target item is greater than a second exposure threshold, the target item and other preset items in the preset item set are sorted and recalled according to a preset strategy.
当然,本发明实施例所提供的一种包含计算机可执行指令的存储介质,其计算机可执行指令不限于如上所述的方法操作,还可以执行本发明任意实施例所提供的物品推荐方法中的相关操作。Of course, the storage medium containing computer executable instructions provided by an embodiment of the present invention is not limited to the method operations described above, and can also execute related operations in the item recommendation method provided by any embodiment of the present invention.
通过以上关于实施方式的描述,所属领域的技术人员可以清楚地了解到,本发明可借助软件及必需的通用硬件来实现,当然也可以通过硬件实现,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如计算机的软盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(RandomAccess Memory,RAM)、闪存(FLASH)、硬盘或光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述的方法。Through the above description of the implementation methods, the technicians in the relevant field can clearly understand that the present invention can be implemented by means of software and necessary general hardware, and of course it can also be implemented by hardware, but in many cases the former is a better implementation method. Based on such an understanding, the technical solution of the present invention, in essence, or the part that contributes to the prior art, can be embodied in the form of a software product, and the computer software product can be stored in a computer-readable storage medium, such as a computer floppy disk, read-only memory (ROM), random access memory (RAM), flash memory (FLASH), hard disk or optical disk, etc., including a number of instructions for a computer device (which can be a personal computer, server, or network device, etc.) to execute the methods described in each embodiment of the present invention.
值得注意的是,上述物品推荐装置的实施例中,所包括的各个单元和模块只是按照功能逻辑进行划分的,但并不局限于上述的划分,只要能够实现相应的功能即可;另外,各功能单元的具体名称也只是为了便于相互区分,并不用于限制本发明的保护范围。It is worth noting that in the embodiment of the above-mentioned item recommendation device, the various units and modules included are only divided according to functional logic, but are not limited to the above-mentioned division, as long as the corresponding functions can be achieved; in addition, the specific names of the functional units are only for the convenience of distinguishing each other, and are not used to limit the scope of protection of the present invention.
注意,上述仅为本发明的较佳实施例及所运用技术原理。本领域技术人员会理解,本发明不限于这里所述的特定实施例,对本领域技术人员来说能够进行各种明显的变化、重新调整和替代而不会脱离本发明的保护范围。因此,虽然通过以上实施例对本发明进行了较为详细的说明,但是本发明不仅仅限于以上实施例,在不脱离本发明构思的情况下,还可以包括更多其他等效实施例,而本发明的范围由所附的权利要求范围决定。Note that the above are only preferred embodiments of the present invention and the technical principles used. Those skilled in the art will understand that the present invention is not limited to the specific embodiments described herein, and that various obvious changes, readjustments and substitutions can be made by those skilled in the art without departing from the scope of protection of the present invention. Therefore, although the present invention has been described in more detail through the above embodiments, the present invention is not limited to the above embodiments, and may include more other equivalent embodiments without departing from the concept of the present invention, and the scope of the present invention is determined by the scope of the appended claims.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010294127.4A CN111538901B (en) | 2020-04-15 | 2020-04-15 | Article recommendation method and device, server and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010294127.4A CN111538901B (en) | 2020-04-15 | 2020-04-15 | Article recommendation method and device, server and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111538901A CN111538901A (en) | 2020-08-14 |
CN111538901B true CN111538901B (en) | 2023-06-06 |
Family
ID=71974921
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010294127.4A Active CN111538901B (en) | 2020-04-15 | 2020-04-15 | Article recommendation method and device, server and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111538901B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113781086B (en) * | 2021-01-21 | 2024-08-20 | 北京沃东天骏信息技术有限公司 | Article recommendation method, device, medium and electronic equipment |
CN112818237A (en) * | 2021-02-05 | 2021-05-18 | 上海明略人工智能(集团)有限公司 | Content pushing method, device, equipment and storage medium |
CN113744021A (en) * | 2021-02-08 | 2021-12-03 | 北京沃东天骏信息技术有限公司 | A recommended method, device, computer storage medium and system |
CN113378068B (en) * | 2021-07-14 | 2023-07-18 | 聚好看科技股份有限公司 | Content recommendation method and server |
CN113836404B (en) * | 2021-09-09 | 2024-08-16 | 武汉卓尔数字传媒科技有限公司 | Object recommendation method, device, electronic equipment and computer readable storage medium |
CN113688295B (en) * | 2021-10-26 | 2022-03-25 | 北京达佳互联信息技术有限公司 | Data determination method and device, electronic equipment and storage medium |
CN114662008B (en) * | 2022-05-26 | 2022-10-21 | 上海二三四五网络科技有限公司 | Click position factor improvement-based CTR hot content calculation method and device |
CN115017410A (en) * | 2022-05-31 | 2022-09-06 | 北京沃东天骏信息技术有限公司 | A task push method, device and cloud server |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109190043A (en) * | 2018-09-07 | 2019-01-11 | 北京三快在线科技有限公司 | Recommended method and device, storage medium, electronic equipment and recommender system |
CN110532468A (en) * | 2019-08-26 | 2019-12-03 | 北京齐尔布莱特科技有限公司 | A kind of recommended method of site resource, device and calculate equipment |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5740814B2 (en) * | 2009-12-22 | 2015-07-01 | ソニー株式会社 | Information processing apparatus and method |
-
2020
- 2020-04-15 CN CN202010294127.4A patent/CN111538901B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109190043A (en) * | 2018-09-07 | 2019-01-11 | 北京三快在线科技有限公司 | Recommended method and device, storage medium, electronic equipment and recommender system |
CN110532468A (en) * | 2019-08-26 | 2019-12-03 | 北京齐尔布莱特科技有限公司 | A kind of recommended method of site resource, device and calculate equipment |
Non-Patent Citations (1)
Title |
---|
郝胜男 ; 赵领杰 ; .一种基于ElasticSearch的推荐系统架构.电脑知识与技术.2017,(36),全文. * |
Also Published As
Publication number | Publication date |
---|---|
CN111538901A (en) | 2020-08-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111538901B (en) | Article recommendation method and device, server and storage medium | |
CN110442790B (en) | Method, device, server and storage medium for recommending multimedia data | |
CN108520076B (en) | Electronic book recommendation method, electronic device and computer storage medium | |
CN110489644B (en) | Information push method, device, computer readable storage medium and computer equipment | |
CN109934704A (en) | Information recommendation method, apparatus, device and storage medium | |
CN105760400B (en) | A kind of PUSH message sort method and device based on search behavior | |
CN107122980A (en) | The method and apparatus for recognizing the affiliated classification of commodity | |
CN111797320A (en) | Data processing method, device, equipment and storage medium | |
CN110727857A (en) | Method and device for identifying key features of potential users aiming at business objects | |
CN110175264A (en) | Construction method, server and the computer readable storage medium of video user portrait | |
CN107943792A (en) | A kind of statement analytical method, device and terminal device, storage medium | |
CN109255676B (en) | Commodity recommendation method and device, computer equipment and storage medium | |
CN113239182A (en) | Article recommendation method and device, computer equipment and storage medium | |
CN106302568A (en) | A kind of user behavior evaluation methodology, Apparatus and system | |
CN110377821A (en) | Generate method, apparatus, computer equipment and the storage medium of interest tags | |
CN111831892A (en) | Information recommendation method, information recommendation device, server and storage medium | |
CN111752985A (en) | Method, device and storage medium for generating main portrait | |
CN114637914A (en) | List processing method, computing device and storage medium | |
CN113468394A (en) | Data processing method and device, electronic equipment and storage medium | |
CN110706036B (en) | Method, apparatus, storage medium and system for evaluating a plurality of new materials for an advertisement | |
CN111160647B (en) | Money laundering behavior prediction method and device | |
CN118134630A (en) | Credit risk level assessment method and device and electronic equipment | |
CN118134652A (en) | Asset configuration scheme generation method and device, electronic equipment and medium | |
CN110705889A (en) | Enterprise screening method, device, equipment and storage medium | |
CN116976995A (en) | Multi-target recommendation processing method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP03 | Change of name, title or address | ||
CP03 | Change of name, title or address |
Address after: Room 301, 303 and 304, block B, 112 liangxiu Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 201203 Patentee after: Daguan Data Co.,Ltd. Country or region after: China Address before: Room 301, 303 and 304, block B, 112 liangxiu Road, China (Shanghai) pilot Free Trade Zone, Pudong New Area, Shanghai, 201203 Patentee before: DATAGRAND INFORMATION TECHNOLOGY (SHANGHAI) Co.,Ltd. Country or region before: China |