WO2024016680A1 - Information flow recommendation method and apparatus and computer program product - Google Patents

Information flow recommendation method and apparatus and computer program product Download PDF

Info

Publication number
WO2024016680A1
WO2024016680A1 PCT/CN2023/080416 CN2023080416W WO2024016680A1 WO 2024016680 A1 WO2024016680 A1 WO 2024016680A1 CN 2023080416 W CN2023080416 W CN 2023080416W WO 2024016680 A1 WO2024016680 A1 WO 2024016680A1
Authority
WO
WIPO (PCT)
Prior art keywords
factor
weight
user
information
information flow
Prior art date
Application number
PCT/CN2023/080416
Other languages
French (fr)
Chinese (zh)
Inventor
邓罗丹
Original Assignee
百度在线网络技术(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 百度在线网络技术(北京)有限公司 filed Critical 百度在线网络技术(北京)有限公司
Publication of WO2024016680A1 publication Critical patent/WO2024016680A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

An information flow recommendation method and apparatus, an electronic device, a storage medium, and a program product, relating to the technical field of artificial intelligence, in particular to the technical field of evolution strategies. The specific implementation solution comprises: acquiring feature information of a first user in an information flow recommendation scenario (201); determining, by means of a multi-factor fusion parameter network according to the feature information, first weights corresponding to factors in a factor set (202), wherein the factors in the factor set represent index information needing to be considered in an information flow recommendation process; determining, by means of a gated screening network according to the feature information, second weights corresponding to the factors in the factor set (203); determining, according to the first weights and the second weights, a target factor in the factor set suitable for the first user in the information flow recommendation scenario (204); and according to the target factor, determining and pushing a recommendation result corresponding to the first user in the information flow recommendation scenario to the first user (205). The method improves the accuracy of the recommendation result.

Description

信息流推荐方法、装置及计算机程序产品Information flow recommendation method, device and computer program product
本专利申请要求于2022年7月20日提交的、申请号为202210857765.1、发明名称为“信息流推荐方法、装置及计算机程序产品”的中国专利申请的优先权,该申请的全文以引用的方式并入本申请中。This patent application claims priority to the Chinese patent application submitted on July 20, 2022, with the application number 202210857765.1 and the invention title "Information flow recommendation method, device and computer program product". The full text of the application is incorporated by reference. incorporated into this application.
技术领域Technical field
本公开涉及人工智能技术领域,具体涉及进化策略技术领域,尤其涉及信息流推荐方法、装置以及模型训练方法、装置、电子设备、存储介质以及计算机程序产品,可用于信息流推荐场景下。The present disclosure relates to the field of artificial intelligence technology, specifically to the field of evolutionary strategy technology, and in particular to information flow recommendation methods, devices and model training methods, devices, electronic devices, storage media and computer program products, which can be used in information flow recommendation scenarios.
背景技术Background technique
信息流推荐不同于广告,不仅关注资源的点展比,还会融合阅读时长、展现资源的多样性、用户点赞量、分享量等一系列体验指标作为综合推荐指标。虽然融合的目标因子越来越多,但是不同的融合因子有其适用的场景限制,比如对于新用户模型的首要任务是促活拉新,时长、多样性等目标不是系统重点关注的问题。如何对用户面临的场景做自适应的因子筛选,是信息流推荐系统中常见的问题。Information flow recommendation is different from advertising. It not only focuses on the click-to-view ratio of resources, but also integrates a series of experience indicators such as reading time, diversity of displayed resources, number of user likes, and number of shares as comprehensive recommendation indicators. Although there are more and more target factors for fusion, different fusion factors have their own applicable scenario restrictions. For example, the primary task of the new user model is to promote activation and attract new users, and goals such as duration and diversity are not the key concerns of the system. How to perform adaptive factor screening for the scenarios faced by users is a common problem in information flow recommendation systems.
发明内容Contents of the invention
本公开提供了一种信息流推荐方法、装置以及模型训练方法、装置、电子设备、存储介质以及计算机程序产品。The present disclosure provides an information flow recommendation method and device, as well as a model training method, device, electronic equipment, storage media and computer program products.
根据第一方面,提供了一种信息流推荐方法,包括:获取第一用户在信息流推荐场景下的特征信息;通过多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重,其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;通过门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重;根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第一用户的目标因子;根据目标因子,确定并向第一用户推送信息流推荐场景下的第一用户对应的推荐结果。 According to the first aspect, an information flow recommendation method is provided, which includes: obtaining the characteristic information of the first user in an information flow recommendation scenario; and determining, through a multi-factor fusion parameter network, the third factor corresponding to each factor in the factor set according to the characteristic information. A weight, in which the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process; through the gated screening network, the second weight corresponding to each factor in the factor set is determined according to the characteristic information; according to the first weight and The second weight determines the target factor in the factor set that is applicable to the first user in the information flow recommendation scenario; and determines and pushes the recommendation result corresponding to the first user in the information flow recommendation scenario to the first user based on the target factor.
根据第二方面,提供了一种模型训练方法,包括:获取第二用户在信息流推荐场景下的特征信息;通过初始多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重,其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;通过初始门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重;根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第二用户的目标因子;根据目标因子确定信息流推荐场景下的第二用户对应的推荐结果;采用进化策略,根据第二用户对于推荐结果的反馈信息,调整初始多因子融合参数网络的参数和初始门控筛选网络的参数,以得到训练后的多因子融合参数网络和门控筛选网络。According to the second aspect, a model training method is provided, which includes: obtaining the characteristic information of the second user in an information flow recommendation scenario; and determining the third factor corresponding to each factor in the factor set according to the characteristic information through an initial multi-factor fusion parameter network. A weight, in which the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process; through the initial gating screening network, the second weight corresponding to each factor in the factor set is determined according to the characteristic information; according to the first weight and the second weight, determine the target factor in the factor set that is suitable for the second user in the information flow recommendation scenario; determine the recommendation result corresponding to the second user in the information flow recommendation scenario based on the target factor; adopt an evolutionary strategy, according to the second user Regarding the feedback information of the recommendation results, the parameters of the initial multi-factor fusion parameter network and the parameters of the initial gate screening network are adjusted to obtain the trained multi-factor fusion parameter network and gate screening network.
根据第三方面,提供了一种信息流推荐装置,包括:第一获取单元,被配置成获取第一用户在信息流推荐场景下的特征信息;第一确定单元,被配置成通过多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重,其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;第二确定单元,被配置成通过门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重;第三确定单元,被配置成根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第一用户的目标因子;推荐单元,被配置成根据目标因子,确定并向第一用户推送信息流推荐场景下的第一用户对应的推荐结果。According to a third aspect, an information flow recommendation device is provided, including: a first acquisition unit configured to obtain the characteristic information of the first user in an information flow recommendation scenario; a first determination unit configured to obtain the feature information through multi-factor fusion The parameter network determines the first weight corresponding to each factor in the factor set based on the characteristic information, where the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process; the second determination unit is configured to pass the gating Screen the network and determine the second weight corresponding to each factor in the factor set according to the characteristic information; the third determination unit is configured to determine the first weight in the factor set suitable for the information flow recommendation scenario based on the first weight and the second weight. The user's target factor; the recommendation unit is configured to determine and push the recommendation result corresponding to the first user in the information flow recommendation scenario to the first user based on the target factor.
根据第四方面,提供了一种模型训练装置,包括:第二获取单元,被配置成获取第二用户在信息流推荐场景下的特征信息;第四确定单元,被配置成通过初始多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重,其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;第五确定单元,被配置成通过初始门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重;第六确定单元,被配置成根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第二用户的目标因子;第七确定单元,被配置成根据目标因子确定信息流推荐场景下的第二用户对应的推荐结果;训练单元,被配置成采用进化策略,根据第二用户对于推荐结果的反馈信息,调整初始多因子融合参数网络的参数和初始门控筛选网络的参数,以得到训练后的多因子融合参 数网络和门控筛选网络。According to a fourth aspect, a model training device is provided, including: a second acquisition unit configured to acquire characteristic information of the second user in an information flow recommendation scenario; a fourth determination unit configured to obtain the feature information through initial multi-factor fusion The parameter network determines the first weight corresponding to each factor in the factor set based on the characteristic information, where the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process; the fifth determination unit is configured to pass the initial gate The control screening network determines the second weight corresponding to each factor in the factor set according to the characteristic information; the sixth determination unit is configured to determine the third weight in the factor set suitable for the information flow recommendation scenario based on the first weight and the second weight. The target factors of the two users; the seventh determination unit is configured to determine the recommendation results corresponding to the second user in the information flow recommendation scenario according to the target factors; the training unit is configured to use an evolutionary strategy to determine the recommendation results according to the second user's Feedback information, adjust the parameters of the initial multi-factor fusion parameter network and the parameters of the initial gate screening network to obtain the multi-factor fusion parameters after training number network and gated screening network.
根据第五方面,提供了一种电子设备,包括:至少一个处理器;以及与至少一个处理器通信连接的存储器;其中,存储器存储有可被至少一个处理器执行的指令,指令被至少一个处理器执行,以使至少一个处理器能够执行如第一方面、第二方面任一实现方式描述的方法。According to a fifth aspect, an electronic device is provided, including: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores instructions that can be executed by at least one processor, and the instructions are processed by at least one The processor executes, so that at least one processor can execute the method described in any implementation manner of the first aspect and the second aspect.
根据第六方面,提供了一种存储有计算机指令的非瞬时计算机可读存储介质,计算机指令用于使计算机执行如第一方面、第二方面任一实现方式描述的方法。According to the sixth aspect, a non-transitory computer-readable storage medium storing computer instructions is provided, and the computer instructions are used to cause the computer to execute the method described in any implementation manner of the first aspect or the second aspect.
根据第七方面,提供了一种计算机程序产品,包括:计算机程序,计算机程序在被处理器执行时实现如第一方面、第二方面任一实现方式描述的方法。According to the seventh aspect, a computer program product is provided, including: a computer program. When executed by a processor, the computer program implements the method described in any implementation manner of the first aspect or the second aspect.
根据本公开的技术,提供了一种信息流推荐方法,在信息流推荐场景下,通过多因子融合参数网络确定对应于用户的各因子的第一权重,通过门控筛选网络确定各因子的第二权重,以根据第一权重和第二权重准确地确定出适用于信息流推荐场景下的用户的目标因子,进行信息流推荐,提高了推荐结果的准确度。According to the technology of the present disclosure, an information flow recommendation method is provided. In the information flow recommendation scenario, the first weight of each factor corresponding to the user is determined through a multi-factor fusion parameter network, and the first weight of each factor is determined through a gated screening network. Two weights are used to accurately determine the target factors suitable for users in the information flow recommendation scenario based on the first weight and the second weight, and perform information flow recommendation, thereby improving the accuracy of the recommendation results.
应当理解,本部分所描述的内容并非旨在标识本公开的实施例的关键或重要特征,也不用于限制本公开的范围。本公开的其它特征将通过以下的说明书而变得容易理解。It should be understood that what is described in this section is not intended to identify key or important features of the embodiments of the disclosure, nor is it intended to limit the scope of the disclosure. Other features of the present disclosure will become readily understood from the following description.
附图说明Description of drawings
附图用于更好地理解本方案,不构成对本公开的限定。其中:The accompanying drawings are used to better understand the present solution and do not constitute a limitation of the present disclosure. in:
图1是根据本公开的一个实施例可以应用于其中的示例性系统架构图;1 is an exemplary system architecture diagram to which an embodiment of the present disclosure may be applied;
图2是根据本公开的信息流推荐方法的一个实施例的流程图;Figure 2 is a flow chart of an embodiment of an information flow recommendation method according to the present disclosure;
图3是根据本实施例的信息流推荐方法的应用场景的示意图;Figure 3 is a schematic diagram of an application scenario of the information flow recommendation method according to this embodiment;
图4是根据本公开的信息流推荐方法的又一个实施例的流程图;Figure 4 is a flow chart of yet another embodiment of an information flow recommendation method according to the present disclosure;
图5是根据本公开的模型训练方法的一个实施例的流程图;Figure 5 is a flow chart of one embodiment of a model training method according to the present disclosure;
图6是根据本公开的信息流推荐装置的一个实施例的结构图;Figure 6 is a structural diagram of an embodiment of an information flow recommendation device according to the present disclosure;
图7是根据本公开的模型训练装置的一个实施例的结构图; Figure 7 is a structural diagram of an embodiment of a model training device according to the present disclosure;
图8是适于用来实现本公开实施例的计算机系统的结构示意图。FIG. 8 is a schematic structural diagram of a computer system suitable for implementing embodiments of the present disclosure.
具体实施方式Detailed ways
以下结合附图对本公开的示范性实施例做出说明,其中包括本公开实施例的各种细节以助于理解,应当将它们认为仅仅是示范性的。因此,本领域普通技术人员应当认识到,可以对这里描述的实施例做出各种改变和修改,而不会背离本公开的范围和精神。同样,为了清楚和简明,以下的描述中省略了对公知功能和结构的描述。Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, in which various details of the embodiments of the present disclosure are included to facilitate understanding and should be considered to be exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.
本公开的技术方案中,所涉及的用户个人信息的收集、存储、使用、加工、传输、提供和公开等处理,均符合相关法律法规的规定,且不违背公序良俗。In the technical solution of this disclosure, the collection, storage, use, processing, transmission, provision and disclosure of user personal information are in compliance with relevant laws and regulations and do not violate public order and good customs.
图1示出了可以应用本公开的信息流推荐方法及装置、模型训练方法及装置的示例性架构100。Figure 1 shows an exemplary architecture 100 in which the information flow recommendation method and device, and the model training method and device of the present disclosure can be applied.
如图1所示,系统架构100可以包括终端设备101、102、103,网络104和服务器105。终端设备101、102、103之间通信连接构成拓扑网络,网络104用以在终端设备101、102、103和服务器105之间提供通信链路的介质。网络104可以包括各种连接类型,例如有线、无线通信链路或者光纤电缆等等。As shown in Figure 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104 and a server 105. The communication connections between terminal devices 101, 102, and 103 constitute a topological network, and the network 104 is used to provide a medium for communication links between the terminal devices 101, 102, and 103 and the server 105. Network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
终端设备101、102、103可以是支持网络连接从而进行数据交互和数据处理的硬件设备或软件。当终端设备101、102、103为硬件时,其可以是支持网络连接,信息获取、交互、显示、处理等功能的各种电子设备,包括但不限于智能手机、平板电脑、电子书阅读器、膝上型便携计算机和台式计算机等等。当终端设备101、102、103为软件时,可以安装在上述所列举的电子设备中。其可以实现成例如用来提供分布式服务的多个软件或软件模块,也可以实现成单个软件或软件模块。在此不做具体限定。The terminal devices 101, 102, and 103 may be hardware devices or software that support network connection for data interaction and data processing. When the terminal devices 101, 102, and 103 are hardware, they can be various electronic devices that support network connection, information acquisition, interaction, display, processing and other functions, including but not limited to smart phones, tablet computers, e-book readers, Laptops and desktop computers and more. When the terminal devices 101, 102, and 103 are software, they can be installed in the electronic devices listed above. It may be implemented, for example, as multiple software or software modules for providing distributed services, or as a single software or software module. There are no specific limitations here.
服务器105可以是提供各种服务的服务器,例如,根据终端设备101、102、103对应的用户的在信息流推荐场景下的特征信息,通过多因子融合参数网络确定对应于用户的各因子的第一权重,通过门控筛选网络确定各因子的第二权重,以根据第一权重和第二权重准确地确定出适用于用户的目标因子,进行信息流推荐的后台处理服务器。又例如,根据终端设备101、 102、103提供的对于推荐结果的反馈信息,基于进化策略训练得到多因子融合参数网络和门控筛选网络的后台处理服务器。作为示例,服务器105可以是云端服务器。The server 105 may be a server that provides various services. For example, based on the characteristic information of the users corresponding to the terminal devices 101, 102, 103 in the information flow recommendation scenario, the multi-factor fusion parameter network determines the third factor corresponding to each factor of the user. One weight, determine the second weight of each factor through the gated screening network, so as to accurately determine the target factor suitable for the user based on the first weight and the second weight, and perform the background processing server for information flow recommendation. For another example, according to the terminal device 101, The feedback information for the recommendation results provided by 102 and 103 is based on the evolutionary strategy training to obtain the background processing server of the multi-factor fusion parameter network and the gated screening network. As an example, server 105 may be a cloud server.
需要说明的是,服务器可以是硬件,也可以是软件。当服务器为硬件时,可以实现成多个服务器组成的分布式服务器集群,也可以实现成单个服务器。当服务器为软件时,可以实现成多个软件或软件模块(例如用来提供分布式服务的软件或软件模块),也可以实现成单个软件或软件模块。在此不做具体限定。It should be noted that the server can be hardware or software. When the server is hardware, it can be implemented as a distributed server cluster composed of multiple servers or as a single server. When the server is software, it can be implemented as multiple software or software modules (for example, software or software modules used to provide distributed services), or it can be implemented as a single software or software module. There are no specific limitations here.
还需要说明的是,本公开的实施例所提供的信息流推荐方法、模型训练方法可以由服务器执行,也可以由终端设备执行,还可以由服务器和终端设备彼此配合执行。相应地,信息流推荐装置、模型训练装置包括的各个部分(例如各个单元)可以全部设置于服务器中,也可以全部设置于终端设备中,还可以分别设置于服务器和终端设备中。It should also be noted that the information flow recommendation method and model training method provided by the embodiments of the present disclosure can be executed by the server or by the terminal device, or can be executed by the server and the terminal device in cooperation with each other. Correspondingly, various parts (for example, each unit) included in the information flow recommendation device and the model training device can be all installed in the server, or they can all be installed in the terminal device, or they can be installed in the server and the terminal device respectively.
应该理解,图1中的终端设备、网络和服务器的数目仅仅是示意性的。根据实现需要,可以具有任意数目的终端设备、网络和服务器。当信息流推荐方法、模型训练方法运行于其上的电子设备不需要与其他电子设备进行数据传输时,该系统架构可以仅包括信息流推荐方法、模型训练方法运行于其上的电子设备(例如服务器或终端设备)。It should be understood that the number of terminal devices, networks and servers in Figure 1 is only illustrative. Depending on implementation needs, there can be any number of end devices, networks, and servers. When the electronic device on which the information flow recommendation method and the model training method runs does not need to transmit data with other electronic devices, the system architecture may only include the electronic device on which the information flow recommendation method and the model training method run (for example, server or terminal device).
请参考图2,图2为本公开实施例提供的一种信息流推荐方法的流程图,其中,流程200包括以下步骤:Please refer to Figure 2, which is a flow chart of an information flow recommendation method provided by an embodiment of the present disclosure. The process 200 includes the following steps:
步骤201,获取第一用户在信息流推荐场景下的特征信息。Step 201: Obtain the characteristic information of the first user in the information flow recommendation scenario.
本实施例中,信息流推荐方法的执行主体(例如,图1中的终端设备或服务器)可以基于有线网络连接方式或无线网络连接方式从远程,或从本地获取第一用户在信息流推荐场景下的特征信息。In this embodiment, the execution subject of the information flow recommendation method (for example, the terminal device or server in Figure 1) can obtain the information flow recommendation scenario of the first user remotely or locally based on a wired network connection or a wireless network connection. feature information below.
其中,第一用户为待进行信息流推荐的用户。信息流推荐场景可以是各种类型的信息流对应的推荐场景。例如,在新闻类应用中,信息流推荐场景为确定用户感兴趣的新闻类信息流;在视频类应用中,信息流推荐场景为确定用户感兴趣的视频类信息流。Among them, the first user is a user to be recommended for information flow. Information flow recommendation scenarios can be recommendation scenarios corresponding to various types of information flows. For example, in news applications, the information flow recommendation scenario is to determine the news information flow that the user is interested in; in video applications, the information flow recommendation scenario is to determine the video information flow that the user is interested in.
第一用户在信息流推荐场景下的特征信息包括第一用户的用户特征 信息和信息流推荐场景的场景特征信息。作为示例,用户特征信息包括用户活跃度、年龄、性别、产品日均使用时长、使用次数等;场景特征信息包括刷新状态、刷新次数、刷新时间等。The characteristic information of the first user in the information flow recommendation scenario includes the user characteristics of the first user Scene feature information of information and information flow recommendation scenarios. As an example, user characteristic information includes user activity, age, gender, average daily product usage time, number of uses, etc.; scene characteristic information includes refresh status, refresh times, refresh time, etc.
步骤202,通过多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重。Step 202: Determine the first weight corresponding to each factor in the factor set according to the feature information through the multi-factor fusion parameter network.
本实施例中,上述执行主体可以通过多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重。In this embodiment, the above-mentioned execution subject can determine the first weight corresponding to each factor in the factor set according to the feature information through the multi-factor fusion parameter network.
其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息。例如,因子集合中包括阅读时长、展现资源的多样性、用户点赞量、分享量等因子。Among them, the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process. For example, the factor set includes factors such as reading time, diversity of displayed resources, number of user likes, number of shares, etc.
本实现方式中,为了提高信息流推荐的针对性,可以为不同的信息流推荐场景确定不同的因子集合。每一信息流推荐场景对应的因子集合包括针对于该息流推荐场景的多个因子。In this implementation, in order to improve the pertinence of information flow recommendation, different factor sets can be determined for different information flow recommendation scenarios. The factor set corresponding to each information flow recommendation scenario includes multiple factors for the information flow recommendation scenario.
作为示例,多因子融合参数网络中包括多个塔网络,每个塔网络对应于因子集合中的一个因子。在多因子融合参数网络的底层,多个塔网络共享特征信息,多个塔网络用于输出所对应的因子的第一权重。每个塔网络可以是一个神经网络,包括但不限于是卷积神经网络、循环神经网络等网络模型。As an example, a multi-factor fusion parameter network includes multiple tower networks, each tower network corresponding to a factor in the factor set. At the bottom of the multi-factor fusion parameter network, multiple tower networks share feature information, and multiple tower networks are used to output the first weight of the corresponding factor. Each tower network can be a neural network, including but not limited to convolutional neural network, recurrent neural network and other network models.
步骤203,通过门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重。Step 203: Determine the second weight corresponding to each factor in the factor set according to the characteristic information through the gated screening network.
本实施例中,上述执行主体可以通过门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重。In this embodiment, the above-mentioned execution subject can determine the second weight corresponding to each factor in the factor set according to the characteristic information through the gated screening network.
门控筛选网络可以基于门控循环神经网络实现。作为示例,上述执行主体可以根据因子集合中包括的因子的数量,确定门控筛选网络的网络结构,以使得门控筛选网络的输出的数量与因子集合中包括的因子的数量一致,且因子集合中包括的因子与门控筛选网络的输出一一对应。门控筛选网络的多个输出即为因子集合中的各因子对应的第二权重。Gated screening networks can be implemented based on gated recurrent neural networks. As an example, the above execution subject can determine the network structure of the gated screening network according to the number of factors included in the factor set, so that the number of outputs of the gated screening network is consistent with the number of factors included in the factor set, and the factor set The factors included in correspond to the output of the gated screening network. The multiple outputs of the gated screening network are the second weights corresponding to each factor in the factor set.
步骤204,根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第一用户的目标因子。Step 204: Based on the first weight and the second weight, determine the target factor in the factor set that is suitable for the first user in the information flow recommendation scenario.
本实施例中,上述执行主体可以根据第一权重和第二权重,确定因子 集合中适用于信息流推荐场景下的第一用户的目标因子。In this embodiment, the above execution subject can determine the factor based on the first weight and the second weight. The target factor in the set that is suitable for the first user in the information flow recommendation scenario.
作为示例,对于因子集合中的每个因子,上述执行主体可以确定该因子对应的第一权重和该因子对应的第二权重,并通过求和、加权求和等方式确定总权重;进而对总权重排序,将排序在前的预设数量个因子确定为目标因子,或者将总权重大于预设数值的因子确定为目标因子。As an example, for each factor in the factor set, the above-mentioned executive body can determine the first weight corresponding to the factor and the second weight corresponding to the factor, and determine the total weight through summation, weighted summation, etc.; and then calculate the total weight. In weight sorting, the preset number of factors ranked first are determined as the target factors, or the factors whose total weight is greater than the preset value are determined as the target factors.
步骤205,根据目标因子,确定并向第一用户推送信息流推荐场景下的第一用户对应的推荐结果。Step 205: Based on the target factor, determine and push the recommendation result corresponding to the first user in the information flow recommendation scenario to the first user.
本实施例中,上述执行主体可以根据目标因子,确定并向第一用户推送信息流推荐场景下的第一用户对应的推荐结果。In this embodiment, the above execution subject may determine and push the recommendation result corresponding to the first user in the information flow recommendation scenario to the first user based on the target factor.
作为示例,对于所确定的每个目标因子,上述执行主体可以首先基于该目标因子对应的第一权重和第二权重得到该目标因子的总权重;进而,基于相对应的目标因子与总权重,得每个目标因子对应的加权项,以组合各目标因子的加权项,得到多目标因子融合公式。As an example, for each target factor determined, the above-mentioned execution subject can first obtain the total weight of the target factor based on the first weight and the second weight corresponding to the target factor; then, based on the corresponding target factor and the total weight, The weighted items corresponding to each target factor are obtained to combine the weighted items of each target factor to obtain the multi-target factor fusion formula.
得到多目标因子融合公式之后,可以通过多目标因子融合公式确定预设待推荐内容集合中的待排序内容的推荐排序得分;基于推荐排序得分对待排序内容集合中的待排序内容进行排序,以将排序在前的预设数量个待排序内容作为第一用户对应的推荐结果,推送给第一用户。After obtaining the multi-objective factor fusion formula, the recommendation ranking score of the content to be sorted in the preset content collection to be recommended can be determined through the multi-objective factor fusion formula; the content to be sorted in the content collection to be sorted is sorted based on the recommendation ranking score, so as to The preset number of top-ordered contents to be sorted are pushed to the first user as the recommendation results corresponding to the first user.
继续参见图3,图3是根据本实施例的信息流推荐方法的应用场景的一个示意图300。在图3的应用场景中,用户301通过终端设备302向短视频类应用发出启动指令。服务器303首先基于开启指令获取用户301在短视频类信息流推荐场景下的特征信息;然后,通过多因子融合参数网络304,根据特征信息确定因子集合中的各因子对应的第一权重305,其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;然后,通过门控筛选网络306,根据特征信息确定因子集合中的各因子对应的第二权重307;然后,根据第一权重305和第二权重307,确定因子集合中适用于信息流推荐场景下的第一用户的目标因子308;根据目标因子308,确定并向用户301推送信息流推荐场景下的第一用户对应的推荐结果309。Continuing to refer to Figure 3, Figure 3 is a schematic diagram 300 of an application scenario of the information flow recommendation method according to this embodiment. In the application scenario of Figure 3, the user 301 issues a startup instruction to a short video application through the terminal device 302. The server 303 first obtains the feature information of the user 301 in the short video information stream recommendation scenario based on the opening instruction; then, through the multi-factor fusion parameter network 304, determines the first weight 305 corresponding to each factor in the factor set based on the feature information, where , the factors in the factor set represent the index information that needs to be considered in the information flow recommendation process; then, through the gated filtering network 306, the second weight 307 corresponding to each factor in the factor set is determined according to the feature information; then, according to the first The weight 305 and the second weight 307 are used to determine the target factor 308 in the factor set that is suitable for the first user in the information flow recommendation scenario; based on the target factor 308, determine and push to the user 301 the target factor 308 corresponding to the first user in the information flow recommendation scenario. Recommended results 309.
本实施例中,提供了一种信息流推荐方法,在信息流推荐场景下,通过多因子融合参数网络确定对应于用户的各因子的第一权重,通过门控筛 选网络确定各因子的第二权重,以根据第一权重和第二权重准确地确定出适用于用户的目标因子,进行信息流推荐,提高了推荐结果的准确度。In this embodiment, an information flow recommendation method is provided. In the information flow recommendation scenario, the first weight of each factor corresponding to the user is determined through a multi-factor fusion parameter network, and the first weight of each factor corresponding to the user is determined through the gated filter. The selection network determines the second weight of each factor, so as to accurately determine the target factors suitable for the user based on the first weight and the second weight, and perform information flow recommendation, which improves the accuracy of the recommendation results.
在本实施例的一些可选的实现方式中,上述执行主体可以通过如下方式执行上述步骤203:In some optional implementations of this embodiment, the above execution subject may perform the above step 203 in the following manner:
首先,通过门控筛选网络,根据特征信息确定因子集合中的各因子对应的初始第二权重;然后,通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。First, through the gated screening network, the initial second weight corresponding to each factor in the factor set is determined based on the feature information; then, the initial second weight corresponding to each factor is converted into 0 or 1 through the preset activation function to obtain each factor. The corresponding second weight.
作为示例,门控筛选网络最上层的预设激活函数的值域为0和1,从而巧妙地将连续值问题转化为0/1问题。作为示例,其预设激活函数可以为:
As an example, the value range of the preset activation function at the top layer of the gated screening network is 0 and 1, thus cleverly transforming the continuous value problem into a 0/1 problem. As an example, its preset activation function can be:
本实现方式中,对于因子集合中的每个因子,门控筛选网络输出的第二权重为0或者1,从而巧妙地将连续值问题转化为0/1问题,提高了目标因子的确定过程的高效性和便捷性。In this implementation, for each factor in the factor set, the second weight output by the gated screening network is 0 or 1, thus cleverly converting the continuous value problem into a 0/1 problem and improving the efficiency of the target factor determination process. Efficiency and convenience.
在本实施例的一些可选的实现方式中,上述执行主体可以通过如下方式执行上述步骤204:In some optional implementations of this embodiment, the above execution subject may perform the above step 204 in the following manner:
首先,将因子集合中的各因子对应的第一权重和第二权重相乘,得到权重乘积;然后,根据因子集合中的各因子对应的权重乘积,确定因子集合中适用于信息流推荐场景下的第一用户的目标因子。First, multiply the first weight and the second weight corresponding to each factor in the factor set to obtain the weight product; then, based on the weight product corresponding to each factor in the factor set, determine the information flow recommendation scenario in the factor set. The target factor of the first user.
当门控筛选网络输出的第二权重为0或者1时,可以便捷地确定各因子对应的权重乘积为该因子对应的第一权重或零;进而,去除权重乘积为零的因子,保留权重乘积非零的因子,得到目标因子。When the second weight output by the gated screening network is 0 or 1, the weight product corresponding to each factor can be easily determined to be the first weight corresponding to the factor or zero; then, factors whose weight product is zero are removed and the weight product is retained Non-zero factors are used to obtain the target factors.
本实现方式中,基于各因子对应第一权重和第二权重的权重乘积,确定目标因子,进一步提高了目标因子确定过程的便捷性和准确度。In this implementation, the target factor is determined based on the weight product of the first weight and the second weight corresponding to each factor, which further improves the convenience and accuracy of the target factor determination process.
在本实施例的一些可选的实现方式中,上述执行主体还可以执行如下操作:第一,获取第一用户对于推荐结果的反馈信息。In some optional implementations of this embodiment, the above execution subject may also perform the following operations: first, obtain the first user's feedback information on the recommendation results.
其中,反馈信息可以是第一用户在获取推荐结果之后,对于推荐结果中的信息流的反映信息。作为示例,反馈信息包括是否点击,是否查看,是否做出点赞、评论等互动操作。The feedback information may be the first user's reflection information on the information flow in the recommended results after obtaining the recommended results. As examples, feedback information includes whether to click, whether to view, whether to like, comment and other interactive operations.
第二,采用进化策略,根据反馈信息调整多因子融合参数网络的参数 和门控筛选网络的参数,以通过调整后的多因子融合参数网络和门控筛选网络执行后续用户在信息流推荐场景下的推荐任务。Second, an evolutionary strategy is adopted to adjust the parameters of the multi-factor fusion parameter network based on feedback information. and the parameters of the gated screening network to perform subsequent user recommendation tasks in the information flow recommendation scenario through the adjusted multi-factor fusion parameter network and gated screening network.
进化策略算法指基于进化理论的算法,可通过该算法进行探索,寻找使多因子融合参数网络和门控筛选网络的总体回报更大的参数扰动。具体的,根据反馈信息和预设回报函数,确定多因子融合参数网络的参数和门控筛选网络的参数的回报值;基于回报值最大化的原则,指导多因子融合参数网络的参数和门控筛选网络的参数的调整过程;基于预设进化策略算法按照预设迭代次数进行迭代,生成满足每一参数的均值和方差的高斯分布的新一轮次的多因子融合参数网络的参数和门控筛选网络的参数。Evolutionary strategy algorithms refer to algorithms based on evolutionary theory that can be used to explore parameter perturbations that make the overall return of multi-factor fusion parameter networks and gated screening networks greater. Specifically, based on the feedback information and the preset reward function, the parameters of the multi-factor fusion parameter network and the reward values of the parameters of the gated screening network are determined; based on the principle of maximizing the reward value, the parameters and gating of the multi-factor fusion parameter network are guided. The adjustment process of filtering network parameters; based on the preset evolutionary strategy algorithm, iterates according to the preset number of iterations to generate a new round of parameters and gating of the multi-factor fusion parameter network that satisfies the Gaussian distribution of the mean and variance of each parameter. Filter network parameters.
将采用调整参数后的多因子融合参数网络和门控筛选网络执行后续用户的推荐任务。The multi-factor fusion parameter network and gated screening network after adjusting parameters will be used to perform subsequent user recommendation tasks.
本实现方式中,上述执行主体在多因子融合参数网络和门控筛选网络的应用过程中,采用进化策略进行多因子融合参数网络和门控筛选网络的调整,可以持续提高多因子融合参数网络和门控筛选网络的准确度。In this implementation, the above-mentioned executive body uses an evolutionary strategy to adjust the multi-factor fusion parameter network and gated screening network during the application process, which can continuously improve the multi-factor fusion parameter network and gated screening network. Accuracy of gated screening networks.
继续参考图4,示出了根据本公开的信息流推荐方法的又一个实施例的示意性流程400,包括以下步骤:Continuing to refer to Figure 4, a schematic process 400 of yet another embodiment of the information flow recommendation method according to the present disclosure is shown, including the following steps:
步骤401,获取第一用户在信息流推荐场景下的特征信息。Step 401: Obtain the characteristic information of the first user in the information flow recommendation scenario.
步骤402,通过多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重。Step 402: Determine the first weight corresponding to each factor in the factor set according to the feature information through the multi-factor fusion parameter network.
其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息。Among them, the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process.
步骤403,通过门控筛选网络,根据特征信息确定因子集合中的各因子对应的初始第二权重。Step 403: Determine the initial second weight corresponding to each factor in the factor set according to the characteristic information through the gated screening network.
步骤404,通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。Step 404: Convert the initial second weight corresponding to each factor to 0 or 1 through a preset activation function to obtain the second weight corresponding to each factor.
步骤405,将因子集合中的各因子对应的第一权重和第二权重相乘,得到权重乘积。Step 405: Multiply the first weight and the second weight corresponding to each factor in the factor set to obtain a weight product.
步骤406,根据因子集合中的各因子对应的权重乘积,确定因子集合中适用于信息流推荐场景下的第一用户的目标因子。Step 406: Determine the target factor in the factor set that is suitable for the first user in the information flow recommendation scenario based on the weight product corresponding to each factor in the factor set.
步骤407,根据目标因子,确定并向第一用户推送信息流推荐场景下 的第一用户对应的推荐结果。Step 407: Determine and push the information flow recommendation scenario to the first user based on the target factor. The recommendation result corresponding to the first user.
从本实施例中可以看出,与图2对应的实施例相比,本实施例中的信息流推荐方法的流程400具体说明了第二权重的确定过程,目标因子的确定过程,进一步提高了目标因子的确定过程的高效性和便捷性,提高了推荐结果的准确度。It can be seen from this embodiment that compared with the embodiment corresponding to Figure 2, the process 400 of the information flow recommendation method in this embodiment specifically illustrates the determination process of the second weight and the determination process of the target factor, further improving the The efficiency and convenience of the target factor determination process improves the accuracy of the recommendation results.
继续参考图5,示出了根据本公开的模型训练方法的一个实施例的示意性流程500,包括以下步骤:Continuing to refer to Figure 5, a schematic process 500 of one embodiment of a model training method according to the present disclosure is shown, including the following steps:
步骤501,获取第二用户在信息流推荐场景下的特征信息。Step 501: Obtain the characteristic information of the second user in the information flow recommendation scenario.
本实施例中,模型训练方法的执行主体(例如,图1中的终端设备或服务器)可以基于有线网络连接方式或无线网络连接方式从远程,或从本地获取第二用户在信息流推荐场景下的特征信息。In this embodiment, the execution subject of the model training method (for example, the terminal device or server in Figure 1) can obtain the second user's information flow recommendation scenario remotely or locally based on a wired network connection or a wireless network connection. characteristic information.
其中,第二用户为初始多因子融合参数网络和初始门控筛选网络的训练过程中,待进行信息流推荐的用户。在模型训练过程中,一般涉及多个第二用户。针对于每个第二用户,均可以执行步骤501-506所示的训练过程。Among them, the second user is the user to be recommended for information flow during the training process of the initial multi-factor fusion parameter network and the initial gate screening network. During the model training process, multiple second users are generally involved. For each second user, the training process shown in steps 501-506 can be performed.
信息流推荐场景可以是各种类型的信息流对应的推荐场景。例如,在新闻类应用中,信息流推荐场景为确定用户的新闻类信息流;在视频类应用中,信息流推荐场景为确定用户的视频类信息流。Information flow recommendation scenarios can be recommendation scenarios corresponding to various types of information flows. For example, in news applications, the information flow recommendation scenario is to determine the user's news information flow; in video applications, the information flow recommendation scenario is to determine the user's video information flow.
第二用户在信息流推荐场景下的特征信息包括第二用户的用户特征信息和信息流推荐场景的场景特征信息。作为示例,用户特征信息包括用户活跃度、年龄、性别、产品日均使用时长、使用次数等;场景特征信息包括刷新状态、刷新次数、刷新时间等。The characteristic information of the second user in the information flow recommendation scenario includes the user characteristic information of the second user and the scene characteristic information of the information flow recommendation scenario. As an example, user characteristic information includes user activity, age, gender, average daily product usage time, number of uses, etc.; scene characteristic information includes refresh status, refresh times, refresh time, etc.
步骤502,通过初始多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重。Step 502: Determine the first weight corresponding to each factor in the factor set according to the feature information through the initial multi-factor fusion parameter network.
本实施例中,上述执行主体可以通过初始多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重。In this embodiment, the above-mentioned execution subject can determine the first weight corresponding to each factor in the factor set according to the characteristic information through the initial multi-factor fusion parameter network.
其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息。例如,因子集合中包括阅读时长、展现资源的多样性、用户点赞量、分享量等因子。 Among them, the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process. For example, the factor set includes factors such as reading time, diversity of displayed resources, number of user likes, number of shares, etc.
本实现方式中,为了提高信息流推荐的针对性,可以为不同的信息流推荐场景确定不同的因子集合。每一信息流推荐场景对应的因子集合包括针对于该息流推荐场景的多个因子。In this implementation, in order to improve the pertinence of information flow recommendation, different factor sets can be determined for different information flow recommendation scenarios. The factor set corresponding to each information flow recommendation scenario includes multiple factors for the information flow recommendation scenario.
作为示例,初始多因子融合参数网络中包括多个塔网络,每个塔网络对应于因子集合中的一个因子。在初始多因子融合参数网络的底层,多个塔网络共享特征信息,多个塔网络用于输出所对应的因子的第一权重。每个塔网络可以是一个神经网络,包括但不限于是卷积神经网络、循环神经网络等网络模型。As an example, the initial multi-factor fusion parameter network includes multiple tower networks, each tower network corresponding to a factor in the factor set. At the bottom of the initial multi-factor fusion parameter network, multiple tower networks share feature information, and multiple tower networks are used to output the first weight of the corresponding factor. Each tower network can be a neural network, including but not limited to convolutional neural network, recurrent neural network and other network models.
步骤503,通过初始门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重。Step 503: Determine the second weight corresponding to each factor in the factor set according to the characteristic information through the initial gating screening network.
本实施例中,上述执行主体可以通过初始门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重。In this embodiment, the above-mentioned execution subject can determine the second weight corresponding to each factor in the factor set according to the characteristic information through the initial gating screening network.
初始门控筛选网络可以基于门控循环神经网络实现。作为示例,上述执行主体可以根据因子集合中包括的因子的数量,确定初始门控筛选网络的网络结构,以使得初始门控筛选网络的输出的数量与因子集合中包括的因子的数量一致,且因子集合中包括的因子与初始门控筛选网络的输出一一对应。初始门控筛选网络的多个输出即为因子集合中的各因子对应的第二权重。The initial gated screening network can be implemented based on gated recurrent neural networks. As an example, the above execution subject can determine the network structure of the initial gating screening network according to the number of factors included in the factor set, so that the number of outputs of the initial gating screening network is consistent with the number of factors included in the factor set, and The factors included in the factor set correspond one-to-one with the output of the initial gated screening network. The multiple outputs of the initial gated screening network are the second weights corresponding to each factor in the factor set.
步骤504,根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第二用户的目标因子。Step 504: Based on the first weight and the second weight, determine the target factor in the factor set that is suitable for the second user in the information flow recommendation scenario.
本实施例中,上述执行主体可以根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第二用户的目标因子。In this embodiment, the above execution subject may determine the target factor in the factor set that is suitable for the second user in the information flow recommendation scenario based on the first weight and the second weight.
作为示例,对于因子集合中的每个因子,上述执行主体可以确定该因子对应的第一权重和该因子对应的第二权重,并通过求和、加权求和等方式确定总权重;进而对总权重排序,将排序在前的预设数量个因子确定为目标因子,或者将总权重大于预设数值的因子确定为目标因子。As an example, for each factor in the factor set, the above-mentioned executive body can determine the first weight corresponding to the factor and the second weight corresponding to the factor, and determine the total weight through summation, weighted summation, etc.; and then calculate the total weight. In weight sorting, the preset number of factors ranked first are determined as the target factors, or the factors whose total weight is greater than the preset value are determined as the target factors.
步骤505,根据目标因子确定信息流推荐场景下的第二用户对应的推荐结果。Step 505: Determine the recommendation result corresponding to the second user in the information flow recommendation scenario according to the target factor.
本实施例中,上述执行主体可以根据目标因子确定信息流推荐场景下的第二用户对应的推荐结果。 In this embodiment, the above execution subject can determine the recommendation result corresponding to the second user in the information flow recommendation scenario according to the target factor.
作为示例,对于所确定的每个目标因子,上述执行主体可以首先基于该目标因子对应的第一权重和第二权重得到该目标因子的总权重;进而,基于相对应的目标因子与总权重,得每个目标因子对应的加权项,以组合各目标因子的加权项,得到多目标因子融合公式。As an example, for each target factor determined, the above-mentioned execution subject can first obtain the total weight of the target factor based on the first weight and the second weight corresponding to the target factor; then, based on the corresponding target factor and the total weight, The weighted items corresponding to each target factor are obtained to combine the weighted items of each target factor to obtain the multi-target factor fusion formula.
得到多目标因子融合公式之后,可以通过多目标因子融合公式确定预设待推荐内容集合中的待排序内容的推荐排序得分;基于推荐排序得分对待排序内容集合中的待排序内容进行排序,以将排序在前的预设数量个待排序内容作为第二用户对应的推荐结果,推送给第二用户。After obtaining the multi-objective factor fusion formula, the recommendation ranking score of the content to be sorted in the preset content collection to be recommended can be determined through the multi-objective factor fusion formula; the content to be sorted in the content collection to be sorted is sorted based on the recommendation ranking score, so as to The preset number of top-ordered contents to be sorted are pushed to the second user as the recommendation results corresponding to the second user.
步骤506,采用进化策略,根据第二用户对于推荐结果的反馈信息,调整初始多因子融合参数网络的参数和初始门控筛选网络的参数,以得到训练后的多因子融合参数网络和门控筛选网络。Step 506: Use an evolutionary strategy to adjust the parameters of the initial multi-factor fusion parameter network and the parameters of the initial gate screening network based on the second user's feedback information on the recommendation results to obtain the trained multi-factor fusion parameter network and gate screening. network.
本实施例中,上述执行主体可以采用进化策略,根据第二用户对于推荐结果的反馈信息,调整初始多因子融合参数网络的参数和初始门控筛选网络的参数,以得到训练后的多因子融合参数网络和门控筛选网络。In this embodiment, the above-mentioned execution subject can adopt an evolutionary strategy to adjust the parameters of the initial multi-factor fusion parameter network and the parameters of the initial gated screening network according to the feedback information of the second user on the recommendation results to obtain the multi-factor fusion after training. Parametric networks and gated screening networks.
具体的,根据反馈信息和预设回报函数,确定初始多因子融合参数网络的参数和初始门控筛选网络的参数的回报值;基于回报值最大化的原则,指导初始多因子融合参数网络的参数和初始门控筛选网络的参数的调整过程;基于预设进化策略算法按照预设迭代次数进行迭代,生成满足每一参数的均值和方差的高斯分布的新一轮次的多因子融合参数网络的参数和门控筛选网络的参数,将调整后的多因子融合参数网络和门控筛选网络作为下一轮训练的初始多因子融合参数网络和初始门控筛选网络。Specifically, based on the feedback information and the preset reward function, the parameters of the initial multi-factor fusion parameter network and the reward values of the parameters of the initial gated screening network are determined; based on the principle of maximizing the reward value, the parameters of the initial multi-factor fusion parameter network are guided. and the adjustment process of the parameters of the initial gated screening network; based on the preset evolutionary strategy algorithm, iterate according to the preset number of iterations to generate a new round of multi-factor fusion parameter network that satisfies the Gaussian distribution of the mean and variance of each parameter. Parameters and parameters of the gated screening network, and the adjusted multi-factor fusion parameter network and gated screening network are used as the initial multi-factor fusion parameter network and initial gated screening network for the next round of training.
通过迭代执行上述训练操作,响应于达到预设结束条件,得到训练后的多因子融合参数网络和门控筛选网络。其中,预设结束条件例如可以是迭代次数超过预设次数阈值、训练时间超过预设时间阈值等。By iteratively performing the above training operation, in response to reaching the preset end condition, the trained multi-factor fusion parameter network and gated screening network are obtained. The preset end condition may be, for example, that the number of iterations exceeds a preset times threshold, the training time exceeds a preset time threshold, etc.
本实施例中,基于门控筛选网络对于目标因子的筛选,有效提高了进化策略的进化效率,同时能从全局优化的角度触发自动筛选有益于全局的目标因子。In this embodiment, the screening of target factors based on the gated screening network effectively improves the evolutionary efficiency of the evolutionary strategy, and at the same time triggers the automatic screening of target factors that are beneficial to the whole from the perspective of global optimization.
在本实施例的一些可选的实现方式中,上述执行主体可以通过如下方式执行上述步骤503:In some optional implementations of this embodiment, the above execution subject may perform the above step 503 in the following manner:
首先,通过初始门控筛选网络,根据特征信息确定因子集合中的各因 子对应的初始第二权重;然后,通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。First, through the initial gating screening network, each factor in the factor set is determined based on the characteristic information. The initial second weight corresponding to the sub-factor; then, the initial second weight corresponding to each factor is converted into 0 or 1 through the preset activation function to obtain the second weight corresponding to each factor.
作为示例,初始门控筛选网络最上层的预设激活函数的值域为0和1,从而巧妙地将连续值问题转化为0/1问题。作为示例,其预设激活函数可以为:
As an example, the value range of the preset activation function at the top layer of the initial gated screening network is 0 and 1, thus cleverly transforming the continuous value problem into a 0/1 problem. As an example, its preset activation function can be:
本实现方式中,对于因子集合中的每个因子,初始门控筛选网络输出的第二权重为0或者1,从而巧妙地将连续值问题转化为0/1问题,进一步提高了进化策略的进化效率。In this implementation, for each factor in the factor set, the second weight output by the initial gate screening network is 0 or 1, thus cleverly transforming the continuous value problem into a 0/1 problem, further improving the evolution of the evolutionary strategy. efficiency.
在本实施例的一些可选的实现方式中,上述执行主体可以通过如下方式执行上述步骤504:In some optional implementations of this embodiment, the above execution subject may perform the above step 504 in the following manner:
首先,将因子集合中的各因子对应的第一权重和第二权重相乘,得到权重乘积;然后,根据因子集合中的各因子对应的权重乘积,确定因子集合中适用于信息流推荐场景下的第二用户的目标因子。First, multiply the first weight and the second weight corresponding to each factor in the factor set to obtain the weight product; then, based on the weight product corresponding to each factor in the factor set, determine the information flow recommendation scenario in the factor set. The target factor of the second user.
当初始门控筛选网络输出的第二权重为0或者1时,可以便捷地确定各因子对应的权重乘积为该因子对应的第一权重或零;进而,去除权重乘积为零的因子,保留权重乘积非零的因子,得到目标因子。When the second weight output by the initial gated screening network is 0 or 1, the weight product corresponding to each factor can be conveniently determined to be the first weight corresponding to the factor or zero; then, factors whose weight product is zero are removed and the weights are retained Multiply the non-zero factors to get the target factor.
本实现例中,基于各因子对应第一权重和第二权重的权重乘积,确定目标因子,进一步提高了模型训练过程中,目标因子确定过程的便捷性和准确度。In this implementation example, the target factor is determined based on the weight product of the first weight and the second weight corresponding to each factor, which further improves the convenience and accuracy of the target factor determination process during model training.
继续参考图6,作为对上述各图所示方法的实现,本公开提供了一种信息流推荐装置的一个实施例,该装置实施例与图2所示的方法实施例相对应,该装置具体可以应用于各种电子设备中。Continuing to refer to Figure 6, as an implementation of the methods shown in the above figures, the present disclosure provides an embodiment of an information flow recommendation device. The device embodiment corresponds to the method embodiment shown in Figure 2. The device is specifically Can be used in various electronic devices.
如图6所示,信息流推荐装置600包括:第一获取单元601,被配置成获取第一用户在信息流推荐场景下的特征信息;第一确定单元602,被配置成通过多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重,其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;第二确定单元603,被配置成通过门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重;第三确定单元604,被 配置成根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第一用户的目标因子;推荐单元605,被配置成根据目标因子,确定并向第一用户推送信息流推荐场景下的第一用户对应的推荐结果。As shown in Figure 6, the information flow recommendation device 600 includes: a first acquisition unit 601, configured to obtain the characteristic information of the first user in an information flow recommendation scenario; a first determination unit 602, configured to fuse parameters through multiple factors The network determines the first weight corresponding to each factor in the factor set according to the characteristic information, where the factors in the factor set represent the index information that needs to be considered in the information flow recommendation process; the second determination unit 603 is configured to pass the gating Screen the network and determine the second weight corresponding to each factor in the factor set based on the feature information; the third determination unit 604 is The recommendation unit 605 is configured to determine, based on the first weight and the second weight, the target factor in the factor set that is suitable for the first user in the information flow recommendation scenario; the recommendation unit 605 is configured to determine and push the information flow to the first user based on the target factor. Recommendation results corresponding to the first user in the recommendation scenario.
在本实施例的一些可选的实现方式中,第二确定单元603,进一步被配置成:通过门控筛选网络,根据特征信息确定因子集合中的各因子对应的初始第二权重;通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。In some optional implementations of this embodiment, the second determination unit 603 is further configured to: determine the initial second weight corresponding to each factor in the factor set according to the characteristic information through the gated screening network; The activation function converts the initial second weight corresponding to each factor into 0 or 1 to obtain the second weight corresponding to each factor.
在本实施例的一些可选的实现方式中,第三确定单元604,进一步被配置成:将因子集合中的各因子对应的第一权重和第二权重相乘,得到权重乘积;根据因子集合中的各因子对应的权重乘积,确定因子集合中适用于信息流推荐场景下的第一用户的目标因子。In some optional implementations of this embodiment, the third determination unit 604 is further configured to: multiply the first weight and the second weight corresponding to each factor in the factor set to obtain a weight product; according to the factor set The weight product corresponding to each factor in determines the target factor in the factor set that is suitable for the first user in the information flow recommendation scenario.
在本实施例的一些可选的实现方式中,上述装置还包括:反馈单元(图中未示出),被配置成获取第一用户对于推荐结果的反馈信息;进化单元(图中未示出),被配置成采用进化策略,根据反馈信息调整多因子融合参数网络的参数和门控筛选网络的参数,以通过调整后的多因子融合参数网络和门控筛选网络执行后续用户在信息流推荐场景下的推荐任务。In some optional implementations of this embodiment, the above device further includes: a feedback unit (not shown in the figure) configured to obtain the first user's feedback information on the recommendation results; an evolution unit (not shown in the figure) ), is configured to adopt an evolutionary strategy to adjust the parameters of the multi-factor fusion parameter network and the parameters of the gated screening network based on feedback information, so as to perform subsequent user recommendations in the information flow through the adjusted multi-factor fusion parameter network and gated screening network. Recommended tasks in scenarios.
本实施例中,提供了一种信息流推荐装置,在信息流推荐场景下,通过多因子融合参数网络确定对应于用户的各因子的第一权重,通过门控筛选网络确定各因子的第二权重,以根据第一权重和第二权重准确地确定出适用于用户的目标因子,进行信息流推荐,提高了推荐结果的准确度。In this embodiment, an information flow recommendation device is provided. In the information flow recommendation scenario, the first weight of each factor corresponding to the user is determined through a multi-factor fusion parameter network, and the second weight of each factor is determined through a gated screening network. weight to accurately determine the target factors applicable to the user based on the first weight and the second weight, and perform information flow recommendation, thereby improving the accuracy of the recommendation results.
继续参考图7,作为对上述各图所示方法的实现,本公开提供了一种模型训练装置的一个实施例,该装置实施例与图5所示的方法实施例相对应,该装置具体可以应用于各种电子设备中。Continuing to refer to Figure 7, as an implementation of the methods shown in the above figures, the present disclosure provides an embodiment of a model training device. The device embodiment corresponds to the method embodiment shown in Figure 5. The device can specifically Used in various electronic equipment.
如图7所示,模型训练装置700包括:第二获取单元701,被配置成获取第二用户在信息流推荐场景下的特征信息;第四确定单元702,被配置成通过初始多因子融合参数网络,根据特征信息确定因子集合中的各因子对应的第一权重,其中,因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;第五确定单元703,被配置成通过初始门控筛选网络,根据特征信息确定因子集合中的各因子对应的第二权重;第六确定单元 704,被配置成根据第一权重和第二权重,确定因子集合中适用于信息流推荐场景下的第二用户的目标因子;第七确定单元705,被配置成根据目标因子确定信息流推荐场景下的第二用户对应的推荐结果;训练单元706,被配置成采用进化策略,根据第二用户对于推荐结果的反馈信息,调整初始多因子融合参数网络的参数和初始门控筛选网络的参数,以得到训练后的多因子融合参数网络和门控筛选网络。As shown in Figure 7, the model training device 700 includes: a second acquisition unit 701, configured to acquire the characteristic information of the second user in the information flow recommendation scenario; a fourth determination unit 702, configured to use the initial multi-factor fusion parameters The network determines the first weight corresponding to each factor in the factor set according to the characteristic information, where the factors in the factor set represent the index information that needs to be considered in the information flow recommendation process; the fifth determination unit 703 is configured to pass the initial gate Control the screening network and determine the second weight corresponding to each factor in the factor set based on the feature information; the sixth determination unit 704, configured to determine the target factor in the factor set suitable for the second user in the information flow recommendation scenario based on the first weight and the second weight; the seventh determination unit 705, configured to determine the information flow recommendation scenario based on the target factor The recommendation results corresponding to the second user under; the training unit 706 is configured to adopt an evolutionary strategy and adjust the parameters of the initial multi-factor fusion parameter network and the parameters of the initial gate screening network according to the feedback information of the second user on the recommendation results, To obtain the trained multi-factor fusion parameter network and gated screening network.
在本实施例的一些可选的实现方式中,第五确定单元703,进一步被配置成:通过初始门控筛选网络,根据特征信息确定因子集合中的各因子对应的初始第二权重;通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。In some optional implementations of this embodiment, the fifth determination unit 703 is further configured to: determine the initial second weight corresponding to each factor in the factor set according to the characteristic information through the initial gating screening network; Assume that the activation function converts the initial second weight corresponding to each factor into 0 or 1 to obtain the second weight corresponding to each factor.
在本实施例的一些可选的实现方式中,第六确定单元704,进一步被配置成:将因子集合中的各因子对应的第一权重和第二权重相乘,得到权重乘积;根据因子集合中的各因子对应的权重乘积,确定因子集合中适用于信息流推荐场景下的第二用户的目标因子。In some optional implementations of this embodiment, the sixth determination unit 704 is further configured to: multiply the first weight and the second weight corresponding to each factor in the factor set to obtain a weight product; according to the factor set The weight product corresponding to each factor in determines the target factor in the factor set that is suitable for the second user in the information flow recommendation scenario.
本实施例中,基于各因子对应第一权重和第二权重的权重乘积,确定目标因子,进一步提高了模型训练过程中,目标因子确定过程的便捷性和准确度。In this embodiment, the target factor is determined based on the weight product of the first weight and the second weight corresponding to each factor, which further improves the convenience and accuracy of the target factor determination process during model training.
根据本公开的实施例,本公开还提供了一种电子设备,该电子设备包括:至少一个处理器;以及与至少一个处理器通信连接的存储器;其中,存储器存储有可被至少一个处理器执行的指令,该指令被至少一个处理器执行,以使至少一个处理器执行时能够实现上述任意实施例所描述的信息流推荐方法、模型训练方法。According to an embodiment of the present disclosure, the present disclosure also provides an electronic device, which includes: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores information that can be executed by the at least one processor. The instruction is executed by at least one processor, so that when executed by at least one processor, the information flow recommendation method and the model training method described in any of the above embodiments can be implemented.
根据本公开的实施例,本公开还提供了一种可读存储介质,该可读存储介质存储有计算机指令,该计算机指令用于使计算机执行时能够实现上述任意实施例所描述的信息流推荐方法、模型训练方法。According to an embodiment of the present disclosure, the present disclosure also provides a readable storage medium that stores computer instructions. The computer instructions are used to enable the computer to implement the information flow recommendation described in any of the above embodiments when executed. Methods, model training methods.
本公开实施例提供了一种计算机程序产品,该计算机程序在被处理器执行时能够实现上述任意实施例所描述的信息流推荐方法、模型训练方法。Embodiments of the present disclosure provide a computer program product. When executed by a processor, the computer program can implement the information flow recommendation method and model training method described in any of the above embodiments.
图8示出了可以用来实施本公开的实施例的示例电子设备800的示意 性框图。电子设备旨在表示各种形式的数字计算机,诸如,膝上型计算机、台式计算机、工作台、个人数字助理、服务器、刀片式服务器、大型计算机、和其它适合的计算机。电子设备还可以表示各种形式的移动装置,诸如,个人数字处理、蜂窝电话、智能电话、可穿戴设备和其它类似的计算装置。本文所示的部件、它们的连接和关系、以及它们的功能仅仅作为示例,并且不意在限制本文中描述的和/或者要求的本公开的实现。8 illustrates a schematic of an example electronic device 800 that may be used to implement embodiments of the present disclosure. Sex diagram. Electronic devices are intended to refer to various forms of digital computers, such as laptop computers, desktop computers, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. Electronic devices may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions are examples only and are not intended to limit implementations of the disclosure described and/or claimed herein.
如图8所示,设备800包括计算单元801,其可以根据存储在只读存储器(ROM)802中的计算机程序或者从存储单元808加载到随机访问存储器(RAM)803中的计算机程序,来执行各种适当的动作和处理。在RAM803中,还可存储设备800操作所需的各种程序和数据。计算单元801、ROM 802以及RAM 803通过总线804彼此相连。输入/输出(I/O)接口805也连接至总线804。As shown in FIG. 8 , the device 800 includes a computing unit 801 that can execute according to a computer program stored in a read-only memory (ROM) 802 or loaded from a storage unit 808 into a random access memory (RAM) 803 Various appropriate actions and treatments. In the RAM 803, various programs and data required for the operation of the device 800 can also be stored. Computing unit 801, ROM 802 and RAM 803 are connected to each other via bus 804. An input/output (I/O) interface 805 is also connected to bus 804.
设备800中的多个部件连接至I/O接口805,包括:输入单元806,例如键盘、鼠标等;输出单元807,例如各种类型的显示器、扬声器等;存储单元808,例如磁盘、光盘等;以及通信单元809,例如网卡、调制解调器、无线通信收发机等。通信单元809允许设备800通过诸如因特网的计算机网络和/或各种电信网络与其他设备交换信息/数据。Multiple components in the device 800 are connected to the I/O interface 805, including: an input unit 806, such as a keyboard, a mouse, etc.; an output unit 807, such as various types of displays, speakers, etc.; a storage unit 808, such as a magnetic disk, optical disk, etc. ; and communication unit 809, such as a network card, modem, wireless communication transceiver, etc. The communication unit 809 allows the device 800 to exchange information/data with other devices through computer networks such as the Internet and/or various telecommunications networks.
计算单元801可以是各种具有处理和计算能力的通用和/或专用处理组件。计算单元801的一些示例包括但不限于中央处理单元(CPU)、图形处理单元(GPU)、各种专用的人工智能(AI)计算芯片、各种运行机器学习模型算法的计算单元、数字信号处理器(DSP)、以及任何适当的处理器、控制器、微控制器等。计算单元801执行上文所描述的各个方法和处理,例如信息流推荐方法。例如,在一些实施例中,信息流推荐方法可被实现为计算机软件程序,其被有形地包含于机器可读介质,例如存储单元808。在一些实施例中,计算机程序的部分或者全部可以经由ROM 802和/或通信单元809而被载入和/或安装到设备800上。当计算机程序加载到RAM803并由计算单元801执行时,可以执行上文描述的信息流推荐方法的一个或多个步骤。备选地,在其他实施例中,计算单元801可以通过其他任何适当的方式(例如,借助于固件)而被配置为执行信息流推荐方法。Computing unit 801 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of the computing unit 801 include, but are not limited to, a central processing unit (CPU), a graphics processing unit (GPU), various dedicated artificial intelligence (AI) computing chips, various computing units running machine learning model algorithms, digital signal processing processor (DSP), and any appropriate processor, controller, microcontroller, etc. The computing unit 801 performs various methods and processes described above, such as the information flow recommendation method. For example, in some embodiments, the information flow recommendation method may be implemented as a computer software program that is tangibly embodied in a machine-readable medium, such as the storage unit 808. In some embodiments, part or all of the computer program may be loaded and/or installed onto device 800 via ROM 802 and/or communication unit 809. When the computer program is loaded into the RAM 803 and executed by the computing unit 801, one or more steps of the information flow recommendation method described above may be performed. Alternatively, in other embodiments, the computing unit 801 may be configured to perform the information flow recommendation method in any other suitable manner (eg, by means of firmware).
本文中以上描述的系统和技术的各种实施方式可以在数字电子电路 系统、集成电路系统、场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、芯片上系统的系统(SOC)、负载可编程逻辑设备(CPLD)、计算机硬件、固件、软件、和/或它们的组合中实现。这些各种实施方式可以包括:实施在一个或者多个计算机程序中,该一个或者多个计算机程序可在包括至少一个可编程处理器的可编程系统上执行和/或解释,该可编程处理器可以是专用或者通用可编程处理器,可以从存储系统、至少一个输入装置、和至少一个输出装置接收数据和指令,并且将数据和指令传输至该存储系统、该至少一个输入装置、和该至少一个输出装置。Various implementations of the systems and techniques described above may be implemented in digital electronic circuits System, integrated circuit system, field programmable gate array (FPGA), application specific integrated circuit (ASIC), application specific standard product (ASSP), system on chip (SOC), load programmable logic device (CPLD), computer hardware, Implemented in firmware, software, and/or a combination thereof. These various embodiments may include implementation in one or more computer programs executable and/or interpreted on a programmable system including at least one programmable processor, the programmable processor The processor, which may be a special purpose or general purpose programmable processor, may receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device. An output device.
用于实施本公开的方法的程序代码可以采用一个或多个编程语言的任何组合来编写。这些程序代码可以提供给通用计算机、专用计算机或其他可编程数据处理装置的处理器或控制器,使得程序代码当由处理器或控制器执行时使流程图和/或框图中所规定的功能/操作被实施。程序代码可以完全在机器上执行、部分地在机器上执行,作为独立软件包部分地在机器上执行且部分地在远程机器上执行或完全在远程机器或服务器上执行。Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general-purpose computer, special-purpose computer, or other programmable data processing device, such that the program codes, when executed by the processor or controller, cause the functions specified in the flowcharts and/or block diagrams/ The operation is implemented. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of this disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, laptop disks, hard drives, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
为了提供与用户的交互,可以在计算机上实施此处描述的系统和技术,该计算机具有:用于向用户显示信息的显示装置(例如,CRT(阴极射线管)或者LCD(液晶显示器)监视器);以及键盘和指向装置(例如,鼠标或者轨迹球),用户可以通过该键盘和该指向装置来将输入提供给计算机。其它种类的装置还可以用于提供与用户的交互;例如,提供给用户的反馈可以是任何形式的传感反馈(例如,视觉反馈、听觉反馈、或者触觉 反馈);并且可以用任何形式(包括声输入、语音输入或者、触觉输入)来接收来自用户的输入。To provide interaction with a user, the systems and techniques described herein may be implemented on a computer having a display device (eg, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user ); and a keyboard and pointing device (eg, a mouse or a trackball) through which a user can provide input to the computer. Other kinds of devices may also be used to provide interaction with the user; for example, the feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback). feedback); and input from the user can be received in any form (including acoustic input, speech input, or tactile input).
可以将此处描述的系统和技术实施在包括后台部件的计算系统(例如,作为数据服务器)、或者包括中间件部件的计算系统(例如,应用服务器)、或者包括前端部件的计算系统(例如,具有图形用户界面或者网络浏览器的用户计算机,用户可以通过该图形用户界面或者该网络浏览器来与此处描述的系统和技术的实施方式交互)、或者包括这种后台部件、中间件部件、或者前端部件的任何组合的计算系统中。可以通过任何形式或者介质的数字数据通信(例如,通信网络)来将系统的部件相互连接。通信网络的示例包括:局域网(LAN)、广域网(WAN)和互联网。The systems and techniques described herein may be implemented in a computing system that includes back-end components (e.g., as a data server), or a computing system that includes middleware components (e.g., an application server), or a computing system that includes front-end components (e.g., A user's computer having a graphical user interface or web browser through which the user can interact with implementations of the systems and technologies described herein), or including such backend components, middleware components, or any combination of front-end components in a computing system. The components of the system may be interconnected by any form or medium of digital data communication (eg, a communications network). Examples of communication networks include: local area network (LAN), wide area network (WAN), and the Internet.
计算机系统可以包括客户端和服务器。客户端和服务器一般远离彼此并且通常通过通信网络进行交互。通过在相应的计算机上运行并且彼此具有客户端-服务器关系的计算机程序来产生客户端和服务器的关系。服务器可以是云服务器,又称为云计算服务器或云主机,是云计算服务体系中的一项主机产品,以解决传统物理主机与虚拟专用服务器(VPS,Virtual Private Server)服务中存在的管理难度大,业务扩展性弱的缺陷;也可以为分布式系统的服务器,或者是结合了区块链的服务器。Computer systems may include clients and servers. Clients and servers are generally remote from each other and typically interact over a communications network. The relationship of client and server is created by computer programs running on corresponding computers and having a client-server relationship with each other. The server can be a cloud server, also known as cloud computing server or cloud host. It is a host product in the cloud computing service system to solve the management difficulties existing in traditional physical host and virtual private server (VPS, Virtual Private Server) services. Large, weak business scalability; it can also be a server of a distributed system, or a server combined with a blockchain.
根据本公开实施例的技术方案,提供了一种信息流推荐方法,提供了一种信息流推荐方法,在信息流推荐场景下,通过多因子融合参数网络确定对应于用户的各因子的第一权重,通过门控筛选网络确定各因子的第二权重,以根据第一权重和第二权重准确地确定出适用于用户的目标因子,进行信息流推荐,提高了推荐结果的准确度。According to the technical solution of the embodiment of the present disclosure, an information flow recommendation method is provided. In the information flow recommendation scenario, the first parameter corresponding to each factor of the user is determined through a multi-factor fusion parameter network. The weight determines the second weight of each factor through the gated screening network to accurately determine the target factors suitable for the user based on the first weight and the second weight, and performs information flow recommendation, which improves the accuracy of the recommendation results.
应该理解,可以使用上面所示的各种形式的流程,重新排序、增加或删除步骤。例如,本发公开中记载的各步骤可以并行地执行也可以顺序地执行也可以不同的次序执行,只要能够实现本公开提供的技术方案所期望的结果,本文在此不进行限制。It should be understood that various forms of the process shown above may be used, with steps reordered, added or deleted. For example, each step described in the present disclosure can be executed in parallel, sequentially, or in a different order. As long as the desired results of the technical solution provided by the present disclosure can be achieved, there is no limitation here.
上述具体实施方式,并不构成对本公开保护范围的限制。本领域技术人员应该明白的是,根据设计要求和其他因素,可以进行各种修改、组合、子组合和替代。任何在本公开的精神和原则之内所作的修改、等同替换和改进等,均应包含在本公开保护范围之内。 The above-mentioned specific embodiments do not constitute a limitation on the scope of the present disclosure. It will be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions are possible depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of this disclosure shall be included in the protection scope of this disclosure.

Claims (17)

  1. 一种信息流推荐方法,包括:An information flow recommendation method includes:
    获取第一用户在信息流推荐场景下的特征信息;Obtain the characteristic information of the first user in the information flow recommendation scenario;
    通过多因子融合参数网络,根据所述特征信息确定因子集合中的各因子对应的第一权重,其中,所述因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;Through a multi-factor fusion parameter network, determine the first weight corresponding to each factor in the factor set according to the characteristic information, wherein the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process;
    通过门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的第二权重;Determine the second weight corresponding to each factor in the factor set according to the characteristic information through a gated screening network;
    根据所述第一权重和所述第二权重,确定所述因子集合中适用于所述信息流推荐场景下的所述第一用户的目标因子;According to the first weight and the second weight, determine a target factor in the factor set that is suitable for the first user in the information flow recommendation scenario;
    根据所述目标因子,确定并向所述第一用户推送所述信息流推荐场景下的所述第一用户对应的推荐结果。According to the target factor, a recommendation result corresponding to the first user in the information flow recommendation scenario is determined and pushed to the first user.
  2. 根据权利要求1所述的方法,其中,所述通过门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的第二权重,包括:The method according to claim 1, wherein determining the second weight corresponding to each factor in the factor set according to the characteristic information through the gated screening network includes:
    通过所述门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的初始第二权重;Through the gated screening network, determine the initial second weight corresponding to each factor in the factor set according to the characteristic information;
    通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。The initial second weight corresponding to each factor is converted into 0 or 1 through the preset activation function to obtain the second weight corresponding to each factor.
  3. 根据权利要求1或2所述的方法,其中,所述根据所述第一权重和所述第二权重,确定所述因子集合中适用于所述信息流推荐场景下的所述第一用户的目标因子,包括:The method according to claim 1 or 2, wherein the factors in the factor set that are suitable for the first user in the information flow recommendation scenario are determined based on the first weight and the second weight. Target factors include:
    将所述因子集合中的各因子对应的第一权重和第二权重相乘,得到权重乘积;Multiply the first weight and the second weight corresponding to each factor in the factor set to obtain a weight product;
    根据所述因子集合中的各因子对应的权重乘积,确定所述因子集合中适用于所述信息流推荐场景下的所述第一用户的目标因子。According to the weight product corresponding to each factor in the factor set, a target factor in the factor set suitable for the first user in the information flow recommendation scenario is determined.
  4. 根据权利要求1所述的方法,其中,所述方法还包括: The method of claim 1, further comprising:
    获取所述第一用户对于所述推荐结果的反馈信息;Obtain feedback information from the first user regarding the recommendation result;
    采用进化策略,根据所述反馈信息调整所述多因子融合参数网络的参数和所述门控筛选网络的参数,以通过调整后的多因子融合参数网络和门控筛选网络执行后续用户在所述信息流推荐场景下的推荐任务。Adopting an evolutionary strategy, the parameters of the multi-factor fusion parameter network and the gate control screening network are adjusted according to the feedback information, so that subsequent users can perform the following operations through the adjusted multi-factor fusion parameter network and the gate control screening network. Recommendation tasks in information flow recommendation scenarios.
  5. 一种模型训练方法,包括:A model training method including:
    获取第二用户在信息流推荐场景下的特征信息;Obtain the characteristic information of the second user in the information flow recommendation scenario;
    通过初始多因子融合参数网络,根据所述特征信息确定因子集合中的各因子对应的第一权重,其中,所述因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;Through the initial multi-factor fusion parameter network, determine the first weight corresponding to each factor in the factor set according to the characteristic information, wherein the factors in the factor set represent the indicator information that needs to be considered in the information flow recommendation process;
    通过初始门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的第二权重;Through the initial gating screening network, determine the second weight corresponding to each factor in the factor set according to the characteristic information;
    根据所述第一权重和所述第二权重,确定所述因子集合中适用于所述信息流推荐场景下的所述第二用户的目标因子;According to the first weight and the second weight, determine a target factor in the factor set that is suitable for the second user in the information flow recommendation scenario;
    根据所述目标因子确定所述信息流推荐场景下的所述第二用户对应的推荐结果;Determine the recommendation result corresponding to the second user in the information flow recommendation scenario according to the target factor;
    采用进化策略,根据所述第二用户对于所述推荐结果的反馈信息,调整所述初始多因子融合参数网络的参数和所述初始门控筛选网络的参数,以得到训练后的多因子融合参数网络和门控筛选网络。Using an evolutionary strategy, adjust the parameters of the initial multi-factor fusion parameter network and the parameters of the initial gated screening network according to the feedback information of the second user on the recommendation result to obtain the trained multi-factor fusion parameters. Networks and gated screening networks.
  6. 根据权利要求5所述的方法,其中,所述通过初始门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的第二权重,包括:The method according to claim 5, wherein the determining the second weight corresponding to each factor in the factor set according to the characteristic information through the initial gating screening network includes:
    通过所述初始门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的初始第二权重;Through the initial gating screening network, determine the initial second weight corresponding to each factor in the factor set according to the characteristic information;
    通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。The initial second weight corresponding to each factor is converted into 0 or 1 through the preset activation function to obtain the second weight corresponding to each factor.
  7. 根据权利要求5或6所述的方法,其中,所述根据所述第一权重和所述第二权重,确定所述因子集合中适用于所述信息流推荐场景下的所述第二用户的目标因子,包括: The method according to claim 5 or 6, wherein the factors in the factor set that are suitable for the second user in the information flow recommendation scenario are determined based on the first weight and the second weight. Target factors include:
    将所述因子集合中的各因子对应的第一权重和第二权重相乘,得到权重乘积;Multiply the first weight and the second weight corresponding to each factor in the factor set to obtain a weight product;
    根据所述因子集合中的各因子对应的权重乘积,确定所述因子集合中适用于所述信息流推荐场景下的所述第二用户的目标因子。According to the weight product corresponding to each factor in the factor set, a target factor in the factor set suitable for the second user in the information flow recommendation scenario is determined.
  8. 一种信息流推荐装置,包括:An information flow recommendation device, including:
    第一获取单元,被配置成获取第一用户在信息流推荐场景下的特征信息;The first acquisition unit is configured to acquire the characteristic information of the first user in the information flow recommendation scenario;
    第一确定单元,被配置成通过多因子融合参数网络,根据所述特征信息确定因子集合中的各因子对应的第一权重,其中,所述因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;The first determination unit is configured to determine the first weight corresponding to each factor in the factor set according to the characteristic information through the multi-factor fusion parameter network, wherein the factors in the factor set represent what is required in the information flow recommendation process. Indicator information considered;
    第二确定单元,被配置成通过门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的第二权重;The second determination unit is configured to determine the second weight corresponding to each factor in the factor set according to the characteristic information through the gated screening network;
    第三确定单元,被配置成根据所述第一权重和所述第二权重,确定所述因子集合中适用于所述信息流推荐场景下的所述第一用户的目标因子;A third determination unit configured to determine, according to the first weight and the second weight, a target factor in the factor set that is suitable for the first user in the information flow recommendation scenario;
    推荐单元,被配置成根据所述目标因子,确定并向所述第一用户推送所述信息流推荐场景下的所述第一用户对应的推荐结果。The recommendation unit is configured to determine and push the recommendation result corresponding to the first user in the information flow recommendation scenario to the first user according to the target factor.
  9. 根据权利要求8所述的装置,其中,所述第二确定单元,进一步被配置成:The device according to claim 8, wherein the second determining unit is further configured to:
    通过所述门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的初始第二权重;通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。Through the gated screening network, the initial second weight corresponding to each factor in the factor set is determined according to the characteristic information; the initial second weight corresponding to each factor is converted into 0 or 1 through the preset activation function, and we obtain The second weight corresponding to each factor.
  10. 根据权利要求8或9所述的装置,其中,所述第三确定单元,进一步被配置成:The device according to claim 8 or 9, wherein the third determining unit is further configured to:
    将所述因子集合中的各因子对应的第一权重和第二权重相乘,得到权重乘积;根据所述因子集合中的各因子对应的权重乘积,确定所述因子集合中适用于所述信息流推荐场景下的所述第一用户的目标因子。Multiply the first weight and the second weight corresponding to each factor in the factor set to obtain a weight product; determine the information in the factor set that is suitable for the information based on the weight product corresponding to each factor in the factor set. The target factor of the first user in the flow recommendation scenario.
  11. 根据权利要求8所述的装置,其中,还包括: The device of claim 8, further comprising:
    反馈单元,被配置成获取所述第一用户对于所述推荐结果的反馈信息;A feedback unit configured to obtain feedback information from the first user on the recommendation result;
    进化单元,被配置成采用进化策略,根据所述反馈信息调整所述多因子融合参数网络的参数和所述门控筛选网络的参数,以通过调整后的多因子融合参数网络和门控筛选网络执行后续用户在所述信息流推荐场景下的推荐任务。An evolution unit configured to adopt an evolution strategy and adjust parameters of the multi-factor fusion parameter network and parameters of the gated screening network according to the feedback information to pass the adjusted multi-factor fusion parameter network and gated screening network. Perform subsequent user recommendation tasks in the information flow recommendation scenario.
  12. 一种模型训练装置,包括:A model training device including:
    第二获取单元,被配置成获取第二用户在信息流推荐场景下的特征信息;The second acquisition unit is configured to acquire the characteristic information of the second user in the information flow recommendation scenario;
    第四确定单元,被配置成通过初始多因子融合参数网络,根据所述特征信息确定因子集合中的各因子对应的第一权重,其中,所述因子集合中的因子表征信息流推荐过程中所需考虑的指标信息;The fourth determination unit is configured to determine the first weight corresponding to each factor in the factor set according to the characteristic information through the initial multi-factor fusion parameter network, wherein the factors in the factor set represent the factors used in the information flow recommendation process. Indicator information to be considered;
    第五确定单元,被配置成通过初始门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的第二权重;The fifth determination unit is configured to determine the second weight corresponding to each factor in the factor set according to the characteristic information through the initial gating screening network;
    第六确定单元,被配置成根据所述第一权重和所述第二权重,确定所述因子集合中适用于所述信息流推荐场景下的所述第二用户的目标因子;A sixth determination unit configured to determine, according to the first weight and the second weight, a target factor in the factor set that is suitable for the second user in the information flow recommendation scenario;
    第七确定单元,被配置成根据所述目标因子确定所述信息流推荐场景下的所述第二用户对应的推荐结果;A seventh determination unit configured to determine the recommendation result corresponding to the second user in the information flow recommendation scenario according to the target factor;
    训练单元,被配置成采用进化策略,根据所述第二用户对于所述推荐结果的反馈信息,调整所述初始多因子融合参数网络的参数和所述初始门控筛选网络的参数,以得到训练后的多因子融合参数网络和门控筛选网络。The training unit is configured to adopt an evolutionary strategy and adjust the parameters of the initial multi-factor fusion parameter network and the parameters of the initial gated screening network according to the feedback information of the second user on the recommendation results to obtain training The final multi-factor fusion parameter network and gated screening network.
  13. 根据权利要求12所述的装置,其中,所述第五确定单元,进一步被配置成:The device according to claim 12, wherein the fifth determining unit is further configured to:
    通过所述初始门控筛选网络,根据所述特征信息确定所述因子集合中的各因子对应的初始第二权重;通过预设激活函数将各因子对应的初始第二权重转化为0或者1,得到各因子对应的第二权重。Through the initial gated screening network, the initial second weight corresponding to each factor in the factor set is determined according to the characteristic information; the initial second weight corresponding to each factor is converted into 0 or 1 through the preset activation function, Get the second weight corresponding to each factor.
  14. 根据权利要求12或13所述的装置,其中,所述第六确定单元,进一步被配置成:The device according to claim 12 or 13, wherein the sixth determining unit is further configured to:
    将所述因子集合中的各因子对应的第一权重和第二权重相乘,得到权重 乘积;根据所述因子集合中的各因子对应的权重乘积,确定所述因子集合中适用于所述信息流推荐场景下的所述第二用户的目标因子。Multiply the first weight and the second weight corresponding to each factor in the factor set to obtain the weight Product; determine the target factor in the factor set that is suitable for the second user in the information flow recommendation scenario according to the weight product corresponding to each factor in the factor set.
  15. 一种电子设备,其特征在于,包括:An electronic device, characterized by including:
    至少一个处理器;以及at least one processor; and
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively connected to the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行权利要求1-7中任一项所述的方法。The memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to enable the at least one processor to perform any one of claims 1-7 Methods.
  16. 一种存储有计算机指令的非瞬时计算机可读存储介质,其特征在于,所述计算机指令用于使所述计算机执行权利要求1-7中任一项所述的方法。A non-transitory computer-readable storage medium storing computer instructions, characterized in that the computer instructions are used to cause the computer to execute the method described in any one of claims 1-7.
  17. 一种计算机程序产品,包括:计算机程序,所述计算机程序在被处理器执行时实现根据权利要求1-7中任一项所述的方法。 A computer program product comprising: a computer program which, when executed by a processor, implements the method according to any one of claims 1-7.
PCT/CN2023/080416 2022-07-20 2023-03-09 Information flow recommendation method and apparatus and computer program product WO2024016680A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210857765.1A CN115203564A (en) 2022-07-20 2022-07-20 Information flow recommendation method and device and computer program product
CN202210857765.1 2022-07-20

Publications (1)

Publication Number Publication Date
WO2024016680A1 true WO2024016680A1 (en) 2024-01-25

Family

ID=83582969

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/080416 WO2024016680A1 (en) 2022-07-20 2023-03-09 Information flow recommendation method and apparatus and computer program product

Country Status (2)

Country Link
CN (1) CN115203564A (en)
WO (1) WO2024016680A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115203564A (en) * 2022-07-20 2022-10-18 百度在线网络技术(北京)有限公司 Information flow recommendation method and device and computer program product

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111918136A (en) * 2020-07-04 2020-11-10 中信银行股份有限公司 Interest analysis method and device, storage medium and electronic equipment
CN113836406A (en) * 2021-09-10 2021-12-24 北京小米移动软件有限公司 Information flow recommendation method and device
CN114265979A (en) * 2021-12-17 2022-04-01 北京百度网讯科技有限公司 Method for determining fusion parameters, information recommendation method and model training method
CN114463091A (en) * 2022-01-29 2022-05-10 北京沃东天骏信息技术有限公司 Information push model training and information push method, device, equipment and medium
US20220215032A1 (en) * 2020-02-13 2022-07-07 Tencent Technology (Shenzhen) Company Limited Ai-based recommendation method and apparatus, electronic device, and storage medium
CN115203564A (en) * 2022-07-20 2022-10-18 百度在线网络技术(北京)有限公司 Information flow recommendation method and device and computer program product

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220215032A1 (en) * 2020-02-13 2022-07-07 Tencent Technology (Shenzhen) Company Limited Ai-based recommendation method and apparatus, electronic device, and storage medium
CN111918136A (en) * 2020-07-04 2020-11-10 中信银行股份有限公司 Interest analysis method and device, storage medium and electronic equipment
CN113836406A (en) * 2021-09-10 2021-12-24 北京小米移动软件有限公司 Information flow recommendation method and device
CN114265979A (en) * 2021-12-17 2022-04-01 北京百度网讯科技有限公司 Method for determining fusion parameters, information recommendation method and model training method
CN114463091A (en) * 2022-01-29 2022-05-10 北京沃东天骏信息技术有限公司 Information push model training and information push method, device, equipment and medium
CN115203564A (en) * 2022-07-20 2022-10-18 百度在线网络技术(北京)有限公司 Information flow recommendation method and device and computer program product

Also Published As

Publication number Publication date
CN115203564A (en) 2022-10-18

Similar Documents

Publication Publication Date Title
JP7331171B2 (en) Methods and apparatus for training image recognition models, methods and apparatus for recognizing images, electronic devices, storage media, and computer programs
WO2020094060A1 (en) Recommendation method and apparatus
US20220374776A1 (en) Method and system for federated learning, electronic device, and computer readable medium
CN114549874B (en) Training method of multi-target image-text matching model, image-text retrieval method and device
WO2023109059A1 (en) Method for determining fusion parameter, information recommendation method, and model training method
CN113301442B (en) Method, device, medium, and program product for determining live broadcast resource
CN110909165A (en) Data processing method, device, medium and electronic equipment
CN114329201A (en) Deep learning model training method, content recommendation method and device
US20230083116A1 (en) Federated learning method and system, electronic device, and storage medium
WO2023206771A1 (en) Environment modeling method and apparatus based on decision flow graph, and electronic device
US20210157819A1 (en) Determining a collection of data visualizations
WO2024016680A1 (en) Information flow recommendation method and apparatus and computer program product
WO2023207028A1 (en) Image retrieval method and apparatus, and computer program product
CN114065864A (en) Federal learning method, federal learning device, electronic device, and storage medium
CN112528159B (en) Feature quality assessment method and device, electronic equipment and storage medium
WO2024040869A1 (en) Multi-task model training method, information recommendation method, apparatus, and device
WO2023185125A1 (en) Product resource data processing method and apparatus, electronic device and storage medium
US20220391780A1 (en) Method of federated learning, electronic device, and storage medium
WO2023093015A1 (en) Data screening method and apparatus, device, and storage medium
US20220374775A1 (en) Method for multi-task scheduling, device and storage medium
CN114511022B (en) Feature screening, behavior recognition model training and abnormal behavior recognition method and device
CN115185606A (en) Method, device, equipment and storage medium for obtaining service configuration parameters
CN113792876A (en) Backbone network generation method, device, equipment and storage medium
CN113806541A (en) Emotion classification method and emotion classification model training method and device
CN113010782A (en) Demand amount acquisition method and device, electronic equipment and computer readable medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23841758

Country of ref document: EP

Kind code of ref document: A1