CN115391663A - Video recommendation method and device, computer equipment and storage medium - Google Patents

Video recommendation method and device, computer equipment and storage medium Download PDF

Info

Publication number
CN115391663A
CN115391663A CN202211160276.7A CN202211160276A CN115391663A CN 115391663 A CN115391663 A CN 115391663A CN 202211160276 A CN202211160276 A CN 202211160276A CN 115391663 A CN115391663 A CN 115391663A
Authority
CN
China
Prior art keywords
video
vector
recommended
sequences
vectors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211160276.7A
Other languages
Chinese (zh)
Inventor
段勇
郑聪
姚倩媛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing QIYI Century Science and Technology Co Ltd
Original Assignee
Beijing QIYI Century Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing QIYI Century Science and Technology Co Ltd filed Critical Beijing QIYI Century Science and Technology Co Ltd
Priority to CN202211160276.7A priority Critical patent/CN115391663A/en
Publication of CN115391663A publication Critical patent/CN115391663A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • G06F16/735Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7847Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Library & Information Science (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本申请涉及一种视频推荐方法、装置、计算机设备和存储介质。所述方法包括:获取目标用户标识对应的视频观看集合,视频观看集合包括至少两个视频时长不同的视频观看序列,基于各个视频观看序列分别提取出相应的特征向量,并将各个特征向量融合形成融合向量,根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端,由于上述方法不仅从不同视频观看序列中提取特征向量捕捉用户的喜好,还结合了不同视频观看序列之间的联系,从而实现充分准确地捕捉用户对于视频内容的兴趣偏好,优化了视频内容的推送结果。

Figure 202211160276

The present application relates to a video recommendation method, device, computer equipment and storage medium. The method includes: acquiring a video viewing set corresponding to the target user identifier, the video viewing set includes at least two video viewing sequences with different video durations, extracting corresponding feature vectors based on each video viewing sequence, and fusing each feature vector to form Fusion vectors, according to the obtained inner product results of each video vector to be recommended and the fusion vector, determine the recommended value of each video vector to be recommended, and push the corresponding video vector corresponding to the video vector to be recommended according to the descending order of the recommended value The video data is sent to the terminal corresponding to the target user identification. Since the above method not only extracts feature vectors from different video viewing sequences to capture the user's preferences, but also combines the links between different video viewing sequences, so as to fully and accurately capture the user's interest in the video. The interest preference of content optimizes the push results of video content.

Figure 202211160276

Description

视频推荐方法、装置、计算机设备和存储介质Video recommendation method, device, computer equipment and storage medium

技术领域technical field

本申请涉及计算机技术领域,尤其涉及一种视频推荐方法、装置、计算机设备和存储介质。The present application relates to the field of computer technology, and in particular to a video recommendation method, device, computer equipment and storage medium.

背景技术Background technique

随着视频软件的发展,用户可通过视频软件根据喜好观看长视频或短视频,视频软件会根据用户历史观看行为捕捉用户对于视频的喜好类型,从而推送用户可能感兴趣的视频内容,但由于用户观看序列分为长视频观看序列、中视频观看序列和短视频观看序列,而目前已有的视频推送方式都是对于不同类别的观看序列分别建模分析用户可能感兴趣的视频内容,忽略了不同类别观看序列之间的联系,因此无法充分准确地捕捉用户兴趣偏好,导致推送结果欠优。With the development of video software, users can watch long videos or short videos according to their preferences through video software. The video software will capture the user's preferences for videos based on the user's historical viewing behavior, and thus push the video content that the user may be interested in. Viewing sequences are divided into long video viewing sequences, medium video viewing sequences, and short video viewing sequences. Currently, existing video push methods model and analyze video content that users may be interested in for different types of viewing sequences, ignoring different Therefore, it cannot fully and accurately capture user interest preferences, resulting in suboptimal push results.

发明内容Contents of the invention

为了解决上述技术问题,本申请提供了一种视频推荐方法、装置、计算机设备和存储介质。In order to solve the above technical problems, the present application provides a video recommendation method, device, computer equipment and storage medium.

第一方面,本申请提供了一种视频推荐方法,包括:In the first aspect, the present application provides a video recommendation method, including:

获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列;Obtain a video viewing set corresponding to the target user identifier, wherein the video viewing set includes at least two video viewing sequences with different video durations;

基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量;Extracting corresponding feature vectors based on each of the video viewing sequences, and fusing each of the feature vectors to form a fusion vector;

根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息;According to the obtained inner product result of each video vector to be recommended and the fusion vector, the recommendation value of each video vector to be recommended is determined, wherein the video vector to be recommended is used to indicate the attribute information of the video to be recommended;

依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。Pushing the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommendation value.

第二方面,本申请提供了一种视频推荐装置,包括:In a second aspect, the present application provides a video recommendation device, including:

获取模块,用于获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列;An acquisition module, configured to acquire a video viewing set corresponding to the target user identifier, wherein the video viewing set includes at least two video viewing sequences with different video durations;

融合模块,用于基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量;A fusion module, configured to extract corresponding feature vectors based on each of the video viewing sequences, and fuse each of the feature vectors to form a fusion vector;

确定模块,用于根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息;A determination module, configured to determine the recommended value of each video vector to be recommended according to the acquired inner product result of each video vector to be recommended and the fusion vector, wherein the video vector to be recommended is used to indicate the video to be recommended attribute information;

推送模块,用于依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。The push module is configured to push the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommendation value.

第三方面,本申请提供了一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现以下步骤:In a third aspect, the present application provides a computer device, including a memory, a processor, and a computer program stored on the memory and operable on the processor. When the processor executes the computer program, the following steps are implemented:

获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列;Obtain a video viewing set corresponding to the target user identifier, wherein the video viewing set includes at least two video viewing sequences with different video durations;

基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量;Extracting corresponding feature vectors based on each of the video viewing sequences, and fusing each of the feature vectors to form a fusion vector;

根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息;According to the obtained inner product result of each video vector to be recommended and the fusion vector, the recommendation value of each video vector to be recommended is determined, wherein the video vector to be recommended is used to indicate the attribute information of the video to be recommended;

依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。Pushing the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommendation value.

第四方面,本申请提供了一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现以下步骤:In a fourth aspect, the present application provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the following steps are implemented:

获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列;Obtain a video viewing set corresponding to the target user identifier, wherein the video viewing set includes at least two video viewing sequences with different video durations;

基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量;Extracting corresponding feature vectors based on each of the video viewing sequences, and fusing each of the feature vectors to form a fusion vector;

根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息;According to the obtained inner product result of each video vector to be recommended and the fusion vector, the recommendation value of each video vector to be recommended is determined, wherein the video vector to be recommended is used to indicate the attribute information of the video to be recommended;

依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。Pushing the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommendation value.

基于上述视频推荐方法,获取目标用户标识对应的视频观看集合,视频观看集合包括至少两个视频时长不同的视频观看序列,基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量,即从不同视频观看序列中挖掘用户喜好并建立不同视频观看序列之间的联系,根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息,利用融合有不同视频观看序列相应信息的融合向量来预测,目标用户标识相应用户对各个待推荐视频向量相应视频内容的喜好程度,即得到各个待推荐视频向量的推荐值,依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端,由于上述方法不仅从不同视频观看序列中提取特征向量捕捉用户的喜好,还结合了不同视频观看序列之间的联系,从而可实现充分准确地捕捉用户对于视频内容的兴趣偏好,优化了视频内容的推送结果。Based on the above video recommendation method, the video viewing set corresponding to the target user identifier is obtained, the video viewing set includes at least two video viewing sequences with different video durations, the corresponding feature vectors are respectively extracted based on each of the video viewing sequences, and each of the video viewing sequences is extracted The above feature vectors are fused to form a fusion vector, that is, user preferences are mined from different video viewing sequences and connections between different video viewing sequences are established. According to the obtained inner product results of each video vector to be recommended and the fusion vector, each The recommendation value of the video vector to be recommended, wherein, the video vector to be recommended is used to indicate the attribute information of the video to be recommended, and is predicted by using a fusion vector fused with corresponding information of different video viewing sequences, and the target user identifies the corresponding user for each The preference degree of the video content corresponding to the video vector to be recommended, that is, the recommendation value of each video vector to be recommended is obtained, and the video data corresponding to the video vector to be recommended is pushed to the terminal corresponding to the target user identification according to the descending order of the recommended value. The above method not only extracts feature vectors from different video viewing sequences to capture user preferences, but also combines the connections between different video viewing sequences, so as to fully and accurately capture user preferences for video content and optimize the push of video content result.

附图说明Description of drawings

此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本发明的实施例,并与说明书一起用于解释本发明的原理。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description serve to explain the principles of the invention.

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,对于本领域普通技术人员而言,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, for those of ordinary skill in the art, In other words, other drawings can also be obtained from these drawings without paying creative labor.

图1为一个实施例中视频推荐方法的应用环境图;Fig. 1 is an application environment diagram of a video recommendation method in an embodiment;

图2为一个实施例中视频推荐方法的流程示意图;Fig. 2 is a schematic flow chart of a video recommendation method in an embodiment;

图3为一个实施例中视频推荐装置的结构框图;Fig. 3 is a structural block diagram of a video recommendation device in an embodiment;

图4为一个实施例中计算机设备的内部结构图。Figure 4 is an internal block diagram of a computer device in one embodiment.

具体实施方式Detailed ways

为使本申请实施例的目的、技术方案和优点更加清楚,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请的一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动的前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, but not all of them. Based on the embodiments in the present application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present application.

图1为一个实施例中视频推荐方法的应用环境图。参照图1,该视频推荐方法应用于视频推荐系统。该视频推荐系统包括终端110和服务器120。终端110和服务器120通过网络连接,终端110内安装有视频播放软件,用户可通过用户标识登录视频播放软件以观看视频内容。终端110具体可以是台式终端或移动终端,移动终端具体可以为手机、平板电脑、笔记本电脑等中的至少一种。服务器120可以用独立的服务器或者是多个服务器组成的服务器集群来实现。Fig. 1 is an application environment diagram of a video recommendation method in an embodiment. Referring to FIG. 1 , the video recommendation method is applied to a video recommendation system. The video recommendation system includes a terminal 110 and a server 120 . The terminal 110 and the server 120 are connected through the network, and the video playing software is installed in the terminal 110, and the user can log in the video playing software through the user ID to watch the video content. The terminal 110 may specifically be a desktop terminal or a mobile terminal, and the mobile terminal may specifically be at least one of a mobile phone, a tablet computer, a notebook computer, and the like. The server 120 can be implemented by an independent server or a server cluster composed of multiple servers.

在一个实施例中,图2为一个实施例中一种视频推荐方法的流程示意图,参照图2,提供了一种视频推荐方法。本实施例主要以该方法应用于上述图1中的服务器120来举例说明,该视频推荐方法具体包括如下步骤:In an embodiment, FIG. 2 is a schematic flowchart of a video recommendation method in an embodiment. Referring to FIG. 2 , a video recommendation method is provided. This embodiment is mainly illustrated by taking the method applied to the server 120 in the above-mentioned FIG. 1 as an example. The video recommendation method specifically includes the following steps:

步骤S210,获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列。Step S210, acquiring a video viewing set corresponding to the target user identifier, wherein the video viewing set includes at least two video viewing sequences with different video durations.

具体的,目标用户标识为任意一个用于登录视频播放软件的用户标识,用户标识具体包括用户的身份信息以及自定义字符,身份信息包括电话号码、邮箱或第三方应用账号等,自定义字符可以为任意数字、字母、符号等中一种或多种组合而成。不同的用户标识用于指示不同的用户,不同的用户对于不同视频内容的喜好也不同,因此不同用户标识所对应的视频观看集合也不同,视频观看集合包括至少两个视频时长不同的视频观看序列,依照视频时长将视频观看序列分为长视频观看序列、中视频观看序列、短视频观看序列,通常将视频时长超过20分钟的视频定义为长视频,将视频时长小于1分钟的视频定义为短视频,将视频时长位于1~20分钟之间的视频定义为中视频,对于长视频、中视频、短视频的视频时长定义还可进行自定义设定。在本实施例中视频观看集合包括长视频观看序列和短视频观看序列。Specifically, the target user ID is any user ID used to log in to the video playback software. The user ID specifically includes the user's identity information and custom characters. The identity information includes phone numbers, email addresses, or third-party application accounts. The custom characters can be It is composed of one or more combinations of any number, letter, symbol, etc. Different user IDs are used to indicate different users, and different users have different preferences for different video content, so the video viewing sets corresponding to different user IDs are also different, and the video viewing sets include at least two video viewing sequences with different video durations According to the length of the video, the video viewing sequence is divided into long video viewing sequence, medium video viewing sequence, and short video viewing sequence. Usually, a video with a video duration of more than 20 minutes is defined as a long video, and a video with a video duration of less than 1 minute is defined as a short video. For videos, videos with a video duration between 1 and 20 minutes are defined as medium videos, and the definition of video duration for long videos, medium videos, and short videos can also be customized. In this embodiment, the video viewing set includes a long video viewing sequence and a short video viewing sequence.

不同视频时长的视频观看序列包含相应视频时长对应的视频信息,即长视频观看序列包括多个长视频对应的视频描述向量,中视频观看序列包括多个中视频对应的视频描述向量,短视频观看序列包括多个短视频对应的视频描述向量。视频描述向量包含视频的属性信息,属性信息包括视频ID(Identity document,标识)、视频专辑ID、剧名、类型、频道ID、独播标签、参演演员、导演、演员作品数量、导演作品数量、编剧、该剧所有的获奖名称、主角名称、影片等级、影片限制等级及描述、全球发行时间、中国发行时间、页面首次上线时间等等,即属性信息包含视频多种维度的描述信息。Video viewing sequences of different video lengths contain video information corresponding to corresponding video durations, that is, long video viewing sequences include multiple video description vectors corresponding to long videos, medium video viewing sequences include multiple video description vectors corresponding to medium videos, and short video viewing sequences include multiple video description vectors corresponding to medium videos. A sequence includes video description vectors corresponding to multiple short videos. The video description vector contains the attribute information of the video, and the attribute information includes video ID (Identity document, logo), video album ID, drama name, type, channel ID, solo label, participating actors, director, number of actor works, number of director works , screenwriter, all award-winning names of the play, protagonist name, movie rating, movie restriction rating and description, global release time, China release time, page first online time, etc., that is, the attribute information includes description information of multiple dimensions of the video.

步骤S220,基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量。Step S220, extract corresponding feature vectors based on each of the video viewing sequences, and fuse each of the feature vectors to form a fusion vector.

具体的,基于每个视频观看序列提取出一个特征向量,特征向量用于指示用户对于视频喜好的向量,再将各个视频观看序列对应的特征向量进行融合处理形成融合向量,即从不同视频观看序列中挖掘用户喜好信息并建立不同视频观看序列之间的联系,从而对不同视频时长的视频观看序列进行了充分的信息交互。Specifically, a feature vector is extracted based on each video viewing sequence, and the feature vector is used to indicate the user's preference vector for the video, and then the feature vectors corresponding to each video viewing sequence are fused to form a fusion vector, that is, from different video viewing sequences In this method, user preference information is mined and connections between different video viewing sequences are established, so that the video viewing sequences of different video durations are fully interacted with information.

步骤S230,根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息。Step S230: Determine the recommended value of each video vector to be recommended according to the obtained inner product result of each video vector to be recommended and the fusion vector, wherein the video vector to be recommended is used to indicate the attribute of the video to be recommended information.

具体的,待推荐视频向量为待推荐视频的描述向量,待推荐视频为视频数据库中相对于目标用户标识观看状态为未观看的视频,视频数据库包含不同视频时长的视频,即视频数据库包括多个长视频、中视频、短视频。基于待推荐视频向量与融合向量的内积结果,即将待推荐视频向量与融合向量作点积处理,将待推荐视频的属性信息与用户兴趣有效结合,从而得到各个待推荐视频向量的推荐值,依照各个待推荐视频向量的推荐值进行降序排列以得到粗排后的视频推荐列表。Specifically, the video vector to be recommended is the description vector of the video to be recommended, and the video to be recommended is a video whose viewing status is unwatched relative to the target user identification in the video database. The video database contains videos with different video durations, that is, the video database includes multiple Long video, medium video, short video. Based on the inner product result of the video vector to be recommended and the fusion vector, the video vector to be recommended and the fusion vector are processed by dot product, and the attribute information of the video to be recommended is effectively combined with the user's interest, so as to obtain the recommendation value of each video vector to be recommended. Arrange in descending order according to the recommendation value of each video vector to be recommended to obtain a roughly sorted video recommendation list.

步骤S240,依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。Step S240, push the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommended value.

具体的,推荐值越高,表示相应待推荐视频越可能满足用户的视频观看喜好,推荐值越低,表示用户对于相应待推荐视频的喜好程度越低,可以选择视频推荐列表中前N个待推荐视频向量所对应的视频数据推送至目标用户标识对应的终端,N为大于零的正整数,且N小于或等于视频推荐列表中待推荐视频向量的数量。Specifically, the higher the recommendation value, it means that the corresponding video to be recommended is more likely to meet the user's video viewing preferences, and the lower the recommendation value, it means that the user's preference for the corresponding video to be recommended is lower. The video data corresponding to the recommended video vector is pushed to the terminal corresponding to the target user identifier, N is a positive integer greater than zero, and N is less than or equal to the number of video vectors to be recommended in the video recommendation list.

由于上述步骤不仅从不同视频观看序列中提取特征向量捕捉用户的喜好,还结合了不同视频观看序列之间的联系,从而可实现充分准确地捕捉用户对于视频内容的兴趣偏好,提升了待推荐视频的粗排效果,即优化了视频内容的推送结果。Since the above steps not only extract feature vectors from different video viewing sequences to capture the user's preferences, but also combine the connections between different video viewing sequences, it is possible to fully and accurately capture the user's interest preferences for video content, and improve the quality of videos to be recommended. The rough layout effect, that is, the push result of the video content is optimized.

在一个实施例中,所述获取目标用户标识对应的视频观看集合,包括:In one embodiment, the acquisition of the video viewing set corresponding to the target user identifier includes:

获取所述目标用户标识对应的历史视频集合,其中,所述历史视频集合包括至少两个视频时长不同的历史视频序列,所述历史视频序列包括多个视频属性向量;Obtaining a historical video collection corresponding to the target user identifier, wherein the historical video collection includes at least two historical video sequences with different video durations, and the historical video sequences include a plurality of video attribute vectors;

对各个所述历史视频序列分别进行独热编码处理,得到相应的视频编码序列,其中,所述视频编码序列包括多个所述视频属性向量对应的独热码向量;Performing one-hot encoding processing on each of the historical video sequences to obtain a corresponding video encoding sequence, wherein the video encoding sequence includes a plurality of one-hot encoding vectors corresponding to the video attribute vectors;

将各个所述视频编码序列进行降维处理,得到相应的所述视频观看序列,其中,所述视频观看序列包括多个视频描述向量。Perform dimensionality reduction processing on each of the video coding sequences to obtain the corresponding video viewing sequence, wherein the video viewing sequence includes a plurality of video description vectors.

具体的,历史视频集合中包括不同视频时长的历史视频序列,历史视频序列按照视频时长分为历史长视频序列、历史中视频序列、历史短视频序列,在本实施例中历史视频集合包括历史长视频序列和历史短视频序列,将历史长视频序列记为L=(l1,l2,…,ln),其中,l1~ln为历史长视频序列中的n个视频属性向量,历史长视频序列中的每个视频属性向量用于指示一个长视频的属性信息,历史短视频序列记为S=(s1,s2,…,sn),s1~sn为历史短视频序列中的n个视频属性向量,历史短视频序列中的每个视频属性向量用于指示一个短视频的属性信息。Specifically, the historical video collection includes historical video sequences of different video durations. The historical video sequences are divided into historical long video sequences, historical medium video sequences, and historical short video sequences according to the video duration. In this embodiment, the historical video collection includes historical long video sequences. Video sequence and historical short video sequence, the historical long video sequence is recorded as L=(l1, l2, ..., ln), wherein, l1~ln are n video attribute vectors in the historical long video sequence, in the historical long video sequence Each video attribute vector of is used to indicate the attribute information of a long video, and the historical short video sequence is recorded as S=(s1, s2, ..., sn), and s1~sn are n video attribute vectors in the historical short video sequence, Each video attribute vector in the historical short video sequence is used to indicate the attribute information of a short video.

对历史长视频序列和历史短视频序列分别进行独热编码(one-hot)处理,即将视频属性向量编码转换为独热码向量,从而得到相应的视频编码序列,历史长视频序列经过独热编码得到长视频编码序列,历史短视频序列经过独热编码得到短视频编码序列。The historical long video sequence and the historical short video sequence are respectively subjected to one-hot encoding (one-hot) processing, that is, the encoding of the video attribute vector is converted into a one-hot code vector, so as to obtain the corresponding video encoding sequence, and the historical long video sequence is subjected to one-hot encoding. The long video encoding sequence is obtained, and the short video encoding sequence is obtained through one-hot encoding of the historical short video sequence.

再将各个视频编码序列进行降维处理,得到降维后的视频观看序列,即步骤S210中的视频观看集合是由历史视频集合经过编码降维处理而得,即长视频编码序列经过降维处理得到长视频观看序列,短视频编码序列经过降维处理得到短视频观看序列。Then each video coding sequence is subjected to dimensionality reduction processing to obtain the video viewing sequence after dimensionality reduction, that is, the video viewing set in step S210 is obtained by the historical video collection through coding dimensionality reduction processing, that is, the long video coding sequence is subjected to dimensionality reduction processing A long video viewing sequence is obtained, and a short video coding sequence is subjected to dimensionality reduction processing to obtain a short video viewing sequence.

在一个实施例中,所述视频观看序列包括第一降维序列和第二降维序列,所述将各个所述视频编码序列进行降维处理,得到相应的所述视频观看序列,包括以下至少之一:In one embodiment, the video viewing sequence includes a first dimensionality reduction sequence and a second dimensionality reduction sequence, and performing dimensionality reduction processing on each of the video encoding sequences to obtain a corresponding video viewing sequence, including at least the following one:

将各个所述视频编码序列分别与第一映射矩阵相乘,得到相应的所述第一降维序列,其中,所述第一映射矩阵包含视频属性对应的矩阵参数;Multiplying each of the video encoding sequences by the first mapping matrix respectively to obtain the corresponding first dimensionality reduction sequence, wherein the first mapping matrix includes matrix parameters corresponding to video attributes;

将各个所述视频编码序列分别与第二映射矩阵相乘,得到相应的所述第二降维序列,其中,所述第二映射矩阵包含偏好属性对应的矩阵参数。Multiplying each of the video coding sequences by the second mapping matrix respectively to obtain the corresponding second dimensionality reduction sequence, wherein the second mapping matrix includes matrix parameters corresponding to the preference attributes.

具体的,视频观看序列包括第一降维序列和第二降维序列,表示对同一视频编码序列采用两种降维方式得到的不同降维序列,若采用第一种降维方式,则将长视频编码序列和短视频编码序列分别与第一映射矩阵相乘,以实现权重嵌入映射(embedding),得到相应的第一降维序列,即长视频编码序列与第一映射矩阵相乘,得到长视频第一降维序列,短视频编码序列与第一映射矩阵相乘,得到短视频第一降维序列。将长视频第一降维序列记为C=(c1,c2,…,cn),其中,c1~cn为长视频第一降维序列中的n个长视频的视频描述向量,短视频第一降维序列记为I=(i1,i2,…,in),其中,i1~in为短视频第一降维序列中的n个短视频的视频描述向量,每个视频描述向量不但包含相应视频的属性信息还包含该视频与同序列中其他视频之间的关联关系。Specifically, the video viewing sequence includes a first dimensionality reduction sequence and a second dimensionality reduction sequence, which represent different dimensionality reduction sequences obtained by using two dimensionality reduction methods for the same video coding sequence. If the first dimensionality reduction method is adopted, the length The video coding sequence and the short video coding sequence are respectively multiplied by the first mapping matrix to realize the weight embedding mapping (embedding), and the corresponding first dimensionality reduction sequence is obtained, that is, the long video coding sequence is multiplied by the first mapping matrix to obtain the long In the first dimensionality reduction sequence of the video, the coding sequence of the short video is multiplied by the first mapping matrix to obtain the first dimensionality reduction sequence of the short video. The first dimensionality reduction sequence of the long video is recorded as C=(c1, c2, ..., cn), where c1~cn are the video description vectors of n long videos in the first dimensionality reduction sequence of the long video, and the short video first The dimensionality reduction sequence is denoted as I=(i1, i2,...,in), wherein, i1~in are the video description vectors of n short videos in the first dimensionality reduction sequence of the short video, and each video description vector not only contains the corresponding video The attribute information of also includes the association relationship between the video and other videos in the same sequence.

若采用第二种降维方式,则将长视频编码序列和短视频编码序列分别与第二映射矩阵相乘,以实现权重嵌入映射(embedding),得到相应的第二降维序列,即将长视频编码序列与第二映射矩阵相乘,得到长视频第二降维矩阵,将短视频编码序列与第二映射矩阵相乘,得到短视频第二降维矩阵。第一映射矩阵和第二映射矩阵可以包含相同和/或不同的矩阵参数,第一映射矩阵中多为视频属性相关的矩阵参数,例如,视频类型、视频ID、频道ID等视频属性特征,第二映射矩阵中多为偏好属性相关的矩阵参数,例如,剧名、参演演员、导演、主角名称等用户可能感兴趣的偏好特征。将长视频第二降维序列记为D=(d1,d2,…,dn),其中,d1~dn为长视频第二降维序列中的n个长视频的视频描述向量,短视频第二降维序列记为K=(k1,k 2,…,k n),其中,k 1~k n为短视频第二降维序列中的n个短视频的视频描述向量。If the second dimensionality reduction method is used, the long video coding sequence and the short video coding sequence are multiplied by the second mapping matrix to realize the weight embedding mapping (embedding), and the corresponding second dimensionality reduction sequence is obtained, that is, the long video The coding sequence is multiplied by the second mapping matrix to obtain the second dimensionality reduction matrix of the long video, and the short video coding sequence is multiplied by the second mapping matrix to obtain the second dimensionality reduction matrix of the short video. The first mapping matrix and the second mapping matrix can include the same and/or different matrix parameters. Most of the first mapping matrix are matrix parameters related to video attributes, such as video attribute features such as video type, video ID, and channel ID. Most of the matrix parameters in the two-mapping matrix are matrix parameters related to preference attributes, for example, preference features that users may be interested in, such as play title, participating actors, director, and protagonist name. The second dimensionality reduction sequence of the long video is recorded as D=(d1, d2, ..., dn), wherein, d1~dn is the video description vector of n long videos in the second dimensionality reduction sequence of the long video, and the short video second The dimensionality reduction sequence is denoted as K=(k1, k2,...,kn), where k1˜kn are the video description vectors of n short videos in the second short video dimensionality reduction sequence.

依照第一种降维方式得到的降维序列用于反映用户对于各个长视频或短视频的喜好程度,依照第二种降维方式得到的降维序列用于反映用户对于各个偏好特征的喜好程度,以此挖掘用户的兴趣特征。The dimensionality reduction sequence obtained according to the first dimensionality reduction method is used to reflect the user's preference for each long video or short video, and the dimensionality reduction sequence obtained according to the second dimensionality reduction method is used to reflect the user's preference for each preference feature , so as to mine the user's interest characteristics.

在一个实施例中,所述特征向量包括表征向量,所述融合向量包括表征融合向量,所述基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量,包括:In one embodiment, the feature vector includes a characterization vector, the fusion vector includes a characterization fusion vector, and the corresponding feature vectors are respectively extracted based on each of the video viewing sequences, and each of the feature vectors is fused to form a fusion vector, including:

分别对各个所述第一降维序列进行均值池化处理,得到相应的所述表征向量;respectively performing mean pooling processing on each of the first dimensionality reduction sequences to obtain the corresponding characterization vectors;

根据各个所述表征向量之间的点积结果,确定一级融合向量;Determine a primary fusion vector according to the dot product results between each of the characterization vectors;

将所述一级融合向量与全部所述第一降维序列进行融合处理,得到二级融合向量;performing fusion processing on the first-level fusion vector and all the first dimensionality reduction sequences to obtain a second-level fusion vector;

将所述一级融合向量与所述二级融合向量相加形成所述表征融合向量。Adding the first-level fusion vector and the second-level fusion vector to form the representation fusion vector.

具体的,特征向量包括表征向量,表征向量用于指示用户对于不同视频观看序列的喜好程度。分别对长视频第一降维序列和短视频第一降维序列进行均值池化(MeanPooling)处理,即对长视频第一降维序列中各个视频描述向量求平均,得到相应的长视频表征向量Ec1,对短视频第一降维序列中各个视频描述向量求平均,得到相应的短视频表征向量Ei1,即表征向量包括长视频表征向量和短视频表征向量,用于反映用户对于长视频或短视频的喜好程度。Specifically, the feature vector includes a characterization vector, and the characterization vector is used to indicate the user's preference for different video viewing sequences. Mean pooling (MeanPooling) processing is performed on the first dimensionality reduction sequence of the long video and the first dimensionality reduction sequence of the short video respectively, that is, the average of each video description vector in the first dimensionality reduction sequence of the long video is averaged to obtain the corresponding long video representation vector Ec1, average the video description vectors in the first dimensionality reduction sequence of the short video to obtain the corresponding short video characterization vector Ei1, that is, the characterization vector includes the long video characterization vector and the short video characterization vector, which is used to reflect the user’s preference for long video or short video Likeness of the video.

将长视频表征向量Ec1和短视频表征向量Ei1之间的点积结果作为一级融合向量Ef1,即融合了长视频第一降维序列和短视频第一降维序列的部分信息,再将一级融合向量Ef1与长视频第一降维序列C以及短视频第一降维序列I作为输入参数输入至注意力网络模型(Attention)中,以输出融合后的二级融合向量Ef2,在一级融合向量的基础上继续融合长视频第一降维序列C和短视频第一降维序列I产生更高阶表示的二级融合向量Ef2,将一级融合向量Ef1与二级融合向量Ef2相加,形成长视频第一降维序列与短视频第一降维序列双序列的表征融合向量F表征融合向量包含不同视频观看序列之间的联系和交互信息,可用于准确捕捉用户对于感兴趣视频的多种维度。The dot product result between the long video characterization vector Ec1 and the short video characterization vector Ei1 is used as the first-level fusion vector Ef1, that is, part of the information of the first dimensionality reduction sequence of the long video and the first dimensionality reduction sequence of the short video is fused, and then a The first-level fusion vector Ef1 and the first dimensionality reduction sequence C of the long video and the first dimensionality reduction sequence I of the short video are input into the attention network model (Attention) as input parameters to output the fused level-2 fusion vector Ef2. On the basis of the fusion vector, continue to fuse the first dimensionality reduction sequence C of the long video and the first dimensionality reduction sequence I of the short video to generate a higher-order fusion vector Ef2, and add the first-level fusion vector Ef1 to the second-level fusion vector Ef2 , forming the characterization fusion vector F of the double sequence of the first dimensionality reduction sequence of the long video and the first dimensionality reduction sequence of the short video. Multiple dimensions.

在一个实施例中,所述特征向量还包括偏好向量,所述融合向量包括偏好融合向量,所述基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量,包括:In one embodiment, the feature vector further includes a preference vector, the fusion vector includes a preference fusion vector, and the corresponding feature vectors are respectively extracted based on each of the video viewing sequences, and each of the feature vectors is fused to form Fusion vectors, including:

基于各个所述第二降维序列中不同视频描述向量之间的关联关系,确定各个所述视频描述向量对应的关系学习向量,其中,所述关系学习向量包含目标描述向量以及所述目标描述向量与所述第二降维序列中各个所述视频描述向量之间的关联关系,所述目标描述向量为所述第二降维序列中任意一个所述视频描述向量;Based on the association relationship between different video description vectors in each of the second dimensionality reduction sequences, determine a relationship learning vector corresponding to each of the video description vectors, wherein the relationship learning vector includes a target description vector and the target description vector The association relationship with each of the video description vectors in the second dimensionality reduction sequence, the target description vector is any one of the video description vectors in the second dimensionality reduction sequence;

根据同一所述第二降维序列对应的多个所述关系学习向量属于各个偏好特征的置信度,确定各个所述第二降维序列对应的偏好向量;determining a preference vector corresponding to each of the second dimensionality reduction sequences according to the confidence that the plurality of relational learning vectors corresponding to the same second dimensionality reduction sequence belong to each preference feature;

将各个所述第二降维序列对应的偏好向量融合形成所述偏好融合向量。The preference vectors corresponding to each of the second dimensionality reduction sequences are fused to form the preference fusion vector.

具体的,特征向量还包括偏好向量,偏好向量用于指示用户对于不同偏好特征的喜好程度,即通过偏好向量反映用户对于视频不同方面的感兴趣程度,偏好特征包括上述属性信息中的全部或部分特征,例如参演演员、导演、视频类别等。Specifically, the feature vector also includes a preference vector, which is used to indicate the user's preference for different preference features, that is, the preference vector reflects the user's degree of interest in different aspects of the video, and the preference features include all or part of the above attribute information Characteristics, such as starring actors, directors, category of video, etc.

分别将长视频第二降维序列D和短视频第二降维序列K作为输入参数输入至多头自注意力网络模型中,以对各个第二降维序列中不同视频描述向量之间的关联关系进行学习后输出各个视频描述向量对应的关系学习向量,即关系学习向量不仅包含相应视频对应的属性信息还包含该视频与同序列中其他视频之间的关联关系,即对同序列中各个视频描述向量进行融合学习,再将多头自注意力网络模型输出的多个关系学习向量作为输入参数输入至全连接层网络结构中,对各个关系学习向量进行分类,并计算各个关系学习向量属于各个偏好特征的置信度,根据各个关系学习向量属于各个偏好特征的置信度从而确定用户对于各个偏好特征的偏好度,各个偏好特征的偏好度形成第二降维序列相应的偏好向量,即分别得到长视频第二降维序列D对应的长视频偏好向量和短视频第二降维序列K对应的短视频偏好向量。The second dimensionality reduction sequence D of the long video and the second dimensionality reduction sequence K of the short video are respectively input into the multi-head self-attention network model as input parameters, so as to describe the relationship between different video description vectors in each second dimensionality reduction sequence After learning, output the relationship learning vector corresponding to each video description vector, that is, the relationship learning vector not only contains the attribute information corresponding to the corresponding video, but also includes the relationship between the video and other videos in the same sequence, that is, the description of each video in the same sequence Vectors are fused and learned, and then multiple relational learning vectors output by the multi-head self-attention network model are input into the fully connected layer network structure as input parameters, and each relational learning vector is classified, and each relational learning vector is calculated to belong to each preference feature According to the confidence degree of each relational learning vector belonging to each preference feature, the user’s preference degree for each preference feature is determined, and the preference degree of each preference feature forms the corresponding preference vector of the second dimensionality reduction sequence, that is, the long video No. The long video preference vector corresponding to the second dimensionality reduction sequence D and the short video preference vector corresponding to the second dimensionality reduction sequence K of the short video.

将长视频偏好向量记为P=(p1,p2,…,pm),其中,p1~pm用于指示用户面对长视频时对于m个偏好特征的偏好度,将短视频偏好向量记为Q=(q1,q2,…,qm),其中,q1~qm用于指示用户面对端视频时对于m个偏好特征的偏好度,不同的用户在面对长视频或短视频时对于相同的偏好特征对应不同的偏好度,例如,偏好特征为参演演员,年轻人面对长视频时更喜欢年轻演员参演的电影或电视剧,即对于年轻演员参演的长视频的偏好度更高;年长者面对长视频时更喜欢资深演员参演的电影或电视剧,即对于资深演员参演的长视频的偏好度更高。以此方式可以挖掘出不同用户标识对于不同视频的偏好特征以及对于各个偏好特征的偏好程度。The long video preference vector is denoted as P=(p1, p2,...,pm), where p1~pm is used to indicate the user's preference for m preference features when facing long videos, and the short video preference vector is denoted as Q =(q1, q2, ..., qm), where q1~qm are used to indicate the user's preference for m preference features when facing end-to-end videos, and different users have the same preference for long or short videos Features correspond to different preferences. For example, the preference feature is actors, and young people prefer movies or TV series played by young actors when facing long videos, that is, they have a higher preference for long videos played by young actors; When faced with long videos, the elderly prefer movies or TV series with senior actors, that is, they have a higher preference for long videos with senior actors. In this way, the preference characteristics of different user identifiers for different videos and the degree of preference for each preference characteristic can be mined.

将长视频偏好向量和短视频偏好向量输入至MLP网络中融合形成偏好融合向量H,偏好融合向量用于反映用户对于不同视频的偏好特征以及对于各个偏好特征的偏好程度,以准确捕捉到用户对于视频的偏好特征。The long video preference vector and the short video preference vector are input into the MLP network and fused to form a preference fusion vector H. The preference fusion vector is used to reflect the user's preference characteristics for different videos and the degree of preference for each preference characteristic, so as to accurately capture the user's preference for Preference characteristics of the video.

在一个实施例中,所述根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,包括:In one embodiment, the determining the recommended value of each video vector to be recommended according to the obtained inner product result of each video vector to be recommended and the fusion vector includes:

将获取到的各个待推荐视频的属性信息,转换为相应的所述待推荐视频向量;converting the acquired attribute information of each video to be recommended into a corresponding vector of the video to be recommended;

根据所述表征融合向量或所述偏好融合向量与各个所述待推荐视频向量之间的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息。According to the inner product result between the characterization fusion vector or the preference fusion vector and each of the video vectors to be recommended, determine the recommendation value of each video vector to be recommended, wherein the video vector to be recommended is used to indicate Attribute information of the video to be recommended.

具体的,将各个待推荐视频的视频属性信息映射至向量表示,在本实施例中通过MLP网络模型对视频属性信息进行映射转换处理,从而输出相应的待推荐视频向量V,根据表征融合向量或偏好融合向量与每个待推荐视频向量的点积结果,以实现双塔建模计算待推荐视频的推荐值,根据表征融合向量与待推荐视频向量之间的点积结果作为该待推荐视频向量的推荐值,即F·V,融合了长视频观看序列和短视频观看序列的交叉信息以及待推荐视频的属性信息,来确定待推荐视频的粗排结果;根据偏好融合向量与待推荐视频向量之间的点积结果作为该待推荐视频向量的推荐值,即H·V,融合了用户对于长视频观看序列和短视频观看序列的喜好信息以及待推荐视频的属性信息,来确定待推荐视频的粗排结果。Specifically, the video attribute information of each video to be recommended is mapped to a vector representation. In this embodiment, the video attribute information is mapped and converted through the MLP network model, thereby outputting the corresponding video vector V to be recommended. According to the representation fusion vector or The dot product result of the preference fusion vector and each video vector to be recommended is used to realize two-tower modeling to calculate the recommended value of the video to be recommended, and the dot product result between the fusion vector and the video vector to be recommended is used as the video vector to be recommended The recommendation value of , that is, F V, combines the cross information of the long video viewing sequence and the short video viewing sequence and the attribute information of the video to be recommended to determine the rough sorting result of the video to be recommended; according to the preference fusion vector and the video vector to be recommended The dot product result between is used as the recommended value of the video vector to be recommended, that is, H·V, which combines the user's preference information for long video viewing sequences and short video viewing sequences and the attribute information of the video to be recommended to determine the video to be recommended The results of rough sorting.

在一个实施例中,所述将获取到的各个待推荐视频的属性信息,转换为相应的所述待推荐视频向量之后,所述方法还包括:In one embodiment, after converting the acquired attribute information of each video to be recommended into corresponding vectors of the video to be recommended, the method further includes:

将所述待推荐视频向量分别与所述表征融合向量、所述偏好融合向量的点积结果相加,得到所述待推荐视频向量的推荐值。The video vector to be recommended is added to the dot product result of the characterization fusion vector and the preference fusion vector to obtain the recommendation value of the video vector to be recommended.

具体的,将所述待推荐视频向量分别与所述表征融合向量、所述偏好融合向量的点积结果相加,即F·V+H·V,以实现三塔建模计算待推荐视频的推荐值,融合了长视频观看序列和短视频观看序列的交叉信息、用户对于长视频观看序列和短视频观看序列的喜好信息以及待推荐视频的属性信息,以确定待推荐视频的粗排结果,从而更加充分全面的捕捉用户对于视频的偏好信息,以提供更符合用户喜好的视频粗排结果。Specifically, the vector of the video to be recommended is added to the result of the dot product of the characterization fusion vector and the preference fusion vector respectively, that is, F·V+H·V, so as to realize three-tower modeling calculation of the video to be recommended The recommendation value combines the cross information of the long video viewing sequence and the short video viewing sequence, the user's preference information for the long video viewing sequence and the short video viewing sequence, and the attribute information of the video to be recommended to determine the rough sorting result of the video to be recommended. In this way, the user's preference information for videos can be more fully and comprehensively captured, so as to provide a video rough sorting result more in line with the user's preferences.

无论是采用双塔建模还是三塔建模,建模流程同样依照上述对于模型使用流程对初始模型进行学习训练,即依照上述流程对样本数据集合中的样本序列进行交叉学习训练,以得到能够实现上述流程的神经网络模型。Regardless of whether two-tower modeling or three-tower modeling is used, the modeling process also learns and trains the initial model according to the above-mentioned process for using the model, that is, performs cross-learning training on the sample sequence in the sample data set according to the above-mentioned process, so as to obtain A neural network model that implements the above process.

图2为一个实施例中视频推荐方法的流程示意图。应该理解的是,虽然图2的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图2中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。Fig. 2 is a schematic flowchart of a video recommendation method in an embodiment. It should be understood that although the various steps in the flow chart of FIG. 2 are displayed sequentially as indicated by the arrows, these steps are not necessarily executed sequentially in the order indicated by the arrows. Unless otherwise specified herein, there is no strict order restriction on the execution of these steps, and these steps can be executed in other orders. Moreover, at least some of the steps in FIG. 2 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed at the same time, but may be executed at different times. The execution of these sub-steps or stages The order is not necessarily performed sequentially, but may be performed alternately or alternately with at least a part of other steps or sub-steps or stages of other steps.

在一个实施例中,如图3所示,提供了一种视频推荐装置,包括:In one embodiment, as shown in Figure 3, a video recommendation device is provided, including:

获取模块310,用于获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列;An acquisition module 310, configured to acquire a video viewing set corresponding to the target user identifier, wherein the video viewing set includes at least two video viewing sequences with different video durations;

融合模块320,用于基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量;A fusion module 320, configured to extract corresponding feature vectors based on each of the video viewing sequences, and fuse each of the feature vectors to form a fusion vector;

确定模块330,用于根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息;The determination module 330 is configured to determine the recommended value of each video vector to be recommended according to the acquired inner product result of each video vector to be recommended and the fusion vector, wherein the video vector to be recommended is used to indicate the video vector to be recommended Attribute information of the video;

推送模块340,用于依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。The push module 340 is configured to push the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommendation value.

在一个实施例中,所述获取模块310具体用于:In one embodiment, the acquiring module 310 is specifically configured to:

获取所述目标用户标识对应的历史视频集合,其中,所述历史视频集合包括至少两个视频时长不同的历史视频序列,所述历史视频序列包括多个视频属性向量;Obtaining a historical video collection corresponding to the target user identifier, wherein the historical video collection includes at least two historical video sequences with different video durations, and the historical video sequences include a plurality of video attribute vectors;

对各个所述历史视频序列分别进行独热编码处理,得到相应的视频编码序列,其中,所述视频编码序列包括多个所述视频属性向量对应的独热码向量;Performing one-hot encoding processing on each of the historical video sequences to obtain a corresponding video encoding sequence, wherein the video encoding sequence includes a plurality of one-hot encoding vectors corresponding to the video attribute vectors;

将各个所述视频编码序列进行降维处理,得到相应的所述视频观看序列,其中,所述视频观看序列包括多个视频描述向量。Perform dimensionality reduction processing on each of the video coding sequences to obtain the corresponding video viewing sequence, wherein the video viewing sequence includes a plurality of video description vectors.

在一个实施例中,所述获取模块310具体用于:In one embodiment, the acquiring module 310 is specifically configured to:

将各个所述视频编码序列分别与第一映射矩阵相乘,得到相应的所述第一降维序列,其中,所述第一映射矩阵包含视频属性对应的矩阵参数;Multiplying each of the video encoding sequences by the first mapping matrix respectively to obtain the corresponding first dimensionality reduction sequence, wherein the first mapping matrix includes matrix parameters corresponding to video attributes;

将各个所述视频编码序列分别与第二映射矩阵相乘,得到相应的所述第二降维序列,其中,所述第二映射矩阵包含偏好属性对应的矩阵参数。Multiplying each of the video coding sequences by the second mapping matrix respectively to obtain the corresponding second dimensionality reduction sequence, wherein the second mapping matrix includes matrix parameters corresponding to the preference attributes.

在一个实施例中,所述融合模块320具体用于:In one embodiment, the fusion module 320 is specifically used for:

分别对各个所述第一降维序列进行均值池化处理,得到相应的所述表征向量;respectively performing mean pooling processing on each of the first dimensionality reduction sequences to obtain the corresponding characterization vectors;

根据各个所述表征向量之间的点积结果,确定一级融合向量;Determine a primary fusion vector according to the dot product results between each of the characterization vectors;

将所述一级融合向量与全部所述第一降维序列进行融合处理,得到二级融合向量;performing fusion processing on the first-level fusion vector and all the first dimensionality reduction sequences to obtain a second-level fusion vector;

将所述一级融合向量与所述二级融合向量相加形成所述表征融合向量。Adding the first-level fusion vector and the second-level fusion vector to form the representation fusion vector.

在一个实施例中,所述融合模块320具体用于:In one embodiment, the fusion module 320 is specifically used for:

基于各个所述第二降维序列中不同视频描述向量之间的关联关系,确定各个所述视频描述向量对应的关系学习向量,其中,所述关系学习向量包含目标描述向量以及所述目标描述向量与所述第二降维序列中各个所述视频描述向量之间的关联关系,所述目标描述向量为所述第二降维序列中任意一个所述视频描述向量;Based on the association relationship between different video description vectors in each of the second dimensionality reduction sequences, determine a relationship learning vector corresponding to each of the video description vectors, wherein the relationship learning vector includes a target description vector and the target description vector The association relationship with each of the video description vectors in the second dimensionality reduction sequence, the target description vector is any one of the video description vectors in the second dimensionality reduction sequence;

根据同一所述第二降维序列对应的多个所述关系学习向量属于各个偏好特征的置信度,确定各个所述第二降维序列对应的偏好向量;determining a preference vector corresponding to each of the second dimensionality reduction sequences according to the confidence that the plurality of relational learning vectors corresponding to the same second dimensionality reduction sequence belong to each preference feature;

将各个所述第二降维序列对应的偏好向量融合形成所述偏好融合向量。The preference vectors corresponding to each of the second dimensionality reduction sequences are fused to form the preference fusion vector.

在一个实施例中,所述确定模块330具体用于:In one embodiment, the determining module 330 is specifically configured to:

将获取到的各个待推荐视频的属性信息,转换为相应的所述待推荐视频向量;converting the acquired attribute information of each video to be recommended into a corresponding vector of the video to be recommended;

根据所述表征融合向量或所述偏好融合向量与各个所述待推荐视频向量之间的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息。According to the inner product result between the characterization fusion vector or the preference fusion vector and each of the video vectors to be recommended, determine the recommendation value of each video vector to be recommended, wherein the video vector to be recommended is used to indicate Attribute information of the video to be recommended.

在一个实施例中,所述确定模块330具体用于:In one embodiment, the determining module 330 is specifically configured to:

将所述待推荐视频向量分别与所述表征融合向量、所述偏好融合向量的点积结果相加,得到所述待推荐视频向量的推荐值。The video vector to be recommended is added to the dot product result of the characterization fusion vector and the preference fusion vector to obtain the recommendation value of the video vector to be recommended.

图4示出了一个实施例中计算机设备的内部结构图。该计算机设备具体可以是图1中的服务器120。如图4所示,该计算机设备包括该计算机设备包括通过系统总线连接的处理器、存储器、网络接口、输入装置和显示屏。其中,存储器包括非易失性存储介质和内存储器。该计算机设备的非易失性存储介质存储有操作系统,还可存储有计算机程序,该计算机程序被处理器执行时,可使得处理器实现视频推荐方法。该内存储器中也可储存有计算机程序,该计算机程序被处理器执行时,可使得处理器执行视频推荐方法。计算机设备的显示屏可以是液晶显示屏或者电子墨水显示屏,计算机设备的输入装置可以是显示屏上覆盖的触摸层,也可以是计算机设备外壳上设置的按键、轨迹球或触控板,还可以是外接的键盘、触控板或鼠标等。Figure 4 shows a diagram of the internal structure of a computer device in one embodiment. Specifically, the computer device may be the server 120 in FIG. 1 . As shown in FIG. 4 , the computer equipment includes a processor, a memory, a network interface, an input device, and a display screen connected through a system bus. Wherein, the memory includes a non-volatile storage medium and an internal memory. The non-volatile storage medium of the computer device stores an operating system, and may also store a computer program. When the computer program is executed by the processor, the processor may implement the video recommendation method. A computer program may also be stored in the internal memory, and when the computer program is executed by the processor, the processor may execute the video recommendation method. The display screen of the computer equipment may be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment may be a touch layer covered on the display screen, or a button, a trackball or a touch pad provided on the casing of the computer equipment, or It can be an external keyboard, touchpad or mouse.

本领域技术人员可以理解,图4中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定,具体的计算机设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in Figure 4 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation to the computer equipment on which the solution of the application is applied. The specific computer equipment can be More or fewer components than shown in the figures may be included, or some components may be combined, or have a different arrangement of components.

在一个实施例中,本申请提供的视频推荐装置可以实现为一种计算机程序的形式,计算机程序可在如图4所示的计算机设备上运行。计算机设备的存储器中可存储组成该视频推荐装置的各个程序模块,比如,图3所示的获取模块310、融合模块320、确定模块330和推送模块340。各个程序模块构成的计算机程序使得处理器执行本说明书中描述的本申请各个实施例的视频推荐方法中的步骤。In one embodiment, the video recommendation apparatus provided in the present application may be implemented in the form of a computer program, and the computer program may run on the computer device as shown in FIG. 4 . Various program modules constituting the video recommendation apparatus can be stored in the memory of the computer equipment, for example, the acquisition module 310, the fusion module 320, the determination module 330 and the push module 340 shown in FIG. 3 . The computer program constituted by each program module enables the processor to execute the steps in the video recommendation method of each embodiment of the application described in this specification.

图4所示的计算机设备可以通过如图3所示的视频推荐装置中的获取模块310执行获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列。计算机设备可通过融合模块320执行基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量。计算机设备可通过确定模块330执行根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息。计算机设备可通过推送模块340执行依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。The computer device shown in FIG. 4 can execute the acquisition module 310 in the video recommendation device as shown in FIG. 3 to acquire the video viewing set corresponding to the target user identification, wherein the video viewing set includes at least two videos with different video durations. Watch the sequence. The computer device may use the fusion module 320 to extract corresponding feature vectors based on each of the video viewing sequences, and fuse each of the feature vectors to form a fusion vector. The computer device may determine the recommended value of each video vector to be recommended according to the obtained inner product result of each video vector to be recommended and the fusion vector through the determination module 330, wherein the video vector to be recommended is used to indicate Attribute information of the video to be recommended. The computer device may use the push module 340 to push the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommendation value.

在一个实施例中,提供了一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,处理器执行计算机程序时实现上述任一项实施例所述的方法。In one embodiment, a computer device is provided, including a memory, a processor, and a computer program stored on the memory and operable on the processor. method.

在一个实施例中,提供了一种计算机可读存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现上述任一项实施例所述的方法。In one embodiment, a computer-readable storage medium is provided, on which a computer program is stored, and when the computer program is executed by a processor, the method described in any one of the above-mentioned embodiments is implemented.

本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指示相关的硬件来完成,所述的程序可存储于一非易失性计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双倍速率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be realized by instructing related hardware through computer programs, and the programs can be stored in a non-volatile computer-readable storage medium When the program is executed, it may include the processes of the embodiments of the above-mentioned methods. Wherein, any references to memory, storage, database or other media used in the various embodiments provided in the present application may include non-volatile and/or volatile memory. Nonvolatile memory can include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory can include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Chain Synchlink DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

需要说明的是,在本文中,诸如“第一”和“第二”等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that in this article, relative terms such as "first" and "second" are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply these No such actual relationship or order exists between entities or operations. Furthermore, the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, but also includes elements not expressly listed. other elements of or also include elements inherent in such a process, method, article, or device. Without further limitations, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or apparatus comprising said element.

以上所述仅是本发明的具体实施方式,使本领域技术人员能够理解或实现本发明。对这些实施例的多种修改对本领域的技术人员来说将是显而易见的,本文中所定义的一般原理可以在不脱离本发明的精神或范围的情况下,在其它实施例中实现。因此,本发明将不会被限制于本文所示的这些实施例,而是要符合与本文所申请的原理和新颖特点相一致的最宽的范围。The above descriptions are only specific embodiments of the present invention, so that those skilled in the art can understand or implement the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the invention. Accordingly, the present invention will not be limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features claimed herein.

Claims (10)

1.一种视频推荐方法,其特征在于,所述方法包括:1. A video recommendation method, characterized in that the method comprises: 获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列;Obtain a video viewing set corresponding to the target user identifier, wherein the video viewing set includes at least two video viewing sequences with different video durations; 基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量;Extracting corresponding feature vectors based on each of the video viewing sequences, and fusing each of the feature vectors to form a fusion vector; 根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息;According to the obtained inner product result of each video vector to be recommended and the fusion vector, the recommendation value of each video vector to be recommended is determined, wherein the video vector to be recommended is used to indicate the attribute information of the video to be recommended; 依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。Pushing the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommendation value. 2.根据权利要求1所述的方法,其特征在于,所述获取目标用户标识对应的视频观看集合,包括:2. The method according to claim 1, wherein said acquiring the video viewing set corresponding to the target user identifier comprises: 获取所述目标用户标识对应的历史视频集合,其中,所述历史视频集合包括至少两个视频时长不同的历史视频序列,所述历史视频序列包括多个视频属性向量;Obtaining a historical video collection corresponding to the target user identifier, wherein the historical video collection includes at least two historical video sequences with different video durations, and the historical video sequences include a plurality of video attribute vectors; 对各个所述历史视频序列分别进行独热编码处理,得到相应的视频编码序列,其中,所述视频编码序列包括多个所述视频属性向量对应的独热码向量;Performing one-hot encoding processing on each of the historical video sequences to obtain a corresponding video encoding sequence, wherein the video encoding sequence includes a plurality of one-hot encoding vectors corresponding to the video attribute vectors; 将各个所述视频编码序列进行降维处理,得到相应的所述视频观看序列,其中,所述视频观看序列包括多个视频描述向量。Perform dimensionality reduction processing on each of the video coding sequences to obtain the corresponding video viewing sequence, wherein the video viewing sequence includes a plurality of video description vectors. 3.根据权利要求1所述的方法,其特征在于,所述视频观看序列包括第一降维序列和第二降维序列,所述将各个所述视频编码序列进行降维处理,得到相应的所述视频观看序列,包括以下至少之一:3. The method according to claim 1, wherein the video viewing sequence comprises a first dimensionality reduction sequence and a second dimensionality reduction sequence, and each of the video coding sequences is subjected to dimensionality reduction processing to obtain corresponding The video viewing sequence includes at least one of the following: 将各个所述视频编码序列分别与第一映射矩阵相乘,得到相应的所述第一降维序列,其中,所述第一映射矩阵包含视频属性对应的矩阵参数;Multiplying each of the video encoding sequences by the first mapping matrix respectively to obtain the corresponding first dimensionality reduction sequence, wherein the first mapping matrix includes matrix parameters corresponding to video attributes; 将各个所述视频编码序列分别与第二映射矩阵相乘,得到相应的所述第二降维序列,其中,所述第二映射矩阵包含偏好属性对应的矩阵参数。Multiplying each of the video coding sequences by the second mapping matrix respectively to obtain the corresponding second dimensionality reduction sequence, wherein the second mapping matrix includes matrix parameters corresponding to the preference attributes. 4.根据权利要求3所述的方法,其特征在于,所述特征向量包括表征向量,所述融合向量包括表征融合向量,所述基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量,包括:4. The method according to claim 3, wherein the feature vector comprises a characterization vector, the fusion vector comprises a characterization fusion vector, and the corresponding feature vectors are respectively extracted based on each of the video viewing sequences, and Merge each of the feature vectors to form a fusion vector, including: 分别对各个所述第一降维序列进行均值池化处理,得到相应的所述表征向量;respectively performing mean pooling processing on each of the first dimensionality reduction sequences to obtain the corresponding characterization vectors; 根据各个所述表征向量之间的点积结果,确定一级融合向量;Determine a primary fusion vector according to the dot product results between each of the characterization vectors; 将所述一级融合向量与全部所述第一降维序列进行融合处理,得到二级融合向量;performing fusion processing on the first-level fusion vector and all the first dimensionality reduction sequences to obtain a second-level fusion vector; 将所述一级融合向量与所述二级融合向量相加形成所述表征融合向量。Adding the first-level fusion vector and the second-level fusion vector to form the representation fusion vector. 5.根据权利要求4所述的方法,其特征在于,所述特征向量还包括偏好向量,所述融合向量包括偏好融合向量,所述基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量,包括:5. The method according to claim 4, wherein the feature vector also includes a preference vector, the fusion vector includes a preference fusion vector, and the corresponding feature vectors are respectively extracted based on each of the video viewing sequences, And each of the feature vectors is fused to form a fusion vector, including: 基于各个所述第二降维序列中不同视频描述向量之间的关联关系,确定各个所述视频描述向量对应的关系学习向量,其中,所述关系学习向量包含目标描述向量以及所述目标描述向量与所述第二降维序列中各个所述视频描述向量之间的关联关系,所述目标描述向量为所述第二降维序列中任意一个所述视频描述向量;Based on the association relationship between different video description vectors in each of the second dimensionality reduction sequences, determine a relationship learning vector corresponding to each of the video description vectors, wherein the relationship learning vector includes a target description vector and the target description vector The association relationship with each of the video description vectors in the second dimensionality reduction sequence, the target description vector is any one of the video description vectors in the second dimensionality reduction sequence; 根据同一所述第二降维序列对应的多个所述关系学习向量属于各个偏好特征的置信度,确定各个所述第二降维序列对应的偏好向量;determining a preference vector corresponding to each of the second dimensionality reduction sequences according to the confidence that the plurality of relational learning vectors corresponding to the same second dimensionality reduction sequence belong to each preference feature; 将各个所述第二降维序列对应的偏好向量融合形成所述偏好融合向量。The preference vectors corresponding to each of the second dimensionality reduction sequences are fused to form the preference fusion vector. 6.根据权利要求5所述的方法,其特征在于,所述根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,包括:6. The method according to claim 5, wherein the inner product result of each obtained video vector to be recommended and the fusion vector is determined to determine the recommended value of each video vector to be recommended, including: 将获取到的各个待推荐视频的属性信息,转换为相应的所述待推荐视频向量;converting the acquired attribute information of each video to be recommended into a corresponding vector of the video to be recommended; 根据所述表征融合向量或所述偏好融合向量与各个所述待推荐视频向量之间的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息。According to the inner product result between the characterization fusion vector or the preference fusion vector and each of the video vectors to be recommended, determine the recommendation value of each video vector to be recommended, wherein the video vector to be recommended is used to indicate Attribute information of the video to be recommended. 7.根据权利要求6所述的方法,其特征在于,所述将获取到的各个待推荐视频的属性信息,转换为相应的所述待推荐视频向量之后,所述方法还包括:7. The method according to claim 6, wherein, after converting the acquired attribute information of each video to be recommended into the corresponding video vector to be recommended, the method further comprises: 将所述待推荐视频向量分别与所述表征融合向量、所述偏好融合向量的点积结果相加,得到所述待推荐视频向量的推荐值。The video vector to be recommended is added to the dot product result of the characterization fusion vector and the preference fusion vector to obtain the recommendation value of the video vector to be recommended. 8.一种视频推荐装置,其特征在于,所述装置包括:8. A video recommendation device, characterized in that the device comprises: 获取模块,用于获取目标用户标识对应的视频观看集合,其中,所述视频观看集合包括至少两个视频时长不同的视频观看序列;An acquisition module, configured to acquire a video viewing set corresponding to the target user identifier, wherein the video viewing set includes at least two video viewing sequences with different video durations; 融合模块,用于基于各个所述视频观看序列分别提取出相应的特征向量,并将各个所述特征向量融合形成融合向量;A fusion module, configured to extract corresponding feature vectors based on each of the video viewing sequences, and fuse each of the feature vectors to form a fusion vector; 确定模块,用于根据获取到的各个待推荐视频向量与所述融合向量的内积结果,确定各个所述待推荐视频向量的推荐值,其中,所述待推荐视频向量用于指示待推荐视频的属性信息;A determination module, configured to determine the recommended value of each video vector to be recommended according to the acquired inner product result of each video vector to be recommended and the fusion vector, wherein the video vector to be recommended is used to indicate the video to be recommended attribute information; 推送模块,用于依照推荐值的降序顺序推送相应所述待推荐视频向量对应的视频数据至所述目标用户标识对应终端。The push module is configured to push the video data corresponding to the video vector to be recommended to the terminal corresponding to the target user identifier in descending order of the recommendation value. 9.一种计算机设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现权利要求1至7中任一项所述方法的步骤。9. A computer device, comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, characterized in that, when the processor executes the computer program, any one of claims 1 to 7 is realized. A step of said method. 10.一种计算机可读存储介质,其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现权利要求1至7中任一项所述方法的步骤。10. A computer-readable storage medium, on which a computer program is stored, characterized in that, when the computer program is executed by a processor, the steps of the method according to any one of claims 1 to 7 are implemented.
CN202211160276.7A 2022-09-22 2022-09-22 Video recommendation method and device, computer equipment and storage medium Pending CN115391663A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211160276.7A CN115391663A (en) 2022-09-22 2022-09-22 Video recommendation method and device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211160276.7A CN115391663A (en) 2022-09-22 2022-09-22 Video recommendation method and device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115391663A true CN115391663A (en) 2022-11-25

Family

ID=84126154

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211160276.7A Pending CN115391663A (en) 2022-09-22 2022-09-22 Video recommendation method and device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115391663A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116975425A (en) * 2022-12-28 2023-10-31 腾讯科技(深圳)有限公司 Video recommendation method, device, electronic device and computer-readable storage medium
CN118747224A (en) * 2024-06-24 2024-10-08 中央广播电视总台 A recommendation method and system based on multi-tower model

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110798718A (en) * 2019-09-02 2020-02-14 腾讯科技(深圳)有限公司 Video recommendation method and device
CN111339355A (en) * 2020-05-21 2020-06-26 北京搜狐新媒体信息技术有限公司 A video recommendation method and system
CN112818251A (en) * 2021-04-13 2021-05-18 腾讯科技(深圳)有限公司 Video recommendation method and device, electronic equipment and storage medium
CN114722238A (en) * 2022-05-13 2022-07-08 北京达佳互联信息技术有限公司 Video recommendation method and device, electronic equipment, storage medium and program product

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110798718A (en) * 2019-09-02 2020-02-14 腾讯科技(深圳)有限公司 Video recommendation method and device
CN111339355A (en) * 2020-05-21 2020-06-26 北京搜狐新媒体信息技术有限公司 A video recommendation method and system
CN112818251A (en) * 2021-04-13 2021-05-18 腾讯科技(深圳)有限公司 Video recommendation method and device, electronic equipment and storage medium
CN114722238A (en) * 2022-05-13 2022-07-08 北京达佳互联信息技术有限公司 Video recommendation method and device, electronic equipment, storage medium and program product

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116975425A (en) * 2022-12-28 2023-10-31 腾讯科技(深圳)有限公司 Video recommendation method, device, electronic device and computer-readable storage medium
CN116975425B (en) * 2022-12-28 2025-12-02 腾讯科技(深圳)有限公司 Video recommendation methods, devices, electronic devices, and computer-readable storage media
CN118747224A (en) * 2024-06-24 2024-10-08 中央广播电视总台 A recommendation method and system based on multi-tower model

Similar Documents

Publication Publication Date Title
CN110263243B (en) Media information recommendation method, device, storage medium and computer equipment
CN109002488B (en) Recommendation model training method and device based on meta-path context
KR101944469B1 (en) Estimating and displaying social interest in time-based media
CN110909182B (en) Multimedia resource searching method, device, computer equipment and storage medium
JP6523498B1 (en) Learning device, learning method and learning program
CN115066906A (en) Method and system for recommending based on user-provided criteria
CN112148889A (en) Recommendation list generation method and device
US8806000B1 (en) Identifying viral videos
US20230004608A1 (en) Method for content recommendation and device
CN112364204B (en) Video searching method, device, computer equipment and storage medium
US20180121470A1 (en) Object Annotation in Media Items
CN113515696A (en) Recommended method, apparatus, electronic device and storage medium
CN104813674A (en) Systems and methods for optimizing video
CN113220974B (en) Click rate prediction model training and search recall method, device, equipment and medium
Cho et al. AMPS: Predicting popularity of short-form videos using multi-modal attention mechanisms in social media marketing environments
WO2022042157A1 (en) Method and apparatus for manufacturing video data, and computer device and storage medium
WO2024188044A1 (en) Video tag generation method and apparatus, electronic device, and storage medium
CN115391663A (en) Video recommendation method and device, computer equipment and storage medium
WO2024255428A1 (en) Sample processing method for cross-domain recommendation model, sample processing apparatus for cross-domain recommendation model, and electronic device, computer storage medium and computer program product
CN104363478B (en) Program label association rule analyzing system and method
CN110162689B (en) Information push method, device, computer equipment and storage medium
WO2021036344A1 (en) Digest generation method and device
CN111353052B (en) Multimedia object recommendation method and device, electronic equipment and storage medium
CN117688390A (en) Content matching method, apparatus, computer device, storage medium, and program product
CN112035740B (en) Project usage duration prediction method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Country or region after: China

Address after: 100080 Room 108, 1st Floor, No. 52 West Beisihuan Road, Haidian District, Beijing

Applicant after: BEIJING QIYI CENTURY SCIENCE & TECHNOLOGY Co.,Ltd.

Address before: A 100080 street Beijing city Haidian District North No. 2, building 10, 11 layers of Goldtron expand

Applicant before: BEIJING QIYI CENTURY SCIENCE & TECHNOLOGY Co.,Ltd.

Country or region before: China