WO2023060759A1 - 视频推送方法、设备及存储介质 - Google Patents

视频推送方法、设备及存储介质 Download PDF

Info

Publication number
WO2023060759A1
WO2023060759A1 PCT/CN2021/139233 CN2021139233W WO2023060759A1 WO 2023060759 A1 WO2023060759 A1 WO 2023060759A1 CN 2021139233 W CN2021139233 W CN 2021139233W WO 2023060759 A1 WO2023060759 A1 WO 2023060759A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
scene recognition
recognition result
category
audio
Prior art date
Application number
PCT/CN2021/139233
Other languages
English (en)
French (fr)
Inventor
孙思凯
Original Assignee
深圳创维-Rgb电子有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳创维-Rgb电子有限公司 filed Critical 深圳创维-Rgb电子有限公司
Publication of WO2023060759A1 publication Critical patent/WO2023060759A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • H04N21/4668Learning process for intelligent management, e.g. learning user preferences for recommending movies for recommending content, e.g. movies

Definitions

  • the present application relates to the field of multimedia technologies, and in particular to a video push method, device, equipment and storage medium.
  • the main purpose of this application is to provide a video push method, device, equipment and storage medium, aiming to solve the technical problem in the prior art that the video content pushed by the user when watching a video is random and cannot meet the viewing needs of the user.
  • the application provides a video push method, the method includes the following steps:
  • the audio and video information includes image information and/or audio information:
  • the step of obtaining the audio-video information of the multimedia file played by the display interface includes:
  • the image information of the multimedia file played on the display interface is intercepted according to the preset interception frequency
  • the scene recognition result includes: an image scene recognition result and/or an audio scene recognition result;
  • the step of performing scene recognition on the audio and video information through a preset scene recognition model to obtain a scene recognition result includes:
  • the image features in the image information intercepted each time are extracted by the preset scene recognition model, and scene recognition is performed on the extracted image features, and after the step of obtaining the image scene recognition result, it also includes:
  • the image information of the multimedia file played on the display interface is intercepted according to the preset interception frequency.
  • the video push category is determined according to the scene recognition result, and before the step of pushing the video according to the video push category, it also includes:
  • the video push category is determined according to the scene recognition result, and the step of pushing video according to the video push category includes:
  • the step of obtaining the audio and video information of the multimedia file played by the display interface it includes:
  • a video push device which includes:
  • the obtaining module is used to obtain the audio and video information of the multimedia file played by the display interface
  • a recognition module configured to perform scene recognition on the audio and video information through a preset scene recognition model to obtain a scene recognition result
  • the push module is configured to determine the video push category according to the scene recognition result, and perform video push according to the video push category.
  • the present application also proposes a video push device, which includes: a memory, a processor, and a video push program stored in the memory and operable on the processor, the video The push program is configured to realize the steps of the above-mentioned video push method.
  • the present application also proposes a storage medium, on which a video push program is stored, and when the video push program is executed by a processor, the steps of the video push method as described above are implemented.
  • the application obtains the audio and video information of the multimedia files played on the display interface; performs scene recognition on the audio and video information through the preset scene recognition model to obtain the scene recognition result; determines the video push category according to the scene recognition result, and according to the Video push category for video push. Since this application performs scene recognition on audio and video information through a preset scene recognition model, and obtains the scene recognition result; determines the video push category according to the scene recognition result, and pushes the video according to the video push category.
  • the above-mentioned method of this application can push videos that users are interested in and improve user experience.
  • the product links used as drainage in the video content will not arouse the disgust of users. Instead, it can increase the success rate of users clicking and entering the background mall, better meet the user's usage habits and improve the success rate of drainage, while increasing operating income. , It can also reduce the complaint rate of users.
  • FIG. 1 is a schematic structural diagram of a video push device in a hardware operating environment involved in an embodiment of the present application
  • FIG. 2 is a schematic flow diagram of the first embodiment of the video push method of the present application
  • FIG. 3 is a schematic flow diagram of the second embodiment of the video push method of the present application.
  • Fig. 4 is a structural block diagram of the first embodiment of the video push device of the present application.
  • FIG. 1 is a schematic structural diagram of a video push device in a hardware operating environment involved in an embodiment of the present application.
  • the video push device may include: a processor 1001 , such as a central processing unit (Central Processing Unit, CPU), a communication bus 1002 , a user interface 1003 , a network interface 1004 , and a memory 1005 .
  • the communication bus 1002 is used to realize connection and communication between these components.
  • the user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface.
  • the network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a Wireless-Fidelity (Wireless-Fidelity, WI-FI) interface).
  • Memory 1005 can be a high-speed random access memory (Random Access Memory, RAM), can also be a stable non-volatile memory (Non-Volatile Memory, NVM), such as disk storage.
  • RAM Random Access Memory
  • NVM Non-Volatile Memory
  • the memory 1005 may also be a storage device independent of the aforementioned processor 1001 .
  • FIG. 1 does not constitute a limitation on the video push device, and may include more or less components than those shown in the figure, or combine some components, or arrange different components.
  • the memory 1005 as a storage medium may include an operating system, a network communication module, a user interface module, and a video push program.
  • the network interface 1004 is mainly used for data communication with the network server;
  • the user interface 1003 is mainly used for data interaction with the user;
  • the processor 1001 and the memory 1005 in the video push device of the present application can be Set in the video push device, the video push device invokes the video push program stored in the memory 1005 through the processor 1001, and executes the video push method provided in the embodiment of the present application.
  • FIG. 2 is a schematic flowchart of a first embodiment of the video push method of the present application.
  • the video push method includes the following steps:
  • Step S10 Obtain the audio and video information of the multimedia file played on the display interface.
  • the execution subject of this embodiment may be a computing service device with data processing, network communication and program running functions, such as a mobile phone, a tablet computer, a personal computer, etc., or an electronic device capable of realizing the above functions. device or video playback device.
  • a computing service device with data processing, network communication and program running functions such as a mobile phone, a tablet computer, a personal computer, etc.
  • an electronic device capable of realizing the above functions. device or video playback device such as a mobile phone, a tablet computer, a personal computer, etc.
  • the display interface may be a display interface of a device such as a TV or a mobile phone with a video browsing function.
  • the multimedia file may be a video file or an audio file being played by the display interface.
  • the audio and video information may be image information and/or audio information contained in the multimedia file.
  • the video playback device acquires the audio and video information of the multimedia file currently played on the display interface.
  • the step S10 includes: within the preset sampling period, intercepting the multimedia files played on the display interface according to the preset interception frequency Image information; and/or, within a preset sampling period, record the audio information of the multimedia file played on the display interface according to the preset recording frequency and recording duration.
  • the preset sampling period may be a preset sampling period. It can be 10 minutes or 30 minutes as a cycle.
  • the preset interception frequency may be a preset sampling interval.
  • the image information of the multimedia file played on the display interface is intercepted at 300 ms/time.
  • the preset recording frequency may be a preset recording interval.
  • the audio information of the multimedia file played by the display interface is recorded at 100 ms/time. It is also possible not to set the recording frequency, and to continue recording audio when sound is detected.
  • the recording duration may be the time for recording audio once. If no recording frequency is set, the audio recording will continue when sound is detected.
  • the recording duration at this time is the preset sampling period.
  • the aforementioned preset sampling period, preset interception frequency, and recording duration can all be adaptively set according to specific usage scenarios, which are not limited in this embodiment.
  • step S10 in order to make the pushed video more in line with the user's expectations and improve the user experience, before the step S10, it also includes: obtaining the historical video push category; determining the current video to be pushed according to the historical video push category, and displaying The video to be pushed.
  • the historical video push category may be the video push category corresponding to the scene recognition result when the user used the video playback device to watch the video last time.
  • the current scene recognition result has not been completed, and the current video push needs to be performed according to the video push category corresponding to the last scene recognition result.
  • video push category For example, when a user enters the video browsing page, yesterday’s video push category is used to screen and push videos each time. In a week, Tuesday uses Monday’s video push category to screen and push videos, and Wednesday adopts Tuesday’s video push category Screen and push videos.
  • a summary cycle can also be set, that is, every other summary cycle, the comprehensive video push category is summarized according to each video push category in this period, and the video is screened and pushed according to the comprehensive video push category.
  • Step S20 Perform scene recognition on the audio and video information by using a preset scene recognition model to obtain a scene recognition result.
  • the preset scene recognition model may be a scene recognition model obtained by pre-training a large amount of sample data, which can recognize the current playback scene according to the input audio and video information.
  • the scene recognition result includes: an image scene recognition result and/or an audio scene recognition result.
  • the step S20 includes: extracting the image features in the image information intercepted each time through the preset scene recognition model, and performing scene recognition on the extracted image features to obtain the image scene recognition result; and/or, extracting voiceprint features in the audio information through a preset scene recognition model, and performing scene recognition on the extracted voiceprint features to obtain an audio scene recognition result.
  • the image features may be characters, watermarks, logos, item information and other features in the image information.
  • the upper left corner of the acquired image information has a typical
  • basketball elements in the basketball logo picture so it can be determined that the current video type is basketball.
  • the voiceprint feature may be a voiceprint in recorded audio information.
  • the video playback device when the video playback device detects that the image information currently acquired is image information, it extracts the image features in the image information intercepted each time through the preset scene recognition model, and performs scene recognition on the extracted image features to obtain the image A scene recognition result, wherein, there may be one or more video types contained in the image scene recognition result.
  • the video playback device detects that the currently acquired audio information is, it extracts voiceprint features in the audio information through a preset scene recognition model, performs scene recognition on the extracted voiceprint features, and obtains an audio scene recognition result. There may also be one or more types of audio contained in the audio scene recognition result.
  • scene recognition is performed on the image information and audio information respectively to obtain a target scene recognition result.
  • the target scene recognition result includes a video scene and an audio scene.
  • the step of extracting the image features in the image information intercepted each time through the preset scene recognition model, and performing scene recognition on the extracted image features, and obtaining the image scene recognition result it also includes: according to the image The scene recognition result determines the image scene category corresponding to each image information; counts the number of video categories according to the image scene category; judges whether the number of video categories is less than a preset number; when the number of video categories is less than the preset number , adjusting the preset sampling period; according to the adjusted sampling period, intercepting the image information of the multimedia file played on the display interface according to the preset interception frequency.
  • the image scene category may be an image scene category corresponding to the image information recognized by the preset scene recognition model according to the image information.
  • the number of video categories may be the number of categories of videos played within a preset sampling period.
  • the preset quantity may be a self-defined quantity.
  • the preset sampling period is generally to increase the sampling period, so that the types of videos acquired within the sampling period are not less than the preset number.
  • the above-mentioned processing logic is also applicable to the audio scene recognition result.
  • the number of image scene categories and the number of audio scene categories can be determined according to the audio scene recognition results and the image scene recognition results, and whether the sum of the image scene category numbers and the audio scene category numbers is greater than the specified number can be judged. the preset quantity.
  • Step S30 Determine the video push category according to the scene recognition result, and perform video push according to the video push category.
  • determining the category of the video to push according to the scene recognition result may be determining the category of the video watched by the user according to the scene recognition result, and using the category of the video watched by the user as the category of the video to push.
  • Performing video push according to the video push category may be selecting videos belonging to the video push category from a large number of videos to push the video.
  • the step S30 includes determining the video category of the multimedia file and the corresponding playback duration of the video category according to the scene recognition result;
  • the playback duration determines the playback weight corresponding to the video category; determines the video push category according to the playback weight, and performs video push according to the video push category.
  • the playback weight corresponding to the video category can be determined according to the playback duration corresponding to the video category, for example, the sampling period is 30 minutes, wherein the playback duration of the live basketball game is 10 minutes, and the playback duration of the news is 8 minutes , entertainment videos are played for 2 minutes, lifestyle videos are played for 3 minutes, and piano music is played for 7 minutes.
  • the playback weights corresponding to the video categories are 10 for basketball games, 8 for news, 2 for entertainment, 3 for life, and 7 for piano music.
  • the recommended categories that users are more inclined to order from large to small are basketball games, news, piano music, life and entertainment. Determine the video push category according to the weight, that is, determine the video push category according to the playing time.
  • the top 3 types of playback volume As the type of video push, that is, push audio and video of basketball games, news, and piano music. It is also possible to select more video categories to push according to the user's usage time. Among them, when the video category with greater weight is pushed, the corresponding number of pushed videos is also larger. For example, the current video push category is For basketball games and news, the weight of basketball games is 10, and the weight of news is 7. If the number of videos to be pushed is 20, it can be 11 for basketball games and 3 for news, that is, the number of videos pushed The number of videos is related to the weight of this type of video.
  • This embodiment obtains the audio and video information of the multimedia file played on the display interface; performs scene recognition on the audio and video information through the preset scene recognition model to obtain the scene recognition result; determines the video push category according to the scene recognition result, and according to the set Push the video in the above video push category.
  • scene recognition is performed on audio and video information through a preset scene recognition model to obtain a scene recognition result; the video push category is determined according to the scene recognition result, and the video push is performed according to the video push category.
  • the above method of this embodiment can push videos that users are interested in, and improve user experience.
  • the product links used as drainage in the video content will not arouse the disgust of users. Instead, it can increase the success rate of users clicking and entering the background mall, better meet the user's usage habits and improve the success rate of drainage, while increasing operating income. , It can also reduce the complaint rate of users.
  • FIG. 3 is a schematic flowchart of a second embodiment of the video push method of the present application.
  • step S30 before the step S30, it also includes:
  • Step S201 Obtain the historical scene recognition result of the multimedia file.
  • the historical scene recognition result may be the recognition result of the scene recognition performed on the multimedia file by the preset scene recognition model. After acquiring the audio and video information of the multimedia file, it is necessary to perform multiple scene recognition on the audio and video information through a preset scene recognition model.
  • the video playback device performs multiple scene recognition on the audio and video information through the preset scene recognition model, and can customize the number of recognition times to avoid errors in single recognition, and obtain historical scene recognition results before video push .
  • Step S202 Determine whether the historical scene recognition result is consistent with the scene recognition result.
  • the historical scene recognition result is compared with the scene recognition result to determine whether the historical scene recognition result is consistent with the scene recognition result.
  • Step S203 If yes, count the number of times that the historical scene recognition result is consistent with the scene recognition result, and when the number of times reaches the preset number threshold, perform the determination of the video push category according to the scene recognition result, and according to Steps for performing video push by the video push category.
  • the preset number of times threshold may be a preset number of times. Only when the result of this number of consecutive historical scene recognition results is consistent with the scene recognition result, can it be judged that there is no misjudgment by the model, and then determine the category of the video push according to the scene recognition result, and push the video according to the video push category. For example, after the audio and video information of the multimedia file is acquired, scene recognition is performed on the audio and video information three times through a preset scene recognition model.
  • the set preset number of times threshold is 2 times.
  • the scene recognition result has a weight of 20 for news and a weight of 10 for basketball games. If the historical scene recognition result is that the first recognition result is news weight 20, basketball match weight 10; the second recognition result is news weight 20, basketball match weight 10.
  • the video push category can be determined according to the scene recognition result. If the first recognition result in the historical scene recognition results is news weight 10, basketball game weight 20. Then the number of times that the recognition results are consistent is 1 time, and if it is less than the set preset number of times threshold, the scene recognition is performed again. When the scene recognition result is news weight 20 and basketball game weight 10, the video push is determined according to the scene recognition result. category. If the second recognition result in the historical scene recognition results is news weight 10, basketball game weight 20. Then the number of times that the recognition results are consistent is 0, because the judgment condition here is that the continuous recognition results are consistent. At this time, at least two recognitions are required. The scene recognition result determines the video push category.
  • This embodiment obtains the historical scene recognition result of the multimedia file; judges whether the historical scene recognition result is consistent with the scene recognition result; if so, counts the number of times that the historical scene recognition result is consistent with the scene recognition result, When the number of times reaches the preset number of times threshold, the step of determining the video push category according to the scene recognition result and performing video push according to the video push category is executed.
  • audio and video information is identified multiple times, and it is judged whether the historical scene recognition result is consistent with the scene recognition result; As a result, the video push category is determined, and the video push is performed according to the video push category. It can reduce the misjudgment of the model and make the pushed video more in line with the user's expectations.
  • FIG. 4 is a structural block diagram of a first embodiment of a video push device of the present application.
  • the video push device proposed in the embodiment of the present application includes:
  • Obtaining module 10 is used for obtaining the audio-video information of the multimedia file that display interface plays;
  • the recognition module 20 is used to perform scene recognition on the audio and video information through a preset scene recognition model to obtain a scene recognition result;
  • the push module 30 is configured to determine a video push category according to the scene recognition result, and perform video push according to the video push category.
  • This embodiment obtains the audio and video information of the multimedia file played on the display interface; performs scene recognition on the audio and video information through the preset scene recognition model to obtain the scene recognition result; determines the video push category according to the scene recognition result, and according to the set Push the video in the above video push category.
  • scene recognition is performed on audio and video information through a preset scene recognition model to obtain a scene recognition result; the video push category is determined according to the scene recognition result, and the video push is performed according to the video push category.
  • the above method of this embodiment can push videos that users are interested in, and improve user experience.
  • the acquisition module 10 is further configured to intercept the image information of the multimedia file played on the display interface according to the preset interception frequency within the preset sampling period; and/or, within the preset sampling period, according to the preset interception frequency Preset recording frequency and recording duration to record audio information of multimedia files played on the display interface.
  • the acquisition module 10 is further configured to extract image features in the image information intercepted each time through a preset scene recognition model, and perform scene recognition on the extracted image features to obtain an image scene recognition result; and/or , extracting voiceprint features in the audio information by using a preset scene recognition model, and performing scene recognition on the extracted voiceprint features to obtain an audio scene recognition result.
  • the acquisition module 10 is also used to determine the image scene category corresponding to each image information according to the image scene recognition result; count the number of video categories according to the image scene category; judge whether the number of video categories is less than the preset Set the number; when the number of video categories is less than the preset number, adjust the preset sampling period; according to the adjusted sampling period, intercept the image information of the multimedia file played on the display interface according to the preset interception frequency.
  • the push module 30 is also used to obtain the historical scene recognition result of the multimedia file; judge whether the historical scene recognition result is consistent with the scene recognition result; if so, count the historical scene recognition result and the The number of times that the scene recognition results are consistent, when the number of times reaches the preset number threshold, execute the step of determining the video push category according to the scene recognition result, and performing video push according to the video push category.
  • the pushing module 30 is also used to determine the video category of the multimedia file and the corresponding playback duration of the video category according to the scene recognition result; determine the playback weight corresponding to the video category according to the playback duration ; Determine the video push category according to the playback weight, and perform video push according to the video push category.
  • the acquiring module 10 is also used to acquire historical video push categories; determine the current video to be pushed according to the historical video push categories, and display the to-be-pushed videos.
  • the embodiment of the present application also proposes a storage medium, on which a video push program is stored, and when the video push program is executed by a processor, the steps of the video push method as described above are implemented.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

本申请属于多媒体领域,公开了一种视频推送方法、设备及存储介质。该方法包括:获取显示界面播放的多媒体文件的音视频信息;通过预设场景识别模型对音视频信息进行场景识别,获得场景识别结果;根据场景识别结果确定视频推送类别,并根据视频推送类别进行视频推送。

Description

视频推送方法、设备及存储介质
本申请要求于2021年10月11日申请的、申请号为202111184229.1的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及多媒体技术领域,尤其涉及一种视频推送方法、装置、设备及存储介质。
背景技术
现有的短视频播放深受用户喜爱,但是短视频使用用户的时间一般比较碎片化,当用户进入视频显示页面时,当前推送的短视频内容是随机的,是从后台短视频引擎随机推送的短视频,因为当前的投递多半是盲投,造成投递的短视频与用户的感兴趣的内容和产品并不一致,造成了投递效果未达预期甚至引起用户的投诉的问题。
上述内容仅用于辅助理解本申请的技术方案,并不代表承认上述内容是现有技术。
技术问题
本申请的主要目的在于提供了一种视频推送方法、装置、设备及存储介质,旨在解决现有技术中用户观看视频时推送的视频内容是随机的,不能满足用户的观看需要的技术问题。
技术解决方案
为实现上述目的,本申请提供了一种视频推送方法,所述方法包括以下步骤:
获取显示界面播放的多媒体文件的音视频信息;
通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果;
根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。
在一实施例中,所述音视频信息包括图像信息和/或音频信息:
所述获取显示界面播放的多媒体文件的音视频信息的步骤,包括:
在预设采样周期内,按预设截取频率截取显示界面播放的多媒体文件的图像信息;
和/或,在预设采样周期内,按预设录制频率和录制时长录制显示界面播放的多媒体文件的音频信息。
在一实施例中,所述场景识别结果包括:图像场景识别结果和/或音频场景识别结果;
所述通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果的步骤,包括:
通过预设场景识别模型提取每次截取到的图像信息中的图像特征,并对提取的图像特征进行场景识别,获得图像场景识别结果;
和/或,通过预设场景识别模型提取所述音频信息中的声纹特征,并对提取的声纹特征进行场景识别,获得音频场景识别结果。
在一实施例中,所述通过预设场景识别模型提取每次截取到的图像信息中的图像特征,并对提取的图像特征进行场景识别,获得图像场景识别结果的步骤之后,还包括:
根据所述图像场景识别结果确定各图像信息所对应的图像场景类别;
根据所述图像场景类别统计视频类别数量;
判断所述视频类别数量是否小于预设数量;
在所述视频类别数量小于所述预设数量时,对所述预设采样周期进行调整;
根据调整后的采样周期,按预设截取频率截取显示界面播放的多媒体文件的图像信息。
在一实施例中,所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤之前,还包括:
获取所述多媒体文件的历史场景识别结果;
判断所述历史场景识别结果是否与所述场景识别结果一致;
若是,则统计所述历史场景识别结果与所述场景识别结果一致的次数,在所述次数达到预设次数阈值时,执行所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤。
在一实施例中,所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤,包括:
根据所述场景识别结果确定所述多媒体文件的视频类别和所述视频类别对应的播放时长;
根据所述播放时长确定所述视频类别对应的播放权重;
根据所述播放权重确定视频推送类别,并根据所述视频推送类别进行视频推送。
在一实施例中,所述获取显示界面播放的多媒体文件的音视频信息的步骤之前,包括:
获取历史视频推送类别;
根据所述历史视频推送类别确定当前的待推送视频,并展示所述待推送视频。
此外,为实现上述目的,本申请还提供一种视频推送装置,所述装置包括:
获取模块,用于获取显示界面播放的多媒体文件的音视频信息;
识别模块,用于通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果;
推送模块,用于根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。
此外,为实现上述目的,本申请还提出一种视频推送设备,所述设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的视频推送程序,所述视频推送程序配置为实现如上文所述的视频推送方法的步骤。
此外,为实现上述目的,本申请还提出一种存储介质,所述存储介质上存储有视频推送程序,所述视频推送程序被处理器执行时实现如上文所述的视频推送方法的步骤。
有益效果
本申请获取显示界面播放的多媒体文件的音视频信息;通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果;根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。由于本申请是通过预设场景识别模型对音视频信息进行场景识别,获得场景识别结果;根据场景识别结果确定视频推送类别,并根据视频推送类别进行视频推送。相对于现有的随机为用户展示视频的方式,本申请上述方式能够推送用户感兴趣的视频,提升用户体验感。且视频内容中做为引流的商品链接也不会引起用户的反感,反而可以提升用户点击并进入后台商城的成功率,更好满足用户使用习惯的同时提升引流的成功率,提升运营收入的同时,还可降低用户使用的投诉率。
附图说明
图1是本申请实施例方案涉及的硬件运行环境的视频推送设备的结构示意图;
图2为本申请视频推送方法第一实施例的流程示意图;
图3为本申请视频推送方法第二实施例的流程示意图;
图4为本申请视频推送装置第一实施例的结构框图。
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。
本发明的实施方式
应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。
参照图1,图1为本申请实施例方案涉及的硬件运行环境的视频推送设备结构示意图。
如图1所示,该视频推送设备可以包括:处理器1001,例如中央处理器(Central Processing Unit,CPU),通信总线1002、用户接口1003,网络接口1004,存储器1005。其中,通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),可选用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如无线保真(Wireless-Fidelity,WI-FI)接口)。存储器1005可以是高速的随机存取存储器(Random Access Memory,RAM),也可以是稳定的非易失性存储器(Non-Volatile Memory,NVM),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储装置。
本领域技术人员可以理解,图1中示出的结构并不构成对视频推送设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。
如图1所示,作为一种存储介质的存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及视频推送程序。
在图1所示的视频推送设备中,网络接口1004主要用于与网络服务器进行数据通信;用户接口1003主要用于与用户进行数据交互;本申请视频推送设备中的处理器1001、存储器1005可以设置在视频推送设备中,所述视频推送设备通过处理器1001调用存储器1005中存储的视频推送程序,并执行本申请实施例提供的视频推送方法。
基于上述视频推送设备,本申请实施例提供了一种视频推送方法,参照图2,图2为本申请视频推送方法第一实施例的流程示意图。
本实施例中,所述视频推送方法包括以下步骤:
步骤S10:获取显示界面播放的多媒体文件的音视频信息。
需要说明的是,本实施例的执行主体可以是一种具有数据处理、网络通信以及程序运行功能的计算服务设备,例如手机、平板电脑、个人电脑等,或者是一种能够实现上述功能的电子设备或视频播放设备。以下以所述视频播放设备为例,对本实施例及下述各实施例进行说明。
需要说明的是,所述显示界面可以是具有视频浏览功能的电视或手机等设备的显示界面。所述多媒体文件可以是所述显示界面正在播放的视频文件或音频文件。所述音视频信息可以是所述多媒体文件中包含的图像信息和/或音频信息。
在具体实施中,视频播放设备获取显示界面当前播放的多媒体文件的音视频信息。
进一步的,为了避免因为没有权限等问题,导致无法直接从后台获取用户观看的音视频信息,所述步骤S10包括:在预设采样周期内,按预设截取频率截取显示界面播放的多媒体文件的图像信息;和/或,在预设采样周期内,按预设录制频率和录制时长录制显示界面播放的多媒体文件的音频信息。
需要说明的是,所述预设采样周期可以是预先设置的采样周期。可以是以10分钟或30分钟等时长作为一个周期。所述预设截取频率可以是预先设置的采样间隔时长。例如,以300ms/次进行截取显示界面播放的多媒体文件的图像信息。所述预设录制频率可以是预先设置的录制间隔时长。例如,以100ms/次进行录制显示界面播放的多媒体文件的音频信息。也可以不设置录制频率,在检测到有声音时,持续进行音频的录制。所述录制时长可以是一次录制音频的时间,若没有设置录制频率,则检测到有声音时,持续进行音频的录制,此时的录制时长为所述预设采样周期。上述预设采样周期、预设截取频率和录制时长均可以根据具体的使用场景自适应设置,本实施例在此不加以限制。
进一步的,为了使推送的视频更加符合用户的期望,提升用户体验感,所述步骤S10之前,还包括:获取历史视频推送类别;根据所述历史视频推送类别确定当前的待推送视频,并展示所述待推送视频。
需要说明的是,所述历史视频推送类别可以是上一次用户使用视频播放设备观看视频时场景识别结果对应的视频推送类别。
应理解的是,在用户刚进入视频浏览页面时,本次的场景识别结果还未完成,需要根据上一次场景识别结果对应的视频推送类别进行当前的视频推送。例如,用户进入视频浏览页面时,每次均采用昨天的视频推送类别进行视频的筛选和推送,在一周中,周二采用周一的视频推送类别进行视频的筛选和推送,周三采用周二的视频推送类别进行视频的筛选和推送。也可以设置一个总结周期,既每隔一个总结周期,则根据该周期内的每次视频推送类别总结综合视频推送类别,根据综合视频推送类别进行视频的筛选和推送。
步骤S20:通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果。
需要说明的是,所述预设场景识别模型可以是预先通过大量的样本数据进行训练得到的场景识别模型,其可以根据输入的音视频信息识别出当前的播放场景。所述场景识别结果包括:图像场景识别结果和/或音频场景识别结果。
进一步的,为了使识别出的场景更加准确,所述步骤S20包括:通过预设场景识别模型提取每次截取到的图像信息中的图像特征,并对提取的图像特征进行场景识别,获得图像场景识别结果;和/或,通过预设场景识别模型提取所述音频信息中的声纹特征,并对提取的声纹特征进行场景识别,获得音频场景识别结果。
需要说明的是,所述图像特征可以是所述图像信息中的文字、水印、logo、物品信息等特征,例如,在用户收看篮球类的赛事直播时,获取的图像信息中的左上角有典型的篮球logo图片,图像中也存在篮球因素,因此,可以判定当前的视频类型为篮球类。所述声纹特征可以是录制的音频信息中的声纹。例如,钢琴曲的声纹片段中会有特殊的声纹信息支持预设场景识别模型进行场景判断。
在具体实施中,视频播放设备在检测到当前获取的为图像信息时,通过预设场景识别模型提取每次截取到的图像信息中的图像特征,并对提取的图像特征进行场景识别,获得图像场景识别结果,其中,所述图像场景识别结果中所包含的视频类型可能有一种或多种。视频播放设备在检测到当前获取的为音频信息时,通过预设场景识别模型提取所述音频信息中的声纹特征,并对提取的声纹特征进行场景识别,获得音频场景识别结果。所述音频场景识别结果中所包含的音频类型也可能有一种或多种。在所述视频播放设备检测到当前既有图像信息也有音频信息,对所述图像信息和音频信息分别进行场景识别,获得目标场景识别结果。所述目标场景识别结果中包含有视频场景和音频场景。
进一步的,所述通过预设场景识别模型提取每次截取到的图像信息中的图像特征,并对提取的图像特征进行场景识别,获得图像场景识别结果的步骤之后,还包括:根据所述图像场景识别结果确定各图像信息所对应的图像场景类别;根据所述图像场景类别统计视频类别数量;判断所述视频类别数量是否小于预设数量;在所述视频类别数量小于所述预设数量时,对所述预设采样周期进行调整;根据调整后的采样周期,按预设截取频率截取显示界面播放的多媒体文件的图像信息。
需要说明的是,所述图像场景类别可以是预设场景识别模型根据图像信息识别出来的图像信息对应的图像场景类别。所述视频类别数量可以是在预设采样周期内,播放过的视频的类别个数。所述预设数量可以是自定义的数量。
应理解的是,为了避免某些特殊情况下图像场景识别结果中只包含一种类别的视频导致推送的视频类别单一,在视频类别数量小于所述预设数量时,对所述预设采样周期进行调整,一般为增加采样周期,使得在采样周期内获取的视频的类别不少于预设数量。相应的,对于音频场景识别结果也同样适用上述的处理逻辑,在识别出来的音频类型数量小于预设数量时,增加采样的时长,使得在采样周期内获取的音频的类别数量不少于预设数量。当音视频信息包括图像信息和音频信息时,可以根据音频场景识别结果和图像场景识别结果确定图像场景类别数量和音频场景类别数量,根据图像场景类别数量和音频场景类别数量之和判断是否大于所述预设数量。
步骤S30:根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。
需要说明的是,根据所述场景识别结果确定视频推送类别可以是根据所述场景识别结果确定用户观看的视频的类别,将用户观看的视频的类别作为所述视频推送类别。根据所述视频推送类别进行视频推送可以是从海量的视频中筛选出属于该视频推送类别的视频进行视频的推送。
进一步的,为了使推送的视频更符合用户的期望,提升用户体验感,所述步骤S30,包括根据所述场景识别结果确定所述多媒体文件的视频类别和所述视频类别对应的播放时长;根据所述播放时长确定所述视频类别对应的播放权重;根据所述播放权重确定视频推送类别,并根据所述视频推送类别进行视频推送。
需要说明的是,所述视频类别对应的播放权重可以根据视频类别对应的播放时长确定,例如,采样周期为30分钟,其中,篮球赛事直播的播放时长为10分钟,新闻的播放时长为8分钟,娱乐类视频播放时长为2分钟,生活类为3分钟,钢琴曲的播放时长为7分钟。则视频类别对应的播放权重分别为篮球赛事为10,新闻为8,娱乐类为2,生活类为3,钢琴曲为7。根据播放权重的排序可知,用户更倾向的推荐类别从大到小排序依次为篮球赛事、新闻、钢琴曲、生活类和娱乐类。根据权重确定视频推送类别即根据播放的时长确定视频推送类别,可以将播放量排名前3的类型作为视频推送的类型,即推送篮球赛事、新闻、钢琴曲类的音视频。也可以是根据占用用户的使用时长选取更多的视频类别进行推送,其中,权重更大的视频类别在进行推送时,对应的推送的视频的数量也更多,例如,当前的视频推送类别为篮球赛事和新闻,篮球赛事的权重为10,新闻的权重为7,若推送的视频数量为20个,可以是推送篮球赛事类的为11个,新闻类的为3个,即推送的该类视频的数量与该类视频的权重相关。
本实施例获取显示界面播放的多媒体文件的音视频信息;通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果;根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。由于本实施例是通过预设场景识别模型对音视频信息进行场景识别,获得场景识别结果;根据场景识别结果确定视频推送类别,并根据视频推送类别进行视频推送。相对于现有的随机为用户展示视频的方式,本实施例上述方式能够推送用户感兴趣的视频,提升用户体验感。且视频内容中做为引流的商品链接也不会引起用户的反感,反而可以提升用户点击并进入后台商城的成功率,更好满足用户使用习惯的同时提升引流的成功率,提升运营收入的同时,还可降低用户使用的投诉率。
参考图3,图3为本申请视频推送方法第二实施例的流程示意图。
基于上述第一实施例,在本实施例中,所述步骤S30之前,还包括:
步骤S201:获取所述多媒体文件的历史场景识别结果。
需要说明的是,所述历史场景识别结果可以是之前预设场景识别模型对多媒体文件进行场景识别的识别结果。在获取到多媒体文件的音视频信息后,需要通过预设场景识别模型对音视频信息进行多次的场景识别。
在具体实施中,视频播放设备通过预设场景识别模型对音视频信息进行多次的场景识别,可以自定义识别次数,避免单次的识别出现误差,在进行视频推送之前,获取历史场景识别结果。
步骤S202:判断所述历史场景识别结果是否与所述场景识别结果一致。
应理解的是,为了避免场景识别结果出现模型的误判,将历史场景识别结果与所述场景识别结果进行对比,判断所述历史场景识别结果是否与所述场景识别结果一致。
步骤S203:若是,则统计所述历史场景识别结果与所述场景识别结果一致的次数,在所述次数达到预设次数阈值时,执行所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤。
需要说明的是,所述预设次数阈值可以是预先设置的次数。只有在历史场景识别结果中连续该次数的结果与场景识别结果一致,才判定模型不存在误判,进而根据场景识别结果确定视频推送的类别,并根据所述视频推送类别进行视频推送。例如,在获取到多媒体文件的音视频信息后,通过预设场景识别模型对音视频信息进行3次的场景识别。设定的预设次数阈值为2次。所述场景识别结果为新闻权重20,篮球赛事权重10。若历史场景识别结果为第一次识别结果为新闻权重20,篮球赛事权重10;第二次识别结果为新闻权重20,篮球赛事权重10。则可以判定历史场景识别结果与所述场景识别结果一致的次数为2次,而设定的预设次数阈值为2次,则可以根据所述场景识别结果确定视频推送类别。若历史场景识别结果中第一次识别结果为新闻权重10,篮球赛事权重20。则识别结果一致的次数为1次,小于设定的预设次数阈值,则再进行一次场景识别,在场景识别结果为新闻权重20,篮球赛事权重10时,根据所述场景识别结果确定视频推送类别。若历史场景识别结果中第二次识别结果为新闻权重10,篮球赛事权重20。则识别结果一致的次数为0次,因为此处判定条件为连续识别结果一致,此时,最少需要再进行2次识别,在2次识别结果均与所述场景识别结果一致时,根据所述场景识别结果确定视频推送类别。
本实施例获取所述多媒体文件的历史场景识别结果;判断所述历史场景识别结果是否与所述场景识别结果一致;若是,则统计所述历史场景识别结果与所述场景识别结果一致的次数,在所述次数达到预设次数阈值时,执行所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤。本实施例通过对音视频信息进行多次识别,并判断历史场景识别结果是否与场景识别结果一致;在历史场景识别结果与场景识别结果一致的次数达到预设次数阈值时,根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。可以减少模型的误判,使得推送的视频更加符合用户的期望。
参照图4,图4为本申请视频推送装置第一实施例的结构框图。
如图4所示,本申请实施例提出的视频推送装置包括:
获取模块10,用于获取显示界面播放的多媒体文件的音视频信息;
识别模块20,用于通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果;
推送模块30,用于根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。
本实施例获取显示界面播放的多媒体文件的音视频信息;通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果;根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。由于本实施例是通过预设场景识别模型对音视频信息进行场景识别,获得场景识别结果;根据场景识别结果确定视频推送类别,并根据视频推送类别进行视频推送。相对于现有的随机为用户展示视频的方式,本实施例上述方式能够推送用户感兴趣的视频,提升用户体验感。
需要说明的是,以上所描述的工作流程仅仅是示意性的,并不对本申请的保护范围构成限定,在实际应用中,本领域的技术人员可以根据实际的需要选择其中的部分或者全部来实现本实施例方案的目的,此处不做限制。
另外,未在本实施例中详尽描述的技术细节,可参见本申请任意实施例所提供的参数运行方法,此处不再赘述。
基于本申请上述视频推送装置第一实施例,提出本申请视频推送装置的第二实施例。
在本实施例中,所述获取模块10,还用于在预设采样周期内,按预设截取频率截取显示界面播放的多媒体文件的图像信息;和/或,在预设采样周期内,按预设录制频率和录制时长录制显示界面播放的多媒体文件的音频信息。
进一步的,所述获取模块10,还用于通过预设场景识别模型提取每次截取到的图像信息中的图像特征,并对提取的图像特征进行场景识别,获得图像场景识别结果;和/或,通过预设场景识别模型提取所述音频信息中的声纹特征,并对提取的声纹特征进行场景识别,获得音频场景识别结果。
进一步的,所述获取模块10,还用于根据所述图像场景识别结果确定各图像信息所对应的图像场景类别;根据所述图像场景类别统计视频类别数量;判断所述视频类别数量是否小于预设数量;在所述视频类别数量小于所述预设数量时,对所述预设采样周期进行调整;根据调整后的采样周期,按预设截取频率截取显示界面播放的多媒体文件的图像信息。
进一步的,所述推送模块30,还用于获取所述多媒体文件的历史场景识别结果;判断所述历史场景识别结果是否与所述场景识别结果一致;若是,则统计所述历史场景识别结果与所述场景识别结果一致的次数,在所述次数达到预设次数阈值时,执行所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤。
进一步的,所述推送模块30,还用于根据所述场景识别结果确定所述多媒体文件的视频类别和所述视频类别对应的播放时长;根据所述播放时长确定所述视频类别对应的播放权重;根据所述播放权重确定视频推送类别,并根据所述视频推送类别进行视频推送。
进一步的,所述获取模块10,还用于获取历史视频推送类别;根据所述历史视频推送类别确定当前的待推送视频,并展示所述待推送视频。
本申请视频推送装置的其他实施例或具体实现方式可参照上述各方法实施例,此处不再赘述。
此外,本申请实施例还提出一种存储介质,所述存储介质上存储有视频推送程序,所述视频推送程序被处理器执行时实现如上文所述的视频推送方法的步骤。
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如只读存储器/随机存取存储器、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本申请各个实施例所述的方法。
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。

Claims (15)

  1. 一种视频推送方法,其中,所述视频推送方法包括以下步骤:
    获取显示界面播放的多媒体文件的音视频信息;
    通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果;以及
    根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送。
  2. 如权利要求1所述的视频推送方法,其中,所述音视频信息包括图像信息;
    所述获取显示界面播放的多媒体文件的音视频信息的步骤,包括:
    在预设采样周期内,按预设截取频率截取显示界面播放的多媒体文件的图像信息。
  3. 如权利要求1所述的视频推送方法,其中,所述音视频信息包括音频信息;
    所述获取显示界面播放的多媒体文件的音视频信息的步骤,包括:
    在预设采样周期内,按预设录制频率和录制时长录制显示界面播放的多媒体文件的音频信息。
  4. 如权利要求1所述的视频推送方法,其中,所述场景识别结果包括图像场景识别结果;
    所述通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果的步骤,包括:
    通过预设场景识别模型提取每次截取到的图像信息中的图像特征,并对提取的图像特征进行场景识别,获得图像场景识别结果。
  5. 如权利要求1所述的视频推送方法,其中,所述场景识别结果包括音频场景识别结果;
    所述通过预设场景识别模型对所述音视频信息进行场景识别,获得场景识别结果的步骤,包括:
    通过预设场景识别模型提取所述音频信息中的声纹特征,并对提取的声纹特征进行场景识别,获得音频场景识别结果。
  6. 如权利要求4所述的视频推送方法,其中,所述通过预设场景识别模型提取每次截取到的图像信息中的图像特征,并对提取的图像特征进行场景识别,获得图像场景识别结果的步骤之后,还包括:
    根据所述图像场景识别结果确定各图像信息所对应的图像场景类别;
    根据所述图像场景类别统计视频类别数量;
    判断所述视频类别数量是否小于预设数量;
    在所述视频类别数量小于所述预设数量时,对所述预设采样周期进行调整;以及
    根据调整后的采样周期,按预设截取频率截取显示界面播放的多媒体文件的图像信息。
  7. 如权利要求1所述的视频推送方法,其中,所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤之前,还包括:
    获取所述多媒体文件的历史场景识别结果;
    判断所述历史场景识别结果是否与所述场景识别结果一致;以及
    若所述历史场景识别结果与所述场景识别结果一致,则统计所述历史场景识别结果与所述场景识别结果一致的次数,在所述次数达到预设次数阈值时,执行所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤。
  8. 如权利要求1所述的视频推送方法,其中,所述根据所述场景识别结果确定视频推送类别,并根据所述视频推送类别进行视频推送的步骤,包括:
    根据所述场景识别结果确定所述多媒体文件的视频类别和所述视频类别对应的播放时长;
    根据所述播放时长确定所述视频类别对应的播放权重;以及
    根据所述播放权重确定视频推送类别,并根据所述视频推送类别进行视频推送。
  9. 如权利要求1-8任一项所述的视频推送方法,其中,所述获取显示界面播放的多媒体文件的音视频信息的步骤之前,包括:
    获取历史视频推送类别;以及
    根据所述历史视频推送类别确定当前的待推送视频,并展示所述待推送视频。
  10. 如权利要求1所述的视频推送方法,其中,所述预设场景识别模型预先通过大量的样本数据进行训练得到的场景识别模型,根据输入的音视频信息识别出当前的播放场景。
  11. 如权利要求4所述的视频推送方法,其中,所述图像特征为所述图像信息中的文字、水印、logo或物品信息特征。
  12. 如权利要求4所述的视频推送方法,其中,所述声纹特征为录制的音频信息中的声纹。
  13. 如权利要求1所述的视频推送方法,其中,所述视频类别对应的播放权重根据视频类别对应的播放时长确定。
  14. 一种视频推送设备,其中,所述设备包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的视频推送程序,所述视频推送程序配置为实现如权利要求1至13中任一项所述的视频推送方法的步骤。
  15. 一种存储介质,其中,所述存储介质上存储有视频推送程序,所述视频推送程序被处理器执行时实现如权利要求1至13中任一项所述的视频推送方法的步骤。
PCT/CN2021/139233 2021-10-11 2021-12-17 视频推送方法、设备及存储介质 WO2023060759A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111184229.1 2021-10-11
CN202111184229.1A CN113923523B (zh) 2021-10-11 2021-10-11 视频推送方法、装置、设备及存储介质

Publications (1)

Publication Number Publication Date
WO2023060759A1 true WO2023060759A1 (zh) 2023-04-20

Family

ID=79239335

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/139233 WO2023060759A1 (zh) 2021-10-11 2021-12-17 视频推送方法、设备及存储介质

Country Status (2)

Country Link
CN (1) CN113923523B (zh)
WO (1) WO2023060759A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117478937A (zh) * 2023-12-01 2024-01-30 陕西伟辰科技有限公司 一种基于推送信息的处理方法及信息推送平台
CN117478937B (zh) * 2023-12-01 2024-06-11 陕西伟辰科技有限公司 一种基于推送信息的处理方法及信息推送平台

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003085207A (ja) * 2001-09-07 2003-03-20 Nippon Telegr & Teleph Corp <Ntt> 映像情報レコメンドシステム、方法及び装置、並びに、映像情報レコメンドプログラム及びプログラムの記録媒体
CN103501449A (zh) * 2013-10-08 2014-01-08 十分(北京)信息科技有限公司 与电视节目关联的视频源推荐方法及推荐装置
CN104079997A (zh) * 2014-07-10 2014-10-01 东莞中山大学研究院 一种数字电视个性化节目的推荐系统及其方法
CN105227973A (zh) * 2014-06-27 2016-01-06 中兴通讯股份有限公司 基于场景识别的信息推荐方法及装置
CN111416995A (zh) * 2020-03-25 2020-07-14 深圳创维-Rgb电子有限公司 一种基于场景识别的内容推送方法、系统及智能终端
CN112330371A (zh) * 2020-11-26 2021-02-05 深圳创维-Rgb电子有限公司 基于ai的智能广告推送方法及装置、系统及存储介质

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6223713B2 (ja) * 2013-05-27 2017-11-01 株式会社東芝 電子機器、方法及びプログラム
JP6579558B2 (ja) * 2018-11-14 2019-09-25 みこらった株式会社 スポーツ競技ライブ観戦システムの観戦者端末及び観戦者端末用プログラム
CN111405369A (zh) * 2019-11-29 2020-07-10 深圳市赛亿科技开发有限公司 智能电视及其控制方法、计算机可读存储介质
CN111417024A (zh) * 2020-03-30 2020-07-14 深圳创维-Rgb电子有限公司 一种基于场景识别的节目推荐方法、系统及存储介质
CN111930974A (zh) * 2020-08-10 2020-11-13 北京字节跳动网络技术有限公司 一种音视频类型的推荐方法、装置、设备及存储介质

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003085207A (ja) * 2001-09-07 2003-03-20 Nippon Telegr & Teleph Corp <Ntt> 映像情報レコメンドシステム、方法及び装置、並びに、映像情報レコメンドプログラム及びプログラムの記録媒体
CN103501449A (zh) * 2013-10-08 2014-01-08 十分(北京)信息科技有限公司 与电视节目关联的视频源推荐方法及推荐装置
CN105227973A (zh) * 2014-06-27 2016-01-06 中兴通讯股份有限公司 基于场景识别的信息推荐方法及装置
CN104079997A (zh) * 2014-07-10 2014-10-01 东莞中山大学研究院 一种数字电视个性化节目的推荐系统及其方法
CN111416995A (zh) * 2020-03-25 2020-07-14 深圳创维-Rgb电子有限公司 一种基于场景识别的内容推送方法、系统及智能终端
CN112330371A (zh) * 2020-11-26 2021-02-05 深圳创维-Rgb电子有限公司 基于ai的智能广告推送方法及装置、系统及存储介质

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117478937A (zh) * 2023-12-01 2024-01-30 陕西伟辰科技有限公司 一种基于推送信息的处理方法及信息推送平台
CN117478937B (zh) * 2023-12-01 2024-06-11 陕西伟辰科技有限公司 一种基于推送信息的处理方法及信息推送平台

Also Published As

Publication number Publication date
CN113923523B (zh) 2023-03-24
CN113923523A (zh) 2022-01-11

Similar Documents

Publication Publication Date Title
US8732275B2 (en) Methods and systems for delivering a personalized version of an executable application to a secondary access device associated with a user
US20070124679A1 (en) Video summary service apparatus and method of operating the apparatus
CN113473189B (zh) 用于在内容列表中提供内容的系统和方法
US11025967B2 (en) Method for inserting information push into live video streaming, server, and terminal
US20160316233A1 (en) System and method for inserting, delivering and tracking advertisements in a media program
US10313713B2 (en) Methods, systems, and media for identifying and presenting users with multi-lingual media content items
CN106488311B (zh) 音效调整方法及用户终端
WO2015070761A1 (zh) 智能电视媒体播放器及其字幕处理方法、智能电视
JP2014086087A (ja) メディアストリームに広告を挿入するための方法及びシステム
CN112753227A (zh) 用于在体育事件电视节目中检测人群噪声的发生的音频处理
CN109474562B (zh) 标识的显示方法和装置、请求的响应方法和装置
US11388561B2 (en) Providing a summary of media content to a communication device
WO2018059333A1 (zh) 一种媒体信息处理方法、系统、电子设备及存储介质
CN107659831A (zh) 媒体数据处理方法、客户端、及存储介质
CN111444415A (zh) 弹幕处理方法、服务器、客户端、电子设备及存储介质
TW201225669A (en) System and method for synchronizing with multimedia broadcast program and computer program product thereof
US20090019504A1 (en) Method for Managing Multimedia Data and System for Operating The Same
CN111698261A (zh) 基于流媒体的视频播放方法、装置、设备及存储介质
WO2023060759A1 (zh) 视频推送方法、设备及存储介质
CN111741333B (zh) 直播数据获取方法、装置、计算机设备及存储介质
CN109194971A (zh) 一种为多媒体文件的生成方法及装置
US8612313B2 (en) Metadata subscription systems and methods
WO2021047181A1 (zh) 基于视频类型的播放控制实现方法、装置及计算机设备
KR20180069626A (ko) 스트리밍 데이터의 재생 통계 정보 산출 방법 및 이를 위한 장치
TWI528807B (zh) Scene scheduling system, method and its recording medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21960483

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2021960483

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2021960483

Country of ref document: EP

Effective date: 20240403