WO2021135334A1 - Method and apparatus for processing live streaming content, and system - Google Patents

Method and apparatus for processing live streaming content, and system Download PDF

Info

Publication number
WO2021135334A1
WO2021135334A1 PCT/CN2020/112856 CN2020112856W WO2021135334A1 WO 2021135334 A1 WO2021135334 A1 WO 2021135334A1 CN 2020112856 W CN2020112856 W CN 2020112856W WO 2021135334 A1 WO2021135334 A1 WO 2021135334A1
Authority
WO
WIPO (PCT)
Prior art keywords
video stream
live
original video
additional information
live broadcast
Prior art date
Application number
PCT/CN2020/112856
Other languages
French (fr)
Chinese (zh)
Inventor
王云
Original Assignee
广州华多网络科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广州华多网络科技有限公司 filed Critical 广州华多网络科技有限公司
Publication of WO2021135334A1 publication Critical patent/WO2021135334A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/231Content storage operation, e.g. caching movies for short term storage, replicating data over plural servers, prioritizing data for deletion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/438Interfacing the downstream path of the transmission network originating from a server, e.g. retrieving MPEG packets from an IP network
    • H04N21/4383Accessing a communication channel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data
    • H04N21/44218Detecting physical presence or behaviour of the user, e.g. using sensors to detect if the user is leaving the room or changes his face expression during a TV program

Definitions

  • This application relates to the field of live broadcast, and in particular to a method, device, and system for processing live broadcast content.
  • short videos are appearing and integrated into people’s lives.
  • Using short videos on various short video platforms has become one of the ways for people to entertain and spend their lives.
  • Some short video content is some wonderful moments recorded by the anchor during the live broadcast.
  • Wonderful moments often require the live broadcast platform to filter the live broadcast content of the host to find the brilliant moments during the live broadcast of the high-quality host, and then publish them to a third-party short video platform.
  • the live broadcast pictures of the host on the live broadcast platform often include not only the original picture of the host obtained through the host’s camera, but also some advertisements such as stickers and subtitles added by the host himself or the copyright mark of the live broadcast platform.
  • These advertisements and The logo must appear in the live broadcast screen of the host on the live broadcast platform, but cannot appear in the short video screen of the wonderful moments published on the short video platform of the third party.
  • the short video generated on the live screen of the selected host is posted to the third party In the short video platform, the generated short video pictures usually carry these advertisements and logos.
  • the short video pictures of the wonderful moments posted on the third-party short video platform also include advertisements and logos.
  • the present application provides a method, device, live broadcast system, equipment, and computer-readable storage medium for processing live broadcast content.
  • a method for processing live content including:
  • the specified image frame of the original video stream is saved based on the evaluation index of the user behavior.
  • a method for processing live content including:
  • the user behavior evaluation index saves the designated image frames of the original video stream, where the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel.
  • a live broadcast system includes an anchor client, a server, and an audience client;
  • the anchor client is used to obtain the original video stream
  • the server is configured to synthesize the original video stream and the additional information of the live broadcast into a live video stream, and send the live video stream to the viewer client;
  • the audience client is used to receive and display the live video stream.
  • an apparatus for processing live content including:
  • the receiving module is used to receive the original video stream sent by the host client through the video channel and the live broadcast additional information sent through the signaling channel;
  • a synthesis module for synthesizing the original video stream and the live broadcast additional information into a live video stream according to the live broadcast additional information
  • the transmission module is used to send the live video stream to the audience client;
  • the saving module is configured to save the designated image frame of the original video stream based on the evaluation index of the user behavior.
  • an apparatus for processing live content including:
  • the acquisition module is used to acquire the original video stream
  • the sending module is configured to send the original video stream and live additional information to the server respectively, so that the server synthesizes the original video stream and the live additional information into the live video stream according to the live additional information signaling to the viewer client
  • the live video stream is sent, and the designated image frame of the original video stream is saved based on the evaluation index of the user behavior, wherein the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel.
  • the host client of this application sends the original video stream and live additional information to the server through different transmission channels, and the server completes the action of synthesizing the live additional information and the original video stream into the live video stream instead of pushing the stream by the host client
  • this application can not only allow the viewer client to watch the live video stream imperceptibly, but also share the original video stream to other video platforms in a timely and convenient manner.
  • Fig. 1a is a schematic diagram of a picture of an original video stream collected in a live broadcast scenario according to an exemplary embodiment of the present application
  • Fig. 1b is a schematic diagram of a synthesized live broadcast picture according to an exemplary embodiment of the present application
  • Fig. 2 is a flowchart of a method for processing live content shown in an exemplary embodiment of the present application
  • Fig. 3 is a flowchart of another method for processing live content shown in an exemplary embodiment of the present application.
  • Figure 3a is an application example in an application scenario of this application.
  • Fig. 4 is a schematic diagram of an apparatus for processing live content shown in an exemplary embodiment of the present application
  • Fig. 5 is a schematic diagram of another device for processing live content shown in an exemplary embodiment of the present application.
  • Fig. 6 is a schematic diagram of an electronic device shown in an exemplary embodiment of the present application.
  • first, second, third, etc. may be used in this application to describe various information, the information should not be limited to these terms. These terms are only used to distinguish the same type of information from each other.
  • first information may also be referred to as second information, and similarly, the second information may also be referred to as first information.
  • word “if” as used herein can be interpreted as "when” or “when” or "in response to determination”.
  • the live broadcast screen of the host on the live broadcast platform often includes not only the original image of the host obtained through the host’s camera, or the original image captured by the host on the terminal display interface, but also includes some stickers and subtitles added by the host himself.
  • this screen is the host’s original image obtained by the host’s camera. No additional live broadcast information is added to the host’s original image, and only the host’s image obtained by the host’s camera is displayed. .
  • the screen includes some text, image advertisements, and copyright identification information added by the host on the original screen obtained by the camera.
  • the host uses this screen as a live broadcast for viewers to watch, Figure 1b
  • These advertisements and logos in the live broadcast screen shown are necessary for the live broadcast platform, so they must appear in the live broadcast screen of the host on the live broadcast platform, but cannot appear on the short video platform published to the third party. In the short video screen at the moment.
  • This application provides a method for processing live content.
  • the server combines the original video stream sent by the host client and the live additional information into the live video stream, and sends the live video stream to the viewer client, as well as evaluation indicators based on user behavior Save the designated image frame of the original video stream.
  • the live video stream synthesized by the original video stream and the additional information of the live broadcast is used for the live broadcast, and the specified image frames of the original video stream are saved for publishing to a third-party video platform.
  • Fig. 2 is a flowchart of a method for processing live content shown in an exemplary embodiment of the application. As shown in Fig. 2, the method includes the following steps:
  • S201 Receive the original video stream sent by the host client through the video channel and the live additional information sent through the signaling channel;
  • S202 Synthesize the original video stream and the live additional information into a live video stream according to the live additional information, and send the live video stream to the viewer client;
  • S203 Save the designated image frame of the original video stream based on the evaluation index of the user behavior.
  • the original image frame sent by the host client can be the image frame of any screen captured in real time by the camera, and the frame can be the screen that the host wants to use for live broadcast, for example, the host can capture itself through the camera during the live broadcast. , Or some pictures of the surrounding environment of the host.
  • the acquired original image frame can also be the screen captured by the host on the terminal display interface, such as a game or movie being broadcast by the host, the host can capture the game interface or movie playback interface displayed on a mobile device or a fixed device as the original image frame .
  • the live broadcast additional information signaling carries live broadcast additional information.
  • the live broadcast additional information may indicate some subtitles and copyright identifications added to the original image frame, or texture pictures added manually by the host, and these subtitles and pictures may be some advertisement information.
  • the live broadcast additional information includes the attribute information and location information of the live broadcast additional information.
  • the attribute information is some fixed attributes of the live broadcast additional information.
  • the attribute information can be the style of subtitles, for example, it can be subtitled. Font, size, color, background color and other text styles.
  • the attribute information can be the URL of the picture and the zoom factor of the picture according to the attribute information and location information.
  • the location information is the coordinates of the live broadcast additional information added to the image frame of the original video stream.
  • the original video stream and the live broadcast additional information are combined into the live video stream.
  • the original video stream can be combined with the live broadcast according to the attribute information and location information.
  • the additional information is combined into a live video stream.
  • the designated image frame is the original image frame determined by the server based on the evaluation index of the user behavior.
  • the evaluation index of the user behavior may include at least one of the number of gifts and the activity of the public screen. Of course, it may be other than these two. Evaluation indicators other than indicators. Which parameter to choose as the evaluation index can be determined according to the characteristics of the specified image frame that the person using this scheme needs to select. For example, the number of gifts or fair activity can also be used as the evaluation index, or other parameters can be used as the evaluation index. index.
  • the server receives the live video stream sent by the host client and sends the live video stream to the audience client for viewing by the audience client.
  • the audience can enter the live broadcast room of the host they like according to their personal preferences.
  • the host As the number of bullet screens and the number of gifts increases, or the value of the gift is higher, the host’s The heat of the live broadcast room will also rise.
  • the server When the server sends the live video frame to the audience client, it will use the real-time audience barrage, the number of audience gifts, the current live broadcast room heat, etc. as evaluation indicators, and real-time statistics of the audience barrage in the host's live room corresponding to the current live video frame According to these evaluation indicators, determine whether the current live video frame is a video frame of a wonderful moment.
  • the wonderful moment refers to if the audience barrage in the live broadcast room of the host within a certain period of time If the number, the number of viewers giving gifts, and the current popularity of the live broadcast room reach a certain index, then it is determined that the live video frames within the time period constitute a wonderful moment.
  • the specified image frame can be the original image frame corresponding to the live video frame included in the time period of the wonderful moment, that is, the original image frame of the wonderful moment. Since the server will judge in real time whether the current moment is a wonderful moment, when it is judged that the current moment is a wonderful moment Start recording the received original image frame at the moment, and stop recording and save the recorded file as a video file when the current moment is judged to be a non-exciting moment.
  • the format of the video file can be, for example, MP4 format, AVI format, etc., and the server It will continue to judge whether the current moment is a wonderful moment. Once it is judged that the current moment is a wonderful moment, it will be recorded and saved again. Since the original image frame of the wonderful moment does not carry some additional live broadcast information such as subtitles, copyright signs, or stickers manually added by the anchor, it can be used to publish to other video platforms.
  • the process for the server to store the specified file can refer to the following process: the server can store the original video stream in segments according to the time sequence of the received original video stream, and judge whether there is a specified image frame in the saved original video stream.
  • the timing can be performed in real time or asynchronously with the saved action. For example, each time a file is saved, it can be queried whether the time period includes a wonderful moment. If the time period includes a wonderful moment, it is determined that the original image frame of the wonderful moment exists in the file, and the file is compared with other original image frames of the wonderful moment.
  • File integration and archiving if the time period of the file does not include the wonderful moments, it is determined that the original image frames of the wonderful moments do not exist in the file, and the file can be deleted directly to avoid occupying the memory space of the device. Since the files uniformly sent to the server after integrated archiving are all files that include the original image frames of the wonderful moments, there is no need for the server to filter again, so that users can directly share these files obtained by the server to other video platforms when needed.
  • the designated image frame may be distributed to the current live broadcast platform or a third-party video platform.
  • the user can obtain the video files of the wonderful moments saved on the server from the server, and the user can edit the obtained video files, for example, perform appropriate editing, add special effects, etc., and then distribute the processed video files to third-party videos
  • the platform is for viewers of third-party video platforms to watch these wonderful moments, or the processed video files can also be distributed to the current live broadcast platform for viewers of the current live broadcast platform to watch these wonderful moments.
  • Fig. 3 is a flowchart of a method for processing live content shown in an exemplary embodiment of the application. As shown in Fig. 3, the method includes the following steps:
  • S302 Send the original video stream and the live broadcast additional information signaling to the server respectively, so that the server synthesizes the original video stream and the live broadcast additional information into the live video stream according to the live broadcast additional information signaling, and sends it to the viewer client.
  • the live video stream and the evaluation index based on user behavior save the designated image frames of the original video stream.
  • This application also provides a live broadcast system, which includes an anchor client, a server, and an audience client;
  • the anchor client is used to obtain the original video stream
  • the server is configured to synthesize the original video stream and the additional information of the live broadcast into a live video stream, and send the live video stream to the viewer client;
  • the specified image frame of the original video stream is saved based on the evaluation index of the user behavior.
  • the host client can also locally synthesize the original image frame and the additional information of the live broadcast into a local live image frame for the host to watch the live image locally.
  • the server can directly send the saved file of the specified image frame to the third-party video platform, or the user can obtain the file of the specified image frame from the server and publish it directly or after editing and publishing to the third-party video platform for viewers on the third-party video platform Watch.
  • FIG. 3a there are an anchor client 301a, several viewer clients 302a, a media server 303a, a synthesis server 304a, a live video stream database 305a and an original video stream database 306a in the live broadcast system.
  • the host can enter the live broadcast room through the host client 301a.
  • the camera collects the original video stream, and pushes the collected original video stream to the media server 303a in the form of streaming media data.
  • the media server 303a in the form of streaming media data.
  • Some additional live broadcast information of the viewer is sent to the composition server 304a in the form of signaling (for example, through a channel for sending control signaling).
  • the media server 303a After receiving the original video stream, the media server 303a also sends the original video stream to the synthesis server 304a.
  • the synthesis server 304a needs to synthesize the original video stream and the live broadcast additional information to generate a live video stream for the viewer function client to watch.
  • the live video stream viewed by the viewer is a video picture with subtitles, copyright identification, and other information attached.
  • the synthesis server 304a stores the synthesized live video stream in the live video stream database 305a, and the original video stream in the original video stream database 306a.
  • the original video stream can be stored in multiple folders according to the time sequence of the received file.
  • the live video stream is distributed by the synthesis server 304a to each viewer client 302a in real time.
  • the video data stored in the original video stream database 306a it can be judged whether it contains the image frame of the wonderful moment in real time or at an appropriate time according to different needs.
  • the judgment method can be realized by means of AI image recognition.
  • the basis for judgment can be Use certain evaluation indicators that can analyze user behavior. File the retrieved folders containing the image frames of the notable moments; delete the folders that do not contain the image frames of the notable moments.
  • the original video stream stored in the original video stream database 306a can be distributed to other video platforms at an appropriate time, or the user can use a client to obtain it from the synthesis server 304a and send it to the user, so that the user can send other videos to other video streams.
  • the synthesis server 304a when the synthesis server 304a pushes the synthesized live video stream, it may not push it to the anchor client 301a, because the anchor client 301a can obtain the original video stream collected by the anchor locally. , It is also possible to obtain additional live broadcast information such as subtitles and pictures, so the host client 301a also has the ability to synthesize live video streams. The host client 301a can preview the live broadcast effect locally after completing the synthesis of the local live video stream, which makes the preview process more efficient.
  • this application also provides an embodiment of an apparatus for processing live content.
  • FIG. 4 is an apparatus 400 for processing live content shown in an exemplary embodiment of this application, including:
  • the receiving module 401 is configured to receive the original video stream sent by the host client through the video channel and the live broadcast additional information sent through the signaling channel;
  • the synthesis module 402 is configured to synthesize the original video stream and the live broadcast additional information into a live video stream according to the live broadcast additional information;
  • the transmission module 403 is configured to send the live video stream to the audience client;
  • the saving module 404 is configured to save the designated image frame of the original video stream based on the evaluation index of the user behavior.
  • FIG. 5 shows an apparatus 500 for processing live content according to an exemplary embodiment of this application, including:
  • the obtaining module 501 is used to obtain the original video stream
  • the sending module 502 is configured to send the original video stream and the live broadcast additional information to the server respectively, so that the server combines the original video stream and the live broadcast additional information into the live video stream according to the live broadcast additional information signaling, and presents it to the audience client
  • the terminal sends the live video stream, and saves the designated image frame of the original video stream based on the evaluation index of the user behavior, wherein the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel.
  • the embodiments of the apparatus for processing live content in this application can be applied to devices.
  • the device embodiments can be implemented by software, or can be implemented by hardware or a combination of software and hardware.
  • Taking software implementation as an example as a logical device, it is formed by reading the corresponding computer program instructions in the non-volatile memory into the memory through the processor of the device where it is located.
  • FIG. 6 a hardware structure diagram of the device where the device for processing live content of this application is located, except for the processor, memory, network interface, and non-volatile memory shown in FIG.
  • the device where the device is located in the embodiment usually includes other hardware according to the actual function of the device, which will not be repeated here.
  • non-volatile memory is used to store executable instructions of the processor, and the processor is configured to execute the instructions to implement the method for processing live content described in any of the foregoing embodiments.
  • the present application also provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, the method for processing live content described in any of the above embodiments is implemented.
  • the relevant part can refer to the part of the description of the method embodiment.
  • the device embodiments described above are merely illustrative.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in One place, or it can be distributed to multiple network units.
  • Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of the present application. Those of ordinary skill in the art can understand and implement without creative work.

Abstract

Provided is a method for processing live streaming content. The method comprises: receiving an original video stream sent by an anchor client by means of a video channel and additional live streaming information sent by the anchor client by means of a signaling channel; according to the additional live streaming information, synthesizing the original video stream and the additional live streaming information into a live streaming video stream, and sending the live streaming video stream to an audience client; and storing a specified image frame of the original video stream on the basis of an evaluation index of user behaviors. The audience client can view the live streaming video stream without perception, and the original video stream can be conveniently shared to other video platforms in a timely manner.

Description

处理直播内容的方法、装置、系统Method, device and system for processing live content
本申请要求于2019年12月31日提交中国专利局、申请号为201911407509.7、发明名称为“处理直播内容的方法、装置、系统”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on December 31, 2019, the application number is 201911407509.7, and the invention title is "Methods, Devices, and Systems for Processing Live Content", the entire content of which is incorporated herein by reference. Applying.
技术领域Technical field
本申请涉及直播领域,尤其涉及一种处理直播内容的方法、装置、系统。This application relates to the field of live broadcast, and in particular to a method, device, and system for processing live broadcast content.
背景技术Background technique
现如今短视频出现并融入人们的生活,在各个短视频平台上刷短视频已经成为人们娱乐消磨的方式之一,而有些短视频的内容是主播在直播过程中记录下来的一些精彩时刻,这些精彩时刻往往需要直播平台通过对主播的直播内容进行筛选,来找出优质主播直播过程中的精彩时刻,然后再将其发布到第三方的短视频平台上。Nowadays, short videos are appearing and integrated into people’s lives. Using short videos on various short video platforms has become one of the ways for people to entertain and spend their lives. Some short video content is some wonderful moments recorded by the anchor during the live broadcast. Wonderful moments often require the live broadcast platform to filter the live broadcast content of the host to find the brilliant moments during the live broadcast of the high-quality host, and then publish them to a third-party short video platform.
但直播平台上主播的直播画面中往往不仅仅包括了通过主播的摄像头获取的主播的原始画面,同时还包括了一些主播自己添加的贴图、字幕等一些广告或者直播平台的版权标识,这些广告和标识必须出现在直播平台上主播的直播画面中,但却不能出现在发布到第三方的短视频平台上的精彩时刻的短视频画面中,然而在选取主播的直播画面生成短视频发布到第三方的短视频平台时,生成的短视频画面中通常携带了这些广告和标识,导致发布到第三方的短视频平台上的精彩时刻的短视频画面中也包括了广告和标识,因此如何在保留直播平台上主播的直播画面中存在广告和标识的同时,避免发布到第三方的短视频平台上的精彩时刻的短视频画面中携带广告和标识成为必须解决的问题。However, the live broadcast pictures of the host on the live broadcast platform often include not only the original picture of the host obtained through the host’s camera, but also some advertisements such as stickers and subtitles added by the host himself or the copyright mark of the live broadcast platform. These advertisements and The logo must appear in the live broadcast screen of the host on the live broadcast platform, but cannot appear in the short video screen of the wonderful moments published on the short video platform of the third party. However, the short video generated on the live screen of the selected host is posted to the third party In the short video platform, the generated short video pictures usually carry these advertisements and logos. As a result, the short video pictures of the wonderful moments posted on the third-party short video platform also include advertisements and logos. Therefore, how to keep the live broadcast While there are advertisements and logos in the live broadcast screens of the anchors on the platform, it is a problem that must be solved to avoid the short video screens of the wonderful moments posted to the short video platform of the third party carrying the advertisements and logos.
发明内容Summary of the invention
有鉴于此,本申请提供一种处理直播内容的方法、装置、直播系统、设备以及计算机可读存储介质。In view of this, the present application provides a method, device, live broadcast system, equipment, and computer-readable storage medium for processing live broadcast content.
根据本申请实施例的第一方面,提供一种处理直播内容的方法,所述方法包括:According to a first aspect of the embodiments of the present application, there is provided a method for processing live content, the method including:
接收主播客户端通过视频通道发送的原始视频流和通过信令通道发送的直播附加信息;Receive the original video stream sent by the host client through the video channel and the additional live broadcast information sent through the signaling channel;
根据所述直播附加信息将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流;以及Synthesize the original video stream and the live additional information into a live video stream according to the live additional information, and send the live video stream to the viewer client; and
基于用户行为的评估指标保存所述原始视频流的指定图像帧。The specified image frame of the original video stream is saved based on the evaluation index of the user behavior.
根据本申请实施例的第二方面,提供一种处理直播内容的方法,所述方法包括:According to a second aspect of the embodiments of the present application, there is provided a method for processing live content, the method including:
获取原始视频流;Obtain the original video stream;
分别将原始视频流和直播附加信息发送给服务器,以使服务器根据所述直播附加信息将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流,以及基于用户行为的评估指标保存所述原始视频流的指定图像帧,其中,所述原始视频流通过视频通道发送,所述直播附加信息通过信令通道发送。Send the original video stream and the live additional information to the server respectively, so that the server synthesizes the original video stream and the live additional information into the live video stream according to the live additional information, sends the live video stream to the viewer client, and The user behavior evaluation index saves the designated image frames of the original video stream, where the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel.
根据本申请实施例的第三方面,提供一种直播系统,所述直播系统包括主播客户端、服务器、观众客户端;According to a third aspect of the embodiments of the present application, a live broadcast system is provided, and the live broadcast system includes an anchor client, a server, and an audience client;
所述主播客户端用于获取原始视频流;The anchor client is used to obtain the original video stream;
分别将所述原始视频流与直播附加信息发送给服务器,其中,所述原始视频流通过视频通道发送,所述直播附加信息通过信令通道发送;Sending the original video stream and the live broadcast additional information to the server respectively, where the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel;
所述服务器用于将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流;以及The server is configured to synthesize the original video stream and the additional information of the live broadcast into a live video stream, and send the live video stream to the viewer client; and
基于用户行为的评估指标保存所述原始视频流的指定图像帧;Saving the designated image frame of the original video stream based on the evaluation index of the user behavior;
所述观众客户端,用于接收并展示所述直播视频流。The audience client is used to receive and display the live video stream.
根据本申请实施例的第四方面,提供一种处理直播内容的装置,所述装置包括:According to a fourth aspect of the embodiments of the present application, there is provided an apparatus for processing live content, the apparatus including:
接收模块,用于接收主播客户端通过视频通道发送的原始视频流和通过信令通道发送的直播附加信息;The receiving module is used to receive the original video stream sent by the host client through the video channel and the live broadcast additional information sent through the signaling channel;
合成模块,用于根据所述直播附加信息将所述原始视频流和直播附加 信息合成直播视频流;A synthesis module for synthesizing the original video stream and the live broadcast additional information into a live video stream according to the live broadcast additional information;
传输模块,用于向观众客户端发送所述直播视频流;The transmission module is used to send the live video stream to the audience client;
保存模块,用于基于用户行为的评估指标保存所述原始视频流的指定图像帧。The saving module is configured to save the designated image frame of the original video stream based on the evaluation index of the user behavior.
根据本申请实施例的第五方面,提供一种处理直播内容的装置,所述装置包括:According to a fifth aspect of the embodiments of the present application, there is provided an apparatus for processing live content, the apparatus including:
获取模块,用于获取原始视频流;The acquisition module is used to acquire the original video stream;
发送模块,用于分别将所述原始视频流与直播附加信息发送给服务器,以使服务器根据所述直播附加信息信令将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流,以及基于用户行为的评估指标保存所述原始视频流的指定图像帧,其中,所述原始视频流通过视频通道发送,所述直播附加信息通过信令通道发送。The sending module is configured to send the original video stream and live additional information to the server respectively, so that the server synthesizes the original video stream and the live additional information into the live video stream according to the live additional information signaling to the viewer client The live video stream is sent, and the designated image frame of the original video stream is saved based on the evaluation index of the user behavior, wherein the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel.
本申请主播客户端将原始视频流和直播附加信息分别通过不同的传输通道发给服务器,由服务器来完成将直播附加信息和原始视频流合成直播视频流的动作而不是由主播客户端在推流时完成,由于服务器可以获得未添加直播附加信息的原始视频流,因此本申请既可以让观众客户端无感知的观看直播视频流,又可以及时方便的将原始视频流分享到其他视频平台。The host client of this application sends the original video stream and live additional information to the server through different transmission channels, and the server completes the action of synthesizing the live additional information and the original video stream into the live video stream instead of pushing the stream by the host client When the time is completed, since the server can obtain the original video stream without additional live broadcast information, this application can not only allow the viewer client to watch the live video stream imperceptibly, but also share the original video stream to other video platforms in a timely and convenient manner.
附图说明Description of the drawings
图1a是本申请一示例性实施例示出的一种直播场景下采集的原始视频流的画面的示意图;Fig. 1a is a schematic diagram of a picture of an original video stream collected in a live broadcast scenario according to an exemplary embodiment of the present application;
图1b是本申请一示例性实施例示出的一种合成后的直播画面的示意图;Fig. 1b is a schematic diagram of a synthesized live broadcast picture according to an exemplary embodiment of the present application;
图2是本申请一示例性实施例示出的一种处理直播内容的方法的流程图;Fig. 2 is a flowchart of a method for processing live content shown in an exemplary embodiment of the present application;
图3是本申请一示例性实施例示出的另一种处理直播内容的方法的流程图;Fig. 3 is a flowchart of another method for processing live content shown in an exemplary embodiment of the present application;
图3a为本申请一应用场景下的应用实例;Figure 3a is an application example in an application scenario of this application;
图4是本申请一示例性实施例示出的一种处理直播内容的装置的示意 图;Fig. 4 is a schematic diagram of an apparatus for processing live content shown in an exemplary embodiment of the present application;
图5是本申请一示例性实施例示出的另一种处理直播内容的装置的示意图;Fig. 5 is a schematic diagram of another device for processing live content shown in an exemplary embodiment of the present application;
图6是本申请一示例性实施例示出的电子设备的示意图。Fig. 6 is a schematic diagram of an electronic device shown in an exemplary embodiment of the present application.
具体实施方式Detailed ways
这里将详细地对示例性实施例进行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。The exemplary embodiments will be described in detail here, and examples thereof are shown in the accompanying drawings. When the following description refers to the accompanying drawings, unless otherwise indicated, the same numbers in different drawings represent the same or similar elements. The implementation manners described in the following exemplary embodiments do not represent all implementation manners consistent with the present application. On the contrary, they are merely examples of devices and methods consistent with some aspects of the application as detailed in the appended claims.
在本申请使用的术语是仅仅出于描述特定实施例的目的,而非旨在限制本申请。在本申请和所附权利要求书中所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式,除非上下文清楚地表示其他含义。还应当理解,本文中使用的术语“和/或”是指并包含一个或多个相关联的列出项目的任何或所有可能组合。The terms used in this application are only for the purpose of describing specific embodiments, and are not intended to limit the application. The singular forms of "a", "said" and "the" used in this application and the appended claims are also intended to include plural forms, unless the context clearly indicates other meanings. It should also be understood that the term "and/or" as used herein refers to and includes any or all possible combinations of one or more associated listed items.
应当理解,尽管在本申请可能采用术语第一、第二、第三等来描述各种信息,但这些信息不应限于这些术语。这些术语仅用来将同一类型的信息彼此区分开。例如,在不脱离本申请范围的情况下,第一信息也可以被称为第二信息,类似地,第二信息也可以被称为第一信息。取决于语境,如在此所使用的词语“如果”可以被解释成为“在……时”或“当……时”或“响应于确定”。It should be understood that although the terms first, second, third, etc. may be used in this application to describe various information, the information should not be limited to these terms. These terms are only used to distinguish the same type of information from each other. For example, without departing from the scope of this application, the first information may also be referred to as second information, and similarly, the second information may also be referred to as first information. Depending on the context, the word "if" as used herein can be interpreted as "when" or "when" or "in response to determination".
随着短视频的爆发,现如今短视频已经完全融入人们的生活,随时随地每个人都可以是导演,通过摄像头记录自己或身边精彩的瞬间并分享到各大短视频平台以供屏幕另一端的观众观看,这也使得在短视频平台上刷短视频成为娱乐消磨的最常见的方式之一。而一些短视频的内容通常都是主播在直播过程中记录下来的一些精彩时刻,这些精彩时刻往往需要直播平台通过对主播的直播内容进行筛选,来找出优质主播直播过程中的精彩时刻,然后再将其发布到第三方的短视频平台上。With the outbreak of short videos, short videos are now fully integrated into people’s lives. Everyone can be a director anytime, anywhere. They can record their own or the wonderful moments around them through the camera and share them on major short video platforms for viewing on the other side of the screen. Audience watching, which also makes short videos on short video platforms one of the most common ways of entertainment and consumption. The content of some short videos is usually some wonderful moments recorded by the host during the live broadcast. These wonderful moments often require the live broadcast platform to filter the live broadcast content of the host to find the wonderful moments during the live broadcast of the high-quality host. Then publish it to a third-party short video platform.
但直播平台上主播的直播画面中往往不仅仅包括了通过主播的摄像头获取的主播的原始画面,或者主播在终端显示界面捕获到的原始画面,同时还包括了一些主播自己添加的贴图、字幕等一些广告或者直播平台的版权标识,如图1a所示,该画面为主播摄像头获取的主播原始画面,该主播原始画面中并未添加任何直播附加信息,只显示了主播摄像头获取到的主播的画面。However, the live broadcast screen of the host on the live broadcast platform often includes not only the original image of the host obtained through the host’s camera, or the original image captured by the host on the terminal display interface, but also includes some stickers and subtitles added by the host himself. Some advertisements or copyright signs of live broadcast platforms, as shown in Figure 1a, this screen is the host’s original image obtained by the host’s camera. No additional live broadcast information is added to the host’s original image, and only the host’s image obtained by the host’s camera is displayed. .
如图1b所示,该画面中包括了主播在摄像头获取的原始画面的基础上添加的一些文字、图片的广告以及版权标识的信息,主播将该画面作为直播的画面以供观众观看,图1b所示的直播画面中的这些广告和标识对于直播平台而言是必须的,因此必须要出现在直播平台上主播的直播画面中,但却不能出现在发布到第三方的短视频平台上的精彩时刻的短视频画面中。As shown in Figure 1b, the screen includes some text, image advertisements, and copyright identification information added by the host on the original screen obtained by the camera. The host uses this screen as a live broadcast for viewers to watch, Figure 1b These advertisements and logos in the live broadcast screen shown are necessary for the live broadcast platform, so they must appear in the live broadcast screen of the host on the live broadcast platform, but cannot appear on the short video platform published to the third party. In the short video screen at the moment.
本申请提供一种处理直播内容的方法,由服务器来将主播客户端发送的原始视频流和直播附加信息合成直播视频流并向观众客户端发送所述直播视频流,以及基于用户行为的评估指标保存所述原始视频流的指定图像帧。实现了将原始视频流和直播附加信息合成的直播视频流用于直播,以及保存原始视频流的指定图像帧以供发布到第三方视频平台。This application provides a method for processing live content. The server combines the original video stream sent by the host client and the live additional information into the live video stream, and sends the live video stream to the viewer client, as well as evaluation indicators based on user behavior Save the designated image frame of the original video stream. The live video stream synthesized by the original video stream and the additional information of the live broadcast is used for the live broadcast, and the specified image frames of the original video stream are saved for publishing to a third-party video platform.
图2为本申请一示例性实施例示出的一种处理直播内容的方法流程图,如图2所示,该方法包括以下步骤:Fig. 2 is a flowchart of a method for processing live content shown in an exemplary embodiment of the application. As shown in Fig. 2, the method includes the following steps:
S201:接收主播客户端通过视频通道发送的原始视频流和通过信令通道发送的直播附加信息;S201: Receive the original video stream sent by the host client through the video channel and the live additional information sent through the signaling channel;
S202:根据所述直播附加信息将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流;S202: Synthesize the original video stream and the live additional information into a live video stream according to the live additional information, and send the live video stream to the viewer client;
S203:基于用户行为的评估指标保存所述原始视频流的指定图像帧。S203: Save the designated image frame of the original video stream based on the evaluation index of the user behavior.
在S201中,主播客户端发送的原始图像帧可以是通过摄像头实时采集到的任何画面的图像帧,该画面可以是主播想用来进行直播的画面,例如主播在直播时通过摄像头拍摄到的自身的画面,或者拍摄到的主播周围环境的一些画面。获取的原始图像帧还可以是主播在终端显示界面上捕获到的画面,例如主播正在直播的游戏或电影,主播可以捕获移动设备或是固 定设备上显示的游戏界面或电影播放界面作为原始图像帧。In S201, the original image frame sent by the host client can be the image frame of any screen captured in real time by the camera, and the frame can be the screen that the host wants to use for live broadcast, for example, the host can capture itself through the camera during the live broadcast. , Or some pictures of the surrounding environment of the host. The acquired original image frame can also be the screen captured by the host on the terminal display interface, such as a game or movie being broadcast by the host, the host can capture the game interface or movie playback interface displayed on a mobile device or a fixed device as the original image frame .
直播附加信息信令携带有直播附加信息,直播附加信息可以表示该原始图像帧添加的一些字幕、版权标识或者由主播人工添加的贴图图片等等,这些字幕和图片可以是一些广告信息。The live broadcast additional information signaling carries live broadcast additional information. The live broadcast additional information may indicate some subtitles and copyright identifications added to the original image frame, or texture pictures added manually by the host, and these subtitles and pictures may be some advertisement information.
在S202中,直播附加信息包括直播附加信息的属性信息和位置信息,属性信息为直播附加信息的一些固定属性,例如直播附加信息为字幕时,属性信息可以是字幕的样式,例如可以是字幕的字体、大小、颜色、背景色等文字的样式,直播附加信息为用于获取图片的相关信息时,根据所述属性信息和位置信息属性信息可以是图片的URL和图片的缩放系数两种参数中的一种,或同时包含这两种参数。位置信息为直播附加信息加在原始视频流的图像帧上的坐标,根据直播附加信息信令将原始视频流和直播附加信息合成直播视频流可以是根据属性信息和位置信息将原始视频流和直播附加信息合成直播视频流。In S202, the live broadcast additional information includes the attribute information and location information of the live broadcast additional information. The attribute information is some fixed attributes of the live broadcast additional information. For example, when the live broadcast additional information is subtitles, the attribute information can be the style of subtitles, for example, it can be subtitled. Font, size, color, background color and other text styles. When the live broadcast additional information is used to obtain the relevant information of the picture, the attribute information can be the URL of the picture and the zoom factor of the picture according to the attribute information and location information. One or both of these parameters. The location information is the coordinates of the live broadcast additional information added to the image frame of the original video stream. According to the live broadcast additional information signaling, the original video stream and the live broadcast additional information are combined into the live video stream. The original video stream can be combined with the live broadcast according to the attribute information and location information. The additional information is combined into a live video stream.
S203中,指定图像帧为服务器基于用户行为的评估指标确定的原始图像帧,用户行为的评估指标可以包括送礼数量和公屏活跃度中的至少一种指标,当然,也可以是除这两种指标之外的其他评估指标。具体选取何种参数作为评估指标,可以根据使用本方案的人所需要选取的指定图像帧的特点来确定,例如也可以仅将送礼数量或公平活跃度作为评估指标,或者将其他的参数作为评估指标。In S203, the designated image frame is the original image frame determined by the server based on the evaluation index of the user behavior. The evaluation index of the user behavior may include at least one of the number of gifts and the activity of the public screen. Of course, it may be other than these two. Evaluation indicators other than indicators. Which parameter to choose as the evaluation index can be determined according to the characteristics of the specified image frame that the person using this scheme needs to select. For example, the number of gifts or fair activity can also be used as the evaluation index, or other parameters can be used as the evaluation index. index.
例如,以下描述一个以精彩时刻的图像帧作为指定图像帧的例子。在直播场景下,服务器接收到主播客户端发送的直播视频流并将该直播视频流发送给观众客户端以供观众客户端的观众观看,观众可以根据个人的喜好进入自己喜欢的主播的直播间,向自己喜欢的主播送礼或者在直播间公屏留言、发弹幕与主播或其他一同观看直播的观众进行交流互动,随着弹幕数量和送礼数量增多,或者送礼的价值越高,该主播的直播间的热度也会上涨。服务器在向观众客户端发送直播视频帧的同时,会将实时的观众弹幕数量、观众送礼数量、当前直播间热度等作为评估指标,实时统计当前直播视频帧对应的主播直播间的观众弹幕数量、观众送礼数量、当前直播间热度等,并根据这些评估指标确定当前直播视频帧是否为精彩时刻的 视频帧,所述的精彩时刻是指,如果在一段时间内主播直播间的观众弹幕数量、观众送礼数量、当前直播间热度等达到一定的指标,那么确定该时间段内的直播视频帧组成一个精彩时刻。指定图像帧可以是精彩时刻的时间段内包括的直播视频帧所对应的原始图像帧,即精彩时刻的原始图像帧,由于服务器会实时判断当前时刻是否为精彩时刻,当判断出当前时刻为精彩时刻时开始对接收的原始图像帧进行录制,并在判断出当前时刻为非精彩时刻时停止录制并保存录制的文件为视频文件,视频文件的格式可以是例如MP4格式、AVI格式等,同时服务器将继续持续判断当前时刻是否为精彩时刻,一旦判断出当前时刻为精彩时刻,将再次进行录制保存。由于精彩时刻的原始图像帧并未携带有字幕、版权标识或者由主播人工添加的贴图图片等一些直播附加信息,因此可用于发布到其他视频平台。For example, the following describes an example in which an image frame of a wonderful moment is used as a designated image frame. In the live broadcast scenario, the server receives the live video stream sent by the host client and sends the live video stream to the audience client for viewing by the audience client. The audience can enter the live broadcast room of the host they like according to their personal preferences. Give a gift to your favorite host or leave a message on the public screen in the live broadcast room, send a bullet screen to communicate with the host or other viewers who watch the live broadcast together. As the number of bullet screens and the number of gifts increases, or the value of the gift is higher, the host’s The heat of the live broadcast room will also rise. When the server sends the live video frame to the audience client, it will use the real-time audience barrage, the number of audience gifts, the current live broadcast room heat, etc. as evaluation indicators, and real-time statistics of the audience barrage in the host's live room corresponding to the current live video frame According to these evaluation indicators, determine whether the current live video frame is a video frame of a wonderful moment. The wonderful moment refers to if the audience barrage in the live broadcast room of the host within a certain period of time If the number, the number of viewers giving gifts, and the current popularity of the live broadcast room reach a certain index, then it is determined that the live video frames within the time period constitute a wonderful moment. The specified image frame can be the original image frame corresponding to the live video frame included in the time period of the wonderful moment, that is, the original image frame of the wonderful moment. Since the server will judge in real time whether the current moment is a wonderful moment, when it is judged that the current moment is a wonderful moment Start recording the received original image frame at the moment, and stop recording and save the recorded file as a video file when the current moment is judged to be a non-exciting moment. The format of the video file can be, for example, MP4 format, AVI format, etc., and the server It will continue to judge whether the current moment is a wonderful moment. Once it is judged that the current moment is a wonderful moment, it will be recorded and saved again. Since the original image frame of the wonderful moment does not carry some additional live broadcast information such as subtitles, copyright signs, or stickers manually added by the anchor, it can be used to publish to other video platforms.
作为例子,服务器存储指定文件的过程可以参照以下过程:服务器可以按照所接收的原始视频流的时间先后顺序,分段保存原始视频流,而判断所保存的原始视频流中是否存在指定图像帧的时机,既可以是实时进行、也可以是与保存的动作异步进行。例如可以在每保存一个文件时,查询该时间段是否包括精彩时刻,若该时间段包括精彩时刻,确定该文件存在精彩时刻的原始图像帧,将该文件与其他存在精彩时刻的原始图像帧的文件整合归档;若该文件所属时间段不包括精彩时刻,确定该文件不存在精彩时刻的原始图像帧,可以直接将该文件删除以免占用设备内存空间。由于整合归档后统一发送给服务器的文件都是包括精彩时刻的原始图像帧的文件,因此不需要服务器再次进行筛选,方便用户在需要时直接将服务器获取到的这些文件分享到其他视频平台。As an example, the process for the server to store the specified file can refer to the following process: the server can store the original video stream in segments according to the time sequence of the received original video stream, and judge whether there is a specified image frame in the saved original video stream. The timing can be performed in real time or asynchronously with the saved action. For example, each time a file is saved, it can be queried whether the time period includes a wonderful moment. If the time period includes a wonderful moment, it is determined that the original image frame of the wonderful moment exists in the file, and the file is compared with other original image frames of the wonderful moment. File integration and archiving; if the time period of the file does not include the wonderful moments, it is determined that the original image frames of the wonderful moments do not exist in the file, and the file can be deleted directly to avoid occupying the memory space of the device. Since the files uniformly sent to the server after integrated archiving are all files that include the original image frames of the wonderful moments, there is no need for the server to filter again, so that users can directly share these files obtained by the server to other video platforms when needed.
在一个实施例中,指定图像帧可以被分发至当前直播平台或者第三方视频平台。用户可以从服务器获取到服务器保存的精彩时刻的视频文件,用户可以对获取的视频文件进行编辑,例如可以进行适当的剪辑、添加特效等处理,再将处理后得到的视频文件分发到第三方视频平台以供第三方视频平台的观众观看到这些精彩时刻,或者也可以将处理后得到的视频文件分发到当前直播平台以供当前直播平台的观众观看到这些精彩时刻。In one embodiment, the designated image frame may be distributed to the current live broadcast platform or a third-party video platform. The user can obtain the video files of the wonderful moments saved on the server from the server, and the user can edit the obtained video files, for example, perform appropriate editing, add special effects, etc., and then distribute the processed video files to third-party videos The platform is for viewers of third-party video platforms to watch these wonderful moments, or the processed video files can also be distributed to the current live broadcast platform for viewers of the current live broadcast platform to watch these wonderful moments.
图3为本申请一示例性实施例示出的一种处理直播内容的方法流程 图,如图3所示,该方法包括以下步骤:Fig. 3 is a flowchart of a method for processing live content shown in an exemplary embodiment of the application. As shown in Fig. 3, the method includes the following steps:
S301:获取原始视频流;S301: Obtain the original video stream;
S302:分别将所述原始视频流与直播附加信息信令发送给服务器,以使服务器根据所述直播附加信息信令将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流,以及基于用户行为的评估指标保存所述原始视频流的指定图像帧。S302: Send the original video stream and the live broadcast additional information signaling to the server respectively, so that the server synthesizes the original video stream and the live broadcast additional information into the live video stream according to the live broadcast additional information signaling, and sends it to the viewer client. The live video stream and the evaluation index based on user behavior save the designated image frames of the original video stream.
本申请还提供一种直播系统,所述直播系统包括主播客户端、服务器、观众客户端;This application also provides a live broadcast system, which includes an anchor client, a server, and an audience client;
所述主播客户端用于获取原始视频流;The anchor client is used to obtain the original video stream;
分别将所述原始视频流与直播附加信息信令发送给服务器;以及,Separately sending the original video stream and the live broadcast additional information signaling to the server; and,
所述服务器用于将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流;以及The server is configured to synthesize the original video stream and the additional information of the live broadcast into a live video stream, and send the live video stream to the viewer client; and
基于用户行为的评估指标保存所述原始视频流的指定图像帧。The specified image frame of the original video stream is saved based on the evaluation index of the user behavior.
作为例子,主播客户端还可以在本地将原始图像帧与直播附加信息合成本地的直播图像帧以供主播本地观看直播画面。另外,服务器可以将保存的指定图像帧的文件直接发送给第三方视频平台,也可以由用户从服务器获取指定图像帧的文件直接发布或编辑后发布到第三方视频平台以供第三方视频平台观众观看。As an example, the host client can also locally synthesize the original image frame and the additional information of the live broadcast into a local live image frame for the host to watch the live image locally. In addition, the server can directly send the saved file of the specified image frame to the third-party video platform, or the user can obtain the file of the specified image frame from the server and publish it directly or after editing and publishing to the third-party video platform for viewers on the third-party video platform Watch.
为了更好的理解本方案,以下列举一个应用实例。如图3a所示的直播环境下,直播系统中存在主播客户端301a、若干个观众客户端302a、媒体服务器303a、合成服务器304a以及直播视频流数据库305a和原始视频流数据库306a。In order to better understand this solution, an application example is listed below. In the live broadcast environment as shown in FIG. 3a, there are an anchor client 301a, several viewer clients 302a, a media server 303a, a synthesis server 304a, a live video stream database 305a and an original video stream database 306a in the live broadcast system.
主播可通过主播客户端301a进入直播间,当主播进行直播时,利用摄像头采集原始视频流,并将所采集的原始视频流以流媒体数据的形式推流给媒体服务器303a,另外将需要呈现给观众的一些直播附加信息以信令的形式(例如,可以通过发送控制信令的通道)发送给合成服务器304a。The host can enter the live broadcast room through the host client 301a. When the host performs live broadcast, the camera collects the original video stream, and pushes the collected original video stream to the media server 303a in the form of streaming media data. In addition, it will be presented to the media server 303a. Some additional live broadcast information of the viewer is sent to the composition server 304a in the form of signaling (for example, through a channel for sending control signaling).
媒体服务器303a在接收到原始视频流后,将原始视频流也发送给合成服务器304a。After receiving the original video stream, the media server 303a also sends the original video stream to the synthesis server 304a.
合成服务器304a需要对原始视频流和直播附加信息进行合成处理,生 成供观众功能客户端观看的直播视频流,观众看到的直播视频流是附加有字幕、版权标识等信息的视频画面。另外,合成服务器304a将合成的直播视频流存储到直播视频流数据库305a、原始视频流存储到原始视频流数据库306a。在保存原始视频流时,可以将原始视频流按照接收到文件的时序先后分多个文件夹存储。The synthesis server 304a needs to synthesize the original video stream and the live broadcast additional information to generate a live video stream for the viewer function client to watch. The live video stream viewed by the viewer is a video picture with subtitles, copyright identification, and other information attached. In addition, the synthesis server 304a stores the synthesized live video stream in the live video stream database 305a, and the original video stream in the original video stream database 306a. When saving the original video stream, the original video stream can be stored in multiple folders according to the time sequence of the received file.
直播视频流被合成服务器304a实时的分发给各个观众客户端302a。而对于原始视频流数据库306a中存储的视频数据,可以按照不同的需求,实时或选择合适时机判断是否包含精彩时刻的图像帧,判断的方式可以采用AI图像识别等手段实现,判断的依据可以是利用某些可以分析出用户行为的评估指标。将所检索出的包含精彩时刻的图像帧的文件夹进行归档;未包含精彩时刻的图像帧的文件夹删除。The live video stream is distributed by the synthesis server 304a to each viewer client 302a in real time. As for the video data stored in the original video stream database 306a, it can be judged whether it contains the image frame of the wonderful moment in real time or at an appropriate time according to different needs. The judgment method can be realized by means of AI image recognition. The basis for judgment can be Use certain evaluation indicators that can analyze user behavior. File the retrieved folders containing the image frames of the notable moments; delete the folders that do not contain the image frames of the notable moments.
存储在原始视频流数据库306a中的原始视频流可以选择适当的时机分发到其他视频平台,或用户可以利用某个客户端向合成服务器304a获取时,发送给该用户,以实现该用户向其他视频平台分享的需求。The original video stream stored in the original video stream database 306a can be distributed to other video platforms at an appropriate time, or the user can use a client to obtain it from the synthesis server 304a and send it to the user, so that the user can send other videos to other video streams. The need for platform sharing.
从图3a中可以看出,合成服务器304a在推送所合成的直播视频流时,可以不向该主播客户端301a推送,这是因为主播客户端301a本地既可以获取到主播所采集的原始视频流、也可以获取到字幕、图片等直播附加信息,因此主播客户端301a同样具有合成直播视频流的能力。主播客户端301a来完成本地直播视频流的合成后即可在本地预览直播效果,使得预览过程更加高效。As can be seen from Figure 3a, when the synthesis server 304a pushes the synthesized live video stream, it may not push it to the anchor client 301a, because the anchor client 301a can obtain the original video stream collected by the anchor locally. , It is also possible to obtain additional live broadcast information such as subtitles and pictures, so the host client 301a also has the ability to synthesize live video streams. The host client 301a can preview the live broadcast effect locally after completing the synthesis of the local live video stream, which makes the preview process more efficient.
与前述处理直播内容的方法的实施例相对应,本申请还提供了处理直播内容的装置的实施例。Corresponding to the foregoing embodiment of the method for processing live content, this application also provides an embodiment of an apparatus for processing live content.
如图4所示,图4为本申请一示例性实施例示出的一种处理直播内容的装置400,包括:As shown in FIG. 4, FIG. 4 is an apparatus 400 for processing live content shown in an exemplary embodiment of this application, including:
接收模块401,用于接收主播客户端通过视频通道发送的原始视频流和通过信令通道发送的直播附加信息;The receiving module 401 is configured to receive the original video stream sent by the host client through the video channel and the live broadcast additional information sent through the signaling channel;
合成模块402,用于根据所述直播附加信息将所述原始视频流和直播附加信息合成直播视频流;The synthesis module 402 is configured to synthesize the original video stream and the live broadcast additional information into a live video stream according to the live broadcast additional information;
传输模块403,用于向观众客户端发送所述直播视频流;The transmission module 403 is configured to send the live video stream to the audience client;
保存模块404,用于基于用户行为的评估指标保存所述原始视频流的指定图像帧。The saving module 404 is configured to save the designated image frame of the original video stream based on the evaluation index of the user behavior.
如图5所示,图5为本申请一示例性实施例示出的一种处理直播内容的装置500包括:As shown in FIG. 5, FIG. 5 shows an apparatus 500 for processing live content according to an exemplary embodiment of this application, including:
获取模块501,用于获取原始视频流;The obtaining module 501 is used to obtain the original video stream;
发送模块502,用于分别将所述原始视频流与直播附加信息发送给服务器,以使服务器根据所述直播附加信息信令将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流,以及基于用户行为的评估指标保存所述原始视频流的指定图像帧,其中,所述原始视频流通过视频通道发送,所述直播附加信息通过信令通道发送。The sending module 502 is configured to send the original video stream and the live broadcast additional information to the server respectively, so that the server combines the original video stream and the live broadcast additional information into the live video stream according to the live broadcast additional information signaling, and presents it to the audience client The terminal sends the live video stream, and saves the designated image frame of the original video stream based on the evaluation index of the user behavior, wherein the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel.
本申请处理直播内容的装置的实施例可以应用在设备上。装置实施例可以通过软件实现,也可以通过硬件或者软硬件结合的方式实现。以软件实现为例,作为一个逻辑意义上的装置,是通过其所在设备的处理器将非易失性存储器中对应的计算机程序指令读取到内存中运行形成的。从硬件层面而言,如图6所示,为本申请处理直播内容的装置所在设备的一种硬件结构图,除了图6所示的处理器、内存、网络接口、以及非易失性存储器之外,实施例中装置所在的设备通常根据该设备的实际功能,还可以包括其他硬件,对此不再赘述。The embodiments of the apparatus for processing live content in this application can be applied to devices. The device embodiments can be implemented by software, or can be implemented by hardware or a combination of software and hardware. Taking software implementation as an example, as a logical device, it is formed by reading the corresponding computer program instructions in the non-volatile memory into the memory through the processor of the device where it is located. From a hardware perspective, as shown in FIG. 6, a hardware structure diagram of the device where the device for processing live content of this application is located, except for the processor, memory, network interface, and non-volatile memory shown in FIG. In addition, the device where the device is located in the embodiment usually includes other hardware according to the actual function of the device, which will not be repeated here.
其中,非易失性存储器用于存储所述处理器可执行指令,所述处理器被配置为执行所述指令,以实现上述任一实施例所述的处理直播内容的方法。Wherein, the non-volatile memory is used to store executable instructions of the processor, and the processor is configured to execute the instructions to implement the method for processing live content described in any of the foregoing embodiments.
本申请还提供一种计算机可读存储介质,其上存储有计算机程序,所述程序被处理器执行时实现上述任一实施例所述的处理直播内容的方法。The present application also provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, the method for processing live content described in any of the above embodiments is implemented.
上述装置中各个单元的功能和作用的实现过程具体详见上述方法中对应步骤的实现过程,在此不再赘述。For the implementation process of the functions and roles of each unit in the foregoing device, refer to the implementation process of the corresponding steps in the foregoing method for details, and details are not described herein again.
对于装置实施例而言,由于其基本对应于方法实施例,所以相关之处参见方法实施例的部分说明即可。以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一 个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本申请方案的目的。本领域普通技术人员在不付出创造性劳动的情况下,即可以理解并实施。For the device embodiment, since it basically corresponds to the method embodiment, the relevant part can refer to the part of the description of the method embodiment. The device embodiments described above are merely illustrative. The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in One place, or it can be distributed to multiple network units. Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of the present application. Those of ordinary skill in the art can understand and implement without creative work.
以上所述仅为本申请的较佳实施例而已,并不用以限制本申请,凡在本申请的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本申请保护的范围之内。The above descriptions are only preferred embodiments of this application and are not intended to limit this application. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of this application shall be included in this application Within the scope of protection.

Claims (11)

  1. 一种处理直播内容的方法,其特征在于,所述方法包括:A method for processing live content, characterized in that the method includes:
    接收主播客户端通过视频通道发送的原始视频流和通过信令通道发送的直播附加信息;Receive the original video stream sent by the host client through the video channel and the additional live broadcast information sent through the signaling channel;
    根据所述直播附加信息将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流;以及Synthesize the original video stream and the live additional information into a live video stream according to the live additional information, and send the live video stream to the viewer client; and
    基于用户行为的评估指标保存所述原始视频流的指定图像帧。The specified image frame of the original video stream is saved based on the evaluation index of the user behavior.
  2. 根据权利要求1所述的方法,其特征在于,所述直播附加信息至少包括:The method according to claim 1, wherein the additional live broadcast information includes at least:
    字幕、用于获取图片的相关信息和/或版权标识。Subtitles, related information and/or copyright identification used to obtain pictures.
  3. 根据权利要求2所述的方法,其特征在于,所述直播附加信息包括直播附加信息的属性信息和位置信息。The method according to claim 2, wherein the live broadcast additional information includes attribute information and location information of the live broadcast additional information.
  4. 根据权利要求3所述的方法,其特征在于,所述合成直播视频流的步骤具体包括:The method according to claim 3, wherein the step of synthesizing a live video stream specifically comprises:
    根据所述属性信息和位置信息将所述原始视频流和所述直播附加信息合成直播视频流。The original video stream and the live broadcast additional information are combined into a live video stream according to the attribute information and location information.
  5. 根据权利要求4所述的方法,其特征在于,所述属性信息包括所述字幕的样式、所述图片的URL和/或所述图片的缩放系数;The method according to claim 4, wherein the attribute information includes the style of the subtitle, the URL of the picture, and/or the zoom factor of the picture;
    所述位置信息包括所述字幕和/或所述图片添加在原始视频流的图像帧上的坐标。The position information includes the coordinates of the subtitle and/or the picture added on the image frame of the original video stream.
  6. 根据权利要求1所述的方法,其特征在于,所述用户行为的评估指标包括送礼数量和/或公屏活跃度。The method according to claim 1, wherein the evaluation index of the user behavior includes the number of gifts and/or public screen activity.
  7. 一种处理直播内容的方法,其特征在于,所述方法包括:A method for processing live content, characterized in that the method includes:
    获取原始视频流;Obtain the original video stream;
    分别将原始视频流和直播附加信息发送给服务器,以使服务器根据所述直播附加信息将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流,以及基于用户行为的评估指标保存所述原始视频流的指定图像帧,其中,所述原始视频流通过视频通道发送,所述直播附加信息通过信令通道发送。Send the original video stream and the live additional information to the server respectively, so that the server synthesizes the original video stream and the live additional information into the live video stream according to the live additional information, sends the live video stream to the viewer client, and The user behavior evaluation index saves the designated image frames of the original video stream, where the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel.
  8. 根据权利要求7所述的方法,其特征在于,所述方法由主播客户端执行,所述方法还包括:The method according to claim 7, wherein the method is executed by the host client, and the method further comprises:
    将所述原始视频流和所述直播附加信息在本地合成为直播视频流,并在本地播放所述直播视频流。The original video stream and the live broadcast additional information are locally synthesized into a live video stream, and the live video stream is played locally.
  9. 一种直播系统,其特征在于,所述直播系统包括主播客户端、服务器、观众客户端;A live broadcast system, characterized in that the live broadcast system includes an anchor client, a server, and an audience client;
    所述主播客户端用于获取原始视频流;The anchor client is used to obtain the original video stream;
    分别将所述原始视频流与直播附加信息发送给服务器,其中,所述原始视频流通过视频通道发送,所述直播附加信息通过信令通道发送;Sending the original video stream and the live broadcast additional information to the server respectively, where the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel;
    所述服务器用于将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流;以及The server is configured to synthesize the original video stream and the additional information of the live broadcast into a live video stream, and send the live video stream to the viewer client; and
    基于用户行为的评估指标保存所述原始视频流的指定图像帧;Saving the designated image frame of the original video stream based on the evaluation index of the user behavior;
    所述观众客户端,用于接收并展示所述直播视频流。The audience client is used to receive and display the live video stream.
  10. 一种处理直播内容的装置,其特征在于,所述装置包括:A device for processing live broadcast content, characterized in that the device includes:
    接收模块,用于接收主播客户端通过视频通道发送的原始视频流和通过信令通道发送的直播附加信息;The receiving module is used to receive the original video stream sent by the host client through the video channel and the live broadcast additional information sent through the signaling channel;
    合成模块,用于根据所述直播附加信息将所述原始视频流和直播附加信息合成直播视频流;A synthesis module, configured to synthesize the original video stream and the live broadcast additional information into a live video stream according to the live broadcast additional information;
    传输模块,用于向观众客户端发送所述直播视频流;The transmission module is used to send the live video stream to the audience client;
    保存模块,用于基于用户行为的评估指标保存所述原始视频流的指定图像帧。The saving module is configured to save the designated image frame of the original video stream based on the evaluation index of the user behavior.
  11. 一种处理直播内容的装置,其特征在于,所述装置包括:A device for processing live broadcast content, characterized in that the device includes:
    获取模块,用于获取原始视频流;The acquisition module is used to acquire the original video stream;
    发送模块,用于分别将所述原始视频流与直播附加信息发送给服务器,以使服务器根据所述直播附加信息信令将所述原始视频流和直播附加信息合成直播视频流,向观众客户端发送所述直播视频流,以及基于用户行为的评估指标保存所述原始视频流的指定图像帧,其中,所述原始视频流通过视频通道发送,所述直播附加信息通过信令通道发送。The sending module is used to send the original video stream and the live additional information to the server respectively, so that the server combines the original video stream and the live additional information into the live video stream according to the live additional information signaling to the viewer client The live video stream is sent, and the designated image frame of the original video stream is saved based on the evaluation index of the user behavior, wherein the original video stream is sent through a video channel, and the live broadcast additional information is sent through a signaling channel.
PCT/CN2020/112856 2019-12-31 2020-09-01 Method and apparatus for processing live streaming content, and system WO2021135334A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911407509.7A CN111083515B (en) 2019-12-31 2019-12-31 Method, device and system for processing live broadcast content
CN201911407509.7 2019-12-31

Publications (1)

Publication Number Publication Date
WO2021135334A1 true WO2021135334A1 (en) 2021-07-08

Family

ID=70320557

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/112856 WO2021135334A1 (en) 2019-12-31 2020-09-01 Method and apparatus for processing live streaming content, and system

Country Status (2)

Country Link
CN (1) CN111083515B (en)
WO (1) WO2021135334A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111083515B (en) * 2019-12-31 2021-07-23 广州华多网络科技有限公司 Method, device and system for processing live broadcast content
CN112084369A (en) * 2020-08-03 2020-12-15 广州数说故事信息科技有限公司 Highlight moment mining method and model based on video live broadcast
CN112087669B (en) * 2020-08-07 2023-03-10 广州方硅信息技术有限公司 Method and device for presenting virtual gift and electronic equipment
CN113490001A (en) * 2020-11-28 2021-10-08 青岛海信电子产业控股股份有限公司 Audio and video data sharing method, server, device and medium
CN112954374B (en) * 2021-01-28 2023-05-23 广州虎牙科技有限公司 Video data processing method and device, electronic equipment and storage medium
CN113691877A (en) * 2021-08-27 2021-11-23 余浪 Live broadcasting method and device
CN113873296A (en) * 2021-09-24 2021-12-31 上海哔哩哔哩科技有限公司 Video stream processing method and device
CN115022654B (en) * 2022-05-18 2024-01-19 北京达佳互联信息技术有限公司 Video editing method and device in live broadcast scene

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050232610A1 (en) * 2004-04-16 2005-10-20 Gateway, Inc. User automated content deletion
CN103686450A (en) * 2013-12-31 2014-03-26 广州华多网络科技有限公司 Video processing method and system
CN105872580A (en) * 2016-04-15 2016-08-17 广州酷狗计算机科技有限公司 Recording method and device of live broadcast video
CN106131591A (en) * 2016-06-30 2016-11-16 广州华多网络科技有限公司 Live broadcasting method, device and terminal
US20170134595A1 (en) * 2015-11-11 2017-05-11 Vivint, Inc. Automated image album
CN106792122A (en) * 2017-02-20 2017-05-31 北京金山安全软件有限公司 Automatic video recording method and device and terminal
CN108289159A (en) * 2017-05-25 2018-07-17 广州华多网络科技有限公司 A kind of terminal live streaming special efficacy add-on system, method and terminal live broadcast system
CN111083515A (en) * 2019-12-31 2020-04-28 广州华多网络科技有限公司 Method, device and system for processing live broadcast content

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9681160B2 (en) * 2011-06-22 2017-06-13 Tout Inc. Method and apparatus for automatically associating media segments with broadcast media streams
CN105282617A (en) * 2014-06-12 2016-01-27 李英元 Video-on-demand system capable of realizing picture identification differentiation
CN105245801A (en) * 2015-09-24 2016-01-13 天脉聚源(北京)科技有限公司 Method for transmitting interactive signals of interactive television system
CN108696474A (en) * 2017-04-05 2018-10-23 杭州登虹科技有限公司 The communication means of multimedia transmission
CN108521584B (en) * 2018-04-20 2020-08-28 广州虎牙信息科技有限公司 Interactive information processing method, device, anchor side equipment and medium
CN110198456B (en) * 2019-04-26 2023-02-07 腾讯科技(深圳)有限公司 Live broadcast-based video pushing method and device and computer-readable storage medium
CN110392226A (en) * 2019-06-19 2019-10-29 视联动力信息技术股份有限公司 A kind of live streaming implementation method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050232610A1 (en) * 2004-04-16 2005-10-20 Gateway, Inc. User automated content deletion
CN103686450A (en) * 2013-12-31 2014-03-26 广州华多网络科技有限公司 Video processing method and system
US20170134595A1 (en) * 2015-11-11 2017-05-11 Vivint, Inc. Automated image album
CN105872580A (en) * 2016-04-15 2016-08-17 广州酷狗计算机科技有限公司 Recording method and device of live broadcast video
CN106131591A (en) * 2016-06-30 2016-11-16 广州华多网络科技有限公司 Live broadcasting method, device and terminal
CN106792122A (en) * 2017-02-20 2017-05-31 北京金山安全软件有限公司 Automatic video recording method and device and terminal
CN108289159A (en) * 2017-05-25 2018-07-17 广州华多网络科技有限公司 A kind of terminal live streaming special efficacy add-on system, method and terminal live broadcast system
CN111083515A (en) * 2019-12-31 2020-04-28 广州华多网络科技有限公司 Method, device and system for processing live broadcast content

Also Published As

Publication number Publication date
CN111083515B (en) 2021-07-23
CN111083515A (en) 2020-04-28

Similar Documents

Publication Publication Date Title
WO2021135334A1 (en) Method and apparatus for processing live streaming content, and system
US10735798B2 (en) Video broadcast system and a method of disseminating video content
US11937010B2 (en) Data segment service
US9794615B2 (en) Broadcast management system
TW482985B (en) Automatic media and advertising system
US20080127272A1 (en) Aggregation of Multiple Media Streams to a User
CN103258557B (en) Display control unit and display control method
JP2023115088A (en) Image file generator, method for generating image file, image generator, method for generating image, image generation system, and program
KR101843815B1 (en) method of providing inter-video PPL edit platform for video clips
US9779306B2 (en) Content playback system, server, mobile terminal, content playback method, and recording medium
JP2009124516A (en) Motion picture editing apparatus, playback device, motion picture editing method, and playback method
CN109874024A (en) A kind of barrage processing method, system and storage medium based on dynamic video poster
KR102069897B1 (en) Method for generating user video and Apparatus therefor
US9264746B2 (en) Content distribution system, content distribution server, content distribution method, software program, and storage medium
JP2015142207A (en) View log recording system and motion picture distribution system
KR100886149B1 (en) Method for forming moving image by inserting image into original image and recording media
JP2005191892A (en) Information acquisition device and multi-media information preparation system using it
US20200366973A1 (en) Automatic Video Preview Creation System
CN111107388A (en) Method, device, system, equipment and storage medium for processing live broadcast content
CN114554232A (en) Mixed reality live broadcast method and system based on naked eye 3D

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20910248

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20910248

Country of ref document: EP

Kind code of ref document: A1