CN112954374A - Video data processing method and device, electronic equipment and storage medium - Google Patents


Info

Publication number
CN112954374A
CN112954374A
Authority
CN
China
Prior art keywords
additional information
video
video data
live
data packet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110121083.XA
Other languages
Chinese (zh)
Other versions
CN112954374B (en)
Inventor
罗创
杨时权
何远清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Huya Technology Co Ltd
Original Assignee
Guangzhou Huya Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Huya Technology Co Ltd filed Critical Guangzhou Huya Technology Co Ltd
Priority to CN202110121083.XA
Publication of CN112954374A
Application granted
Publication of CN112954374B
Active legal status
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21 Server components or server architectures
    • H04N21/218 Source of audio or video content, e.g. local disk arrays
    • H04N21/2187 Live feed
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/231 Content storage operation, e.g. caching movies for short term storage, replicating data over plural servers, prioritizing data for deletion
    • H04N21/23106 Content storage operation involving caching operations
    • H04N21/235 Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2355 Processing of additional data involving reformatting operations of additional data, e.g. HTML pages
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302 Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307 Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/4355 Processing of additional data involving reformatting operations of additional data, e.g. HTML pages on a television screen
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44012 Processing of video elementary streams involving rendering scenes according to scene graphs, e.g. MPEG-4 scene graphs
    • H04N21/4402 Processing of video elementary streams involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440218 Reformatting operations involving transcoding between formats or standards, e.g. from MPEG-2 to MPEG-4

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

An embodiment of the invention provides a video data processing method and apparatus, an electronic device, and a storage medium, relating to the field of internet technology. The video data processing method comprises the following steps: sequentially extracting the video additional information corresponding to each frame of live image in live video data, and storing the video additional information in a buffer; obtaining multiple frames of the video additional information from the buffer and compressing them to obtain an additional-information data packet; and sending the additional-information data packet and the live video data, respectively, to the corresponding clients. In this way, the compression rate of the video additional information is improved, the size of the resulting additional-information data packet is reduced, and fewer transmission resources are occupied during transmission.

Description

Video data processing method and device, electronic equipment and storage medium
Technical Field
The invention relates to the technical field of internet, in particular to a video data processing method and device, electronic equipment and a storage medium.
Background
With the development of internet technology, the content played as video data has become increasingly diverse. In particular, in a live scene, playback is no longer limited to the video picture itself; the video additional information of the video data (for example, outline information of a graphic element the user pays attention to) often needs to be displayed as well.
Currently, the video server encodes and packages the extracted video additional information together with the video pictures of the video data and transmits them to the client used by the user. However, this approach requires that the video additional information obtained for every frame be encoded and packaged. Obviously, on the one hand this increases the compression workload, and on the other hand the size of the transmitted data packets still has room for optimization, which wastes data transmission resources unnecessarily.
Disclosure of Invention
In view of the above, the present invention provides a video data processing method, an apparatus, an electronic device and a storage medium.
In order to achieve the above purpose, the embodiment of the present invention adopts the following technical solutions:
in a first aspect, the present invention provides a video data processing method, applied to a live broadcast server, where the video data processing method includes: sequentially extracting video additional information corresponding to each frame of live broadcast image in live broadcast video data, and storing the video additional information in a cache region; acquiring multiple frames of the video additional information from the buffer area and compressing the video additional information to obtain an additional information data packet; and respectively sending the additional information data packet and the live video data to corresponding clients.
It can be seen that, in the above embodiment, the video data processing method exploits the fact that the video additional information of adjacent live images in the same live video data differs only slightly, and therefore packs and compresses the video additional information corresponding to multiple live images together. In other words, for the same amount of video additional information, the size of the resulting data packet is noticeably reduced after packing and compression under this scheme. On this basis, the packed additional-information data packet and the live video data are sent separately to the client for playback, so that both reach the client while the occupation of transmission resources is reduced.
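The server-side flow of the first aspect can be sketched minimally as follows. This is an illustrative sketch only: the names (`extract_additional_info`, `on_live_frame`, `pack_additional_info`) and the use of JSON plus zlib as the serialization and compression scheme are assumptions, not specified by the patent.

```python
import json
import zlib
from collections import deque

buffer = deque()  # FIFO "cache region" for per-frame video additional information

def extract_additional_info(frame):
    # Stand-in for the real extractor (e.g. outlines of elements of
    # interest); hypothetical, not from the patent.
    return {"outline": frame.get("outline", [])}

def on_live_frame(frame_id, frame):
    # Step 1: extract the additional information for each live image in
    # turn and store it in the buffer.
    buffer.append({"frame_id": frame_id, "info": extract_additional_info(frame)})

def pack_additional_info(n_frames):
    # Step 2: take several frames' worth of additional information out of
    # the buffer and compress them together into one packet.
    frames = [buffer.popleft() for _ in range(min(n_frames, len(buffer)))]
    raw = json.dumps(frames).encode("utf-8")
    return zlib.compress(raw)
```

The packet produced by `pack_additional_info` would then be sent to clients alongside, but separately from, the live video data.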
In an optional embodiment, the step of obtaining multiple frames of the video additional information from the buffer for compression includes: acquiring transmission time delay of the live broadcast image transmitted to the client; acquiring the video additional information from the cache region for compression according to the transmission delay and by combining the acquisition time stamp of the video additional information; and the acquisition time stamp of the video additional information is the same as the acquisition time stamp of the corresponding live broadcast image.
It is understood that live video data in a live scene is captured in real time, so the video additional information corresponding to each live image is also extracted in real time. To pack multiple frames of video additional information extracted in real time, the information must first be placed into the buffer; once multiple frames have accumulated there, they are compressed and packed together. The more video additional information is compressed together, the higher the compression rate, but collecting more of it comes at the cost of a longer stay in the buffer. Meanwhile, the client needs to render each live image in the live video data synchronously with its corresponding video additional information.
Therefore, in the above embodiment, the inherent delay in live video transmission is used to collect, compress, and pack the video additional information, which prevents the video additional information from arriving at the client later than the corresponding live image and thus from affecting the real-time performance of the live broadcast.
The collection and compression and packaging of the video additional information by using the inherent time delay in the live video data transmission process can be realized by adopting the following two modes:
in the first mode, the maximum time difference between the video additional information stored in the cache region is calculated according to the acquisition time stamp of the video additional information stored in the cache region; and under the condition that the maximum time difference is matched with the transmission time delay, the video additional information stored in the cache region is taken out and compressed.
In a second mode, under the condition that the sum of the storage duration of the video additional information written first in the buffer area and the transmission time consumption corresponding to the additional information data packet is equal to the transmission delay, all the video additional information in the buffer area is taken out and compressed.
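The two trigger conditions above can be sketched as simple predicates. This is a hedged illustration: function and field names are invented, timestamps are assumed to be in seconds, and `packet_tx_time` stands for the expected transmission time of the additional-information packet.

```python
def should_pack_mode1(buffer, transmission_delay):
    # Mode 1: pack when the span between the earliest and latest collection
    # timestamps stored in the buffer matches the transmission delay.
    if not buffer:
        return False
    return buffer[-1]["ts"] - buffer[0]["ts"] >= transmission_delay

def should_pack_mode2(buffer, now, packet_tx_time, transmission_delay):
    # Mode 2: pack when the storage duration of the earliest-written entry
    # plus the expected packet transmission time reaches the delay.
    if not buffer:
        return False
    return (now - buffer[0]["ts"]) + packet_tx_time >= transmission_delay
```

Either predicate would be checked as new additional information arrives; when it fires, the whole buffer is taken out and compressed.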
In an optional implementation manner, the step of sending the additional information data packet and the live video data to corresponding clients respectively includes: generating a video data packet according to a live image of the live video data; sending the video data packet to the client through a video transmission channel; and transmitting the additional information data packet to the client through a data packet transmission channel.
In a second aspect, the present invention provides a video data processing method, applied to a client, where the video data processing method includes: receiving a video data packet and an additional information data packet corresponding to live video data; the additional information data packet is a data packet obtained by compressing multi-frame video additional information extracted from the live video data; the video data packet is generated based on a live image of the live video data; and after the video data packet and the additional information data packet are respectively decoded, acquiring the live broadcast image and the video additional information with corresponding identifications so as to synchronously render.
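The client-side matching-by-identifier step in the second aspect might look like the following minimal sketch. All names are illustrative, and JSON plus zlib are assumed only to mirror the server-side sketch; the patent does not prescribe a format.

```python
import json
import zlib

def handle_additional_info_packet(packet, pending_info):
    # Decompress the additional-information packet back into per-frame
    # entries, indexed by frame identifier for later matching.
    for entry in json.loads(zlib.decompress(packet).decode("utf-8")):
        pending_info[entry["frame_id"]] = entry["info"]

def render(frame_id, image, pending_info):
    # Pair the decoded live image with the additional information carrying
    # the corresponding identifier, so the two are rendered in sync.
    return image, pending_info.pop(frame_id, None)
```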
In an optional embodiment, the video data processing method further includes: periodically calculating transmission time delay according to the acquired acquisition time stamp and the universal time of the live broadcast image; and feeding back the transmission delay to a live broadcast server to guide the live broadcast server to create the additional information data packet.
In a third aspect, the present invention provides a video data processing apparatus applied to a live broadcast server, the video data processing apparatus comprising: the extraction module is used for sequentially extracting video additional information corresponding to each frame of live broadcast image in the live broadcast video data and storing the video additional information into a cache region; the compression module is used for acquiring multiple frames of the video additional information from the cache region and compressing the video additional information to obtain an additional information data packet; and the sending module is used for sending the additional information data packet and the live video data to corresponding clients respectively.
In a fourth aspect, the present invention provides a video data processing apparatus applied to a client, the video data processing apparatus comprising:
the receiving module is used for receiving a video data packet and an additional information data packet corresponding to the live video data; the additional information data packet is a data packet obtained by compressing multi-frame video additional information extracted from the live video data; the video data packet is generated based on a live image of the live video data;
and the rendering module is used for acquiring the live broadcast image and the video additional information with corresponding identifications after respectively decoding the video data packet and the additional information data packet so as to synchronously render.
In a fifth aspect, the present invention provides an electronic device comprising a processor and a memory, the memory storing machine executable instructions executable by the processor to implement the method of any one of the preceding embodiments.
In a sixth aspect, the invention provides a computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the method according to any one of the preceding embodiments.
In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.
Fig. 1 shows a live scene diagram provided in an embodiment of the present invention.
Fig. 2 shows a schematic diagram of an electronic device provided by an embodiment of the invention.
Fig. 3 is a flowchart illustrating steps of a video data processing method applied to a live server according to an embodiment of the present invention.
Fig. 4 is a diagram illustrating an example of encoding video additional information in the related art.
Fig. 5 is a diagram illustrating an example of merging and compressing additional video information according to an embodiment of the present invention.
Fig. 6 shows a flowchart of sub-steps of step S102 provided by the embodiment of the present invention.
Fig. 7 conceptually illustrates an exemplary view of the transmission of video data packets and additional information data packets to a viewer's client.
Fig. 8 shows a schematic diagram of a video data processing apparatus applied to a live server according to an embodiment of the present invention.
Fig. 9 is a flowchart illustrating steps of a video data processing method applied to a client according to an embodiment of the present invention.
Fig. 10 is a schematic diagram illustrating a video data processing apparatus applied to a client according to an embodiment of the present invention.
Icon: 100-an electronic device; 110-a memory; 120-a processor; 130-a communication module; 400-video data processing means; 401-an extraction module; 402-a compression module; 403-a sending module; 500-video data processing means; 501-a receiving module; 502-rendering module.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
It is noted that relational terms such as "first" and "second," and the like, may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
In order to optimize the transmission scheme for video additional information corresponding to video data, and particularly for video additional information corresponding to live video data in a live scene, embodiments of the present invention provide a video data processing method and apparatus, an electronic device, and a storage medium.
The application scene of the video data processing method and apparatus may be any scene in which image information collected in real time needs to be processed. For example, it may include, but is not limited to, live broadcast scenes, monitoring scenes, and the like.
The following describes a video data processing method and apparatus, an electronic device 100, and a storage medium provided by embodiments of the present invention, taking an example in which the video data processing method and apparatus are applied to a live broadcast scene shown in fig. 1. As shown in fig. 1, a live client (also called a main broadcast) collects live image frames to generate live video data, and sends the live video data to a video server in the form of a video stream. And after the video server carries out relevant processing on the live video data, the live video data are sent to the corresponding audience client. The viewer client may be a client used by a viewer entering a live room created by a live client.
It should be noted that the live client and the viewer client are essentially both clients that obtain services from the video server; they differ only in the roles they assume in the live activity. Each client can switch between being a live client and a viewer client according to the user's choice. For example, if user A uses client A to broadcast, client A is a live client, and client B, used by user B who enters the live room created by user A, is a viewer client. Conversely, when user A uses client A to enter a live room created by user B, client B, used by the broadcasting user B, is the live client and client A is a viewer client.
The video data processing method and device provided by the embodiment of the invention can be applied to the live broadcast server and the corresponding audience client in the figure 1.
The live server may be, but is not limited to, a Personal Computer (PC), a server, a distributed computer, and the like. It is to be understood that the server is not limited to a physical server, and may also be a virtual machine on the physical server, a virtual machine built on a cloud platform, or the like, which can provide the same function as the server or the virtual machine.
The client may be an intelligent device with image capturing and displaying functions, for example, the client may include a mobile phone, an intelligent home device, a wearable device, an intelligent mobile device, a virtual reality device, an augmented reality device, or the like, and may also be a tablet computer, a laptop computer, or a built-in device in a motor vehicle, or the like.
For convenience of describing a hardware environment for operating the video data processing method and apparatus, the live broadcast server and the corresponding viewer client are both referred to as the electronic device 100 below.
Referring to fig. 2, fig. 2 is a block diagram of the electronic device 100. The electronic device 100 includes a memory 110, a processor 120, and a communication module 130. The memory 110, the processor 120 and the communication module 130 are electrically connected to each other directly or indirectly to realize data transmission or interaction. For example, the components may be electrically connected to each other via one or more communication buses or signal lines.
The memory 110 is used to store programs or data. The memory 110 may be, but is not limited to, Random Access Memory (RAM), Read-Only Memory (ROM), Programmable Read-Only Memory (PROM), Erasable Programmable Read-Only Memory (EPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), and the like.
The processor 120 is used to read/write data or programs stored in the memory 110 and perform corresponding functions.
The communication module 130 is configured to establish a communication connection between the electronic device 100 and another communication terminal through the network, and to transmit and receive data through the network.
It should be understood that the structure shown in fig. 2 is only a schematic structural diagram of the electronic device 100, and the electronic device 100 may also include more or fewer components than shown in fig. 2, or have a different configuration than shown in fig. 2. The components shown in fig. 2 may be implemented in hardware, software, or a combination thereof.
First embodiment
Referring to fig. 3, fig. 3 shows a video data processing method according to an embodiment of the invention. The video data processing method may be applied to a live server, and as shown in fig. 3, the video data processing method may include the following steps:
and S101, sequentially extracting video additional information corresponding to each frame of live broadcast image in the live broadcast video data, and storing the video additional information into a cache region.
The live video data may be a video stream received by a live server. In a live scene, the live video data may be a video stream collected and transmitted by a live client. In a monitoring scene, the live video data may be a video stream collected by a camera installed in a monitored field.
The live video data is composed of a plurality of frames of live images which are sequentially sequenced according to the acquisition time.
The video additional information may be information of interest to the user extracted from the live image, or related information written for the live image. For example, in a live scene, the video additional information may be a purchase link, to be added to the live image, for clothing appearing in the live image. It may also be description information for the live image, for example the residual between the live image and the previous live image, or texture-detail-region information in the video frame.
The buffer may be a first-in first-out storage area for temporary data storage.
In some embodiments, the corresponding video additional information may be extracted in turn from each frame of live image of the live video data. Each extracted frame of video additional information corresponding to a live image (hereinafter simply called a frame of video additional information for convenience of description) is written into the buffer.
Unlike the related art, the extracted video additional information is not directly encoded and packaged with the live image; instead, it is first written into the buffer. It can be understood that live video data is real-time in nature: each frame of live image is captured in real time, and correspondingly each frame of video additional information is extracted in real time, so in order to accumulate multiple frames of video additional information, the extracted information needs to be written into the buffer.
Step S102, obtaining multi-frame video additional information from the buffer area and compressing to obtain an additional information data packet.
The additional information packet is a packet obtained by compressing the video additional information.
In some embodiments, after buffering the multiple frames of video additional information in the buffer area, the buffered multiple frames of video additional information may be taken out, and subjected to merging, compression, and packaging processes to obtain an additional information data packet.
In the related art, the video additional information needs to be encoded and packaged together with the live image, which means that one additional-information data packet is generated for every frame of video additional information extracted, as shown in fig. 4. Although the transmission resources required by a single additional-information packet are not large, the compression rate of a single packet is low, so many such packets together occupy more transmission resources.
Since the picture content of adjacent live images usually differs little, the corresponding video additional information extracted from adjacent live images also differs little. Multiple frames of video additional information are therefore merged and compressed together into one integrated additional-information data packet, as shown in fig. 5. Compared with the multiple additional-information packets in fig. 4, the compression rate of the integrated packet is higher, its data volume is smaller, and it requires fewer transmission resources.
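The compression-rate advantage of merging before compressing can be demonstrated numerically. The sketch below compares compressing ten similar per-frame records one at a time against compressing them merged, using zlib and JSON purely as illustrative stand-ins for whatever codec a real implementation would use.

```python
import json
import zlib

# Ten frames of similar additional information (adjacent frames differ little).
frames = [{"frame_id": i, "outline": [10, 20, 30, 40]} for i in range(10)]

# Related-art style: compress each frame's additional information separately.
per_frame_total = sum(
    len(zlib.compress(json.dumps(f).encode("utf-8"))) for f in frames
)

# This scheme: merge the frames first, then compress them once, so the
# redundancy between adjacent frames is exploited by the compressor.
merged_total = len(zlib.compress(json.dumps(frames).encode("utf-8")))
```

With this kind of highly repetitive input, `merged_total` comes out well below `per_frame_total`, which is the effect fig. 5 illustrates relative to fig. 4.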
As an embodiment, in step S102, if the amount of the video additional information stored in the buffer exceeds a specified threshold, all the video additional information in the buffer is taken out, merged, compressed, and packed to obtain an additional information data packet.
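A minimal sketch of this threshold-triggered embodiment follows; all names are illustrative, and JSON plus zlib stand in for the real serialization and compression.

```python
import json
import zlib
from collections import deque

class AdditionalInfoBuffer:
    """Illustrative buffer for step S102: accumulate per-frame video
    additional information and emit one compressed additional-information
    data packet once enough frames have gathered."""

    def __init__(self, threshold):
        self.threshold = threshold
        self.frames = deque()  # first-in first-out, as described

    def write(self, info):
        """Store one frame's additional information; return a packed data
        packet when the count reaches the threshold (the text says
        "exceeds"; the exact boundary is an implementation choice)."""
        self.frames.append(info)
        if len(self.frames) >= self.threshold:
            merged = list(self.frames)
            self.frames.clear()  # the buffer is emptied after take-out
            return zlib.compress(json.dumps(merged).encode())
        return None

buf = AdditionalInfoBuffer(threshold=3)
packets = [buf.write({"frame": i}) for i in range(7)]
# Packets are produced after frames 0-2 and 3-5; frame 6 stays buffered.
assert [p is not None for p in packets] == [False, False, True, False, False, True, False]
assert json.loads(zlib.decompress(packets[2])) == [{"frame": 0}, {"frame": 1}, {"frame": 2}]
```

The final assertion also shows that the client side can recover the merged frames from one packet with a single decompression.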
The more video additional information is compressed together, the higher the compression rate; collecting more frames, however, also means the information stays in the buffer longer. Meanwhile, each live image and its corresponding video additional information generally need to be rendered synchronously. Although there is already some transmission delay between the live server sending the live video data and the viewer client playing it, if the video additional information reaches the client later than its corresponding live image, it further aggravates the live delay.
Therefore, when collecting video additional information in the buffer, the transmission delay of the live video data needs to be taken into account. That is, the transmission delay is used to trigger the packing of the video additional information in the buffer.
That is, as another embodiment, as shown in fig. 6, the step S102 may further include the following sub-steps:
Sub-step S102-1: obtaining the transmission delay of the live image transmitted to the client.
In some embodiments, sub-step S102-1 may determine the corresponding transmission delay according to the data transmission speed measured between the live server and the viewer client.
In other embodiments, sub-step S102-1 may instead derive the corresponding transmission delay from the decoding time of each live video frame fed back by the viewer client and that frame's capture time.
Sub-step S102-2: according to the transmission delay, in combination with the capture timestamps of the video additional information, obtaining the video additional information from the buffer and compressing it.
The capture timestamp of a piece of video additional information is the same as that of its corresponding live image.
As one way, the sub-step S102-2 may be:
(1) Calculate the maximum time difference among the video additional information stored in the buffer according to the capture timestamps of the stored information. It can be understood that the buffer stores video additional information in first-in first-out order, so the maximum time difference is simply the capture-time difference between the earliest-written and latest-written video additional information in the buffer.
It should be noted that the processing time from extracting the video additional information of a live image to storing it in the buffer is almost the same for every frame; the capture-time difference between two pieces of video additional information therefore also represents the difference between their buffer-write times. Consequently, while the video stream is processed continuously, the maximum time difference in the buffer represents how long the first-written video additional information has stayed there. Of course, this maximum time difference keeps growing as new video additional information is written into the buffer.
(2) When the maximum time difference matches the transmission delay, take out the video additional information stored in the buffer and compress it.
In some embodiments, the maximum time difference is considered to match the transmission delay when the difference between the two does not exceed the maximum transmission time of the additional information data packet.
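Way (1) can be sketched as follows. The exact matching rule is one reading of the text — flush once the headroom left against the transmission delay no longer exceeds the packet's worst-case transmission time — and all numbers and names are illustrative.

```python
def should_flush(timestamps_ms, transmission_delay_ms, max_packet_tx_ms):
    """Sketch of way (1) of sub-step S102-2. The buffer is FIFO, so the
    maximum time difference is the last-written minus the first-written
    capture timestamp; flush once the remaining headroom against the
    transmission delay no longer exceeds the packet's worst-case
    transmission time (an interpretation of the matching condition)."""
    if not timestamps_ms:
        return False
    max_time_diff = timestamps_ms[-1] - timestamps_ms[0]
    return transmission_delay_ms - max_time_diff <= max_packet_tx_ms

# Frames captured every 33 ms; end-to-end delay 500 ms; the packet is
# assumed to need at most 100 ms to transmit.
stamps = []
flushed_at = None
for i in range(30):
    stamps.append(33 * i)
    if should_flush(stamps, 500, 100):
        flushed_at = i
        break

assert flushed_at == 13      # max diff 429 ms; 500 - 429 <= 100 first holds here
```

Flushing at this point leaves the first-written additional information just enough time to reach the client alongside its live image.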
Alternatively, sub-step S102-2 may be: when the sum of the storage duration of the earliest-written video additional information in the buffer and the transmission time of the additional information data packet equals the transmission delay, take out all the video additional information in the buffer and merge, compress, and package it.
It is understood that the buffer is emptied after the video additional information is taken out, after which new video additional information continues to be received and buffered.
In addition, if the video additional information corresponding to the last frame of the live stream is received before the maximum time difference has matched the transmission delay, taking out and compressing the buffered video additional information can also be triggered.
And step S103, respectively sending the additional information data packet and the live video data to corresponding clients.
In some embodiments, the additional information data packets and the live video data may be transmitted to the corresponding viewer client using different transmission links.
Alternatively, a video data packet may be generated from the live images of the live video data. As shown in fig. 7, video data packets are transmitted to the corresponding viewer client through a video transmission channel, while each additional information data packet, once generated, is transmitted to the corresponding viewer client through a data packet transmission channel. There is no required ordering between the two transmissions.
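The dual-channel delivery of fig. 7 can be sketched with two in-process queues standing in for the real transport links; this is purely illustrative, since the text does not specify the transport.

```python
import queue

# Two independent channels, as in fig. 7: one for video data packets,
# one for additional-information data packets. Queues are stand-ins
# for real network links (an assumption for illustration).
video_channel = queue.Queue()
packet_channel = queue.Queue()

def send_video_packet(pkt):
    video_channel.put(pkt)

def send_additional_info_packet(pkt):
    packet_channel.put(pkt)

# Either channel may deliver first; the client pairs items later by
# their corresponding identifiers.
send_additional_info_packet(b"info-frames-0-24")
send_video_packet(b"frame-0")

assert video_channel.get() == b"frame-0"
assert packet_channel.get() == b"info-frames-0-24"
```

Keeping the channels independent means a large additional-information packet never blocks delivery of the video frames themselves.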
To carry out the corresponding steps of the above embodiments and their possible variants, an implementation of the video data processing apparatus 400 is given below; optionally, the video data processing apparatus 400 may adopt the device structure of the electronic device 100 shown in fig. 2. Referring to fig. 8, fig. 8 is a functional block diagram of a video data processing apparatus 400 according to an embodiment of the present invention. It should be noted that the basic principle and technical effects of the video data processing apparatus 400 provided in this embodiment are the same as those of the above embodiments; for brevity, matters not mentioned here can be found in the corresponding content above. The video data processing apparatus 400 includes: an extraction module 401, a compression module 402, and a sending module 403.
The extracting module 401 is configured to sequentially extract video additional information corresponding to each frame of live broadcast image in the live broadcast video data, and store the video additional information in a buffer area.
In some embodiments, the above step S101 may be performed by the extraction module 401.
A compressing module 402, configured to obtain multiple frames of the video additional information from the buffer and compress the obtained additional information to obtain an additional information data packet.
In some embodiments, step S102 described above may be performed by the compression module 402.
A sending module 403, configured to send the additional information data packet and the live video data to corresponding clients respectively.
In some embodiments, the step S103 may be performed by the sending module 403.
Second embodiment
Referring to fig. 9, fig. 9 shows a video data processing method according to an embodiment of the invention. The video data processing method may be applied to a client, as shown in fig. 9, and the video data processing method may include the following steps:
step S201, a video data packet and an additional information data packet corresponding to the live video data are received.
The additional information data packet is a data packet obtained by compressing the multi-frame video additional information extracted from the live video data.
The video data packet is generated based on a live image of the live video data.
In some embodiments, the generation process of the additional information packet may refer to the description in the first embodiment.
Step S202, after the video data packet and the additional information data packet are decoded respectively, the live broadcast image and the video additional information with corresponding identification are obtained for synchronous rendering.
The corresponding identifier marks a live image and a piece of video additional information that have a correspondence relationship; video additional information usually corresponds to the live image from which it was extracted.
In some embodiments, the corresponding identifier may be a shared capture timestamp: when a live image and a piece of video additional information have the same capture timestamp, they are treated as carrying corresponding identifiers.
In other embodiments, the corresponding identifier may also be a live video frame sequence number written in the video additional information, so that it is convenient to determine a live video and the video additional information having a corresponding relationship.
In some embodiments, video data packets are decoded as soon as they are received, and likewise additional information data packets are decoded immediately on receipt. Among the decoded live images and video additional information, those carrying corresponding identifiers are then determined and rendered synchronously.
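A sketch of this pairing logic, assuming the capture timestamp serves as the corresponding identifier; the structures and names are illustrative, not taken from the text.

```python
# Decoded live images and decoded video additional information arrive
# on separate paths and are paired by capture timestamp before rendering.
pending_info = {}    # capture timestamp -> additional info
pending_frames = {}  # capture timestamp -> decoded image

def on_decoded(frames=None, infos=None, render=lambda f, i: None):
    """Register newly decoded items and render every matched pair."""
    for ts, img in (frames or {}).items():
        pending_frames[ts] = img
    for ts, info in (infos or {}).items():
        pending_info[ts] = info
    rendered = []
    for ts in sorted(set(pending_frames) & set(pending_info)):
        render(pending_frames.pop(ts), pending_info.pop(ts))
        rendered.append(ts)
    return rendered

# The additional-information packet carries several frames at once, so
# its timestamps may arrive before or after the matching images.
assert on_decoded(frames={100: "img100"}) == []
assert on_decoded(infos={100: "kp100", 133: "kp133"}) == [100]
assert on_decoded(frames={133: "img133"}) == [133]
```

Unmatched items simply wait in the pending maps until their counterpart is decoded, which tolerates the two channels delivering in either order.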
In some embodiments, the video data processing method may further include:
Step S301, periodically calculating the transmission delay according to the capture timestamp of a live image and the universal time at which the live image is decoded.
The universal time is the objective wall-clock time at which the viewer client decodes the live image. The transmission delay is calculated by recording the capture timestamp of a live image together with the corresponding universal time.
In other embodiments, to reduce the transmission load of the live video data, the capture timestamp of a live image may be carried in the corresponding video additional information; for example, the video additional information includes the frame number and the capture timestamp of its corresponding live image. The capture timestamp can then be obtained periodically from the video additional information, the universal time at which the live image with that frame number was decoded can be looked up, and the transmission delay can finally be calculated from the capture timestamp and the corresponding universal time.
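Step S301 can be sketched as below, assuming the capture timestamps and the client's wall clock share a common epoch; averaging several samples per reporting period smooths per-frame jitter. The names are illustrative.

```python
def estimate_delay_ms(samples):
    """Estimate the transmission delay from (capture_timestamp_ms,
    decode_wallclock_ms) pairs collected over one reporting period.
    Averaging reduces the effect of per-frame jitter."""
    deltas = [decoded - captured for captured, decoded in samples]
    return sum(deltas) / len(deltas)

# Three frames captured 33 ms apart, each decoded roughly 480 ms later.
samples = [(0, 480), (33, 515), (66, 550)]
delay = estimate_delay_ms(samples)
assert abs(delay - 482.0) < 1e-9
```

The resulting delay is what the client would feed back to the live server in step S302 to guide the packing of the additional information data packet.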
Step S302, the transmission delay is fed back to the live broadcast server to guide the live broadcast server to create an additional information data packet.
It should be noted that, for how the transmission delay instructs the live server to create the additional information data packet, reference may be made to step S102 in the first embodiment; details are not repeated here.
To carry out the corresponding steps of the above embodiments and their possible variants, an implementation of the video data processing apparatus 500 is given below; optionally, the video data processing apparatus 500 may adopt the device structure of the electronic device 100 shown in fig. 2. Referring to fig. 10, fig. 10 is a functional block diagram of a video data processing apparatus 500 according to an embodiment of the present invention. It should be noted that the basic principle and technical effects of the video data processing apparatus 500 provided in this embodiment are the same as those of the above embodiments; for brevity, matters not mentioned here can be found in the corresponding content above. The video data processing apparatus 500 includes: a receiving module 501 and a rendering module 502.
A receiving module 501, configured to receive a video data packet and an additional information data packet corresponding to live video data; the additional information data packet is a data packet obtained by compressing multi-frame video additional information extracted from the live video data; the video data packet is generated based on a live image of the live video data.
In some embodiments, the step S201 may be performed by the receiving module 501.
And a rendering module 502, configured to obtain the live broadcast image and the video additional information with the corresponding identifiers after decoding the video data packet and the additional information data packet, respectively, so as to perform rendering synchronously.
In some embodiments, the above step S202 may be performed by the rendering module 502.
In some embodiments, the video data processing apparatus further comprises:
and the calculation module is used for periodically calculating the transmission time delay according to the acquired acquisition time stamp of the live broadcast image and the universal time for decoding the live broadcast image.
And the feedback module is used for feeding back the transmission delay to the live broadcast server so as to guide the live broadcast server to create the additional information data packet.
Alternatively, the modules may be stored in the memory 110 shown in fig. 2 in the form of software or Firmware (Firmware) or be fixed in an Operating System (OS) of the electronic device 100, and may be executed by the processor 120 in fig. 2. Meanwhile, data, codes of programs, and the like required to execute the above-described modules may be stored in the memory 110.
In summary, embodiments of the present invention provide a video data processing method and apparatus, an electronic device, and a storage medium. The method sequentially extracts the video additional information corresponding to each frame of live image in live video data and stores it in a buffer; obtains multiple frames of the video additional information from the buffer and compresses them to obtain an additional information data packet; and sends the additional information data packet and the live video data to the corresponding clients respectively. Because the packed additional information data packet and the live video data are sent to the client separately for playback, both can reach the client while the occupation of transmission resources is reduced.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method can be implemented in other ways. The apparatus embodiments described above are merely illustrative, and for example, the flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In addition, the functional modules in the embodiments of the present invention may be integrated together to form an independent part, or each module may exist separately, or two or more modules may be integrated to form an independent part.
The functions, if implemented in the form of software functional modules and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (11)

1. A video data processing method is applied to a live broadcast server, and comprises the following steps:
sequentially extracting video additional information corresponding to each frame of live broadcast image in live broadcast video data, and storing the video additional information in a cache region;
acquiring multiple frames of the video additional information from the buffer area and compressing the video additional information to obtain an additional information data packet;
and respectively sending the additional information data packet and the live video data to corresponding clients.
2. The method of claim 1, wherein the step of retrieving the frames of the video additional information from the buffer for compression comprises:
acquiring transmission time delay of the live broadcast image transmitted to the client;
acquiring the video additional information from the cache region for compression according to the transmission delay and by combining with the acquisition timestamp of the video additional information;
and the acquisition time stamp of the video additional information is the same as the acquisition time stamp of the corresponding live broadcast image.
3. The video data processing method according to claim 2, wherein the step of obtaining the video additional information from the buffer for compression according to the transmission delay and with reference to the collection timestamp of the video additional information comprises:
calculating the maximum time difference between the video additional information stored in the cache region according to the acquisition time stamp of the video additional information stored in the cache region;
and under the condition that the maximum time difference is matched with the transmission time delay, the video additional information stored in the cache region is taken out and compressed.
4. The video data processing method according to claim 2, wherein the step of obtaining the video additional information from the buffer for compression according to the transmission delay and with reference to the collection timestamp of the video additional information comprises:
and under the condition that the sum of the storage duration of the video additional information written first in the cache region and the transmission time consumption corresponding to the additional information data packet is equal to the transmission delay, taking out all the video additional information in the cache region and compressing.
5. The video data processing method according to claim 1, wherein the step of sending the additional information data packet and the live video data to the corresponding client, respectively, comprises:
generating a video data packet according to a live image of the live video data;
sending the video data packet to the client through a video transmission channel;
and transmitting the additional information data packet to the client through a data packet transmission channel.
6. A video data processing method is applied to a client, and comprises the following steps:
receiving a video data packet and an additional information data packet corresponding to live video data; the additional information data packet is a data packet obtained by compressing multi-frame video additional information extracted from the live video data; the video data packet is generated based on a live image of the live video data;
and after the video data packet and the additional information data packet are respectively decoded, acquiring the live broadcast image and the video additional information with corresponding identifications so as to synchronously render.
7. The video data processing method according to claim 6, wherein the video data processing method further comprises:
periodically calculating transmission time delay according to the acquired acquisition time stamp of the live broadcast image and the universal time for decoding the live broadcast image;
and feeding back the transmission delay to a live broadcast server to guide the live broadcast server to create the additional information data packet.
8. A video data processing apparatus, applied to a live server, the video data processing apparatus comprising:
the extraction module is used for sequentially extracting video additional information corresponding to each frame of live broadcast image in the live broadcast video data and storing the video additional information into a cache region;
the compression module is used for acquiring multiple frames of the video additional information from the cache region and compressing the video additional information to obtain an additional information data packet;
and the sending module is used for sending the additional information data packet and the live video data to corresponding clients respectively.
9. A video data processing apparatus applied to a client, the video data processing apparatus comprising:
the receiving module is used for receiving a video data packet and an additional information data packet corresponding to the live video data; the additional information data packet is a data packet obtained by compressing multi-frame video additional information extracted from the live video data; the video data packet is generated based on a live image of the live video data;
and the rendering module is used for acquiring the live broadcast image and the video additional information with corresponding identifications after respectively decoding the video data packet and the additional information data packet so as to synchronously render.
10. An electronic device comprising a processor and a memory, the memory storing machine executable instructions executable by the processor to perform the method of any one of claims 1 to 7.
11. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the method according to any one of claims 1-7.
CN202110121083.XA 2021-01-28 2021-01-28 Video data processing method and device, electronic equipment and storage medium Active CN112954374B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110121083.XA CN112954374B (en) 2021-01-28 2021-01-28 Video data processing method and device, electronic equipment and storage medium


Publications (2)

Publication Number Publication Date
CN112954374A true CN112954374A (en) 2021-06-11
CN112954374B CN112954374B (en) 2023-05-23

Family

ID=76238981

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110121083.XA Active CN112954374B (en) 2021-01-28 2021-01-28 Video data processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN112954374B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114173171A (en) * 2021-12-08 2022-03-11 上海智驾汽车科技有限公司 Multi-channel big data information synchronous storage and playback method

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090175594A1 (en) * 2008-01-07 2009-07-09 Samsung Electronics Co., Ltd. Method and apparatus for providing additional information of video using visible light communication
CN103905846A (en) * 2012-12-25 2014-07-02 中国电信股份有限公司 Content pushing method based on IPTV and server
WO2017088415A1 (en) * 2015-11-25 2017-06-01 乐视控股(北京)有限公司 Method, apparatus and electronic device for video content retrieval
WO2017206398A1 (en) * 2016-05-31 2017-12-07 乐视控股(北京)有限公司 Method and device for video sharing
CN107809682A (en) * 2017-11-14 2018-03-16 北京威速科技有限公司 It is a kind of to adjust the video flow control method for sending amount of buffered data
CN108668158A (en) * 2017-03-31 2018-10-16 武汉斗鱼网络科技有限公司 A kind of barrage synchronous method, client and barrage server
CN108989907A (en) * 2018-08-22 2018-12-11 深圳市网心科技有限公司 Live data streams processing method and system, electronic device and computer readable storage medium
CN110392274A (en) * 2019-07-17 2019-10-29 咪咕视讯科技有限公司 Information processing method, equipment, client, system and storage medium
CN111010614A (en) * 2019-12-26 2020-04-14 北京奇艺世纪科技有限公司 Method, device, server and medium for displaying live caption
CN111083515A (en) * 2019-12-31 2020-04-28 广州华多网络科技有限公司 Method, device and system for processing live broadcast content
WO2020103203A1 (en) * 2018-11-19 2020-05-28 网宿科技股份有限公司 Method and apparatus for synchronizing whiteboard with video, computing device, and storage medium
US10715860B1 (en) * 2017-11-29 2020-07-14 Twitch Interactive, Inc. Video stream with additional content areas
CN112272303A (en) * 2020-09-23 2021-01-26 北京五八信息技术有限公司 Live content display method and device, electronic equipment and storage medium


Also Published As

Publication number Publication date
CN112954374B (en) 2023-05-23


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant