WO2015117572A1

WO2015117572A1 - Labelling method for moving objects of concentrated video, and playing method and device

Info

Publication number: WO2015117572A1
Application number: PCT/CN2015/072793
Authority: WO
Inventors: 李辉
Original assignee: 中兴通讯股份有限公司
Priority date: 2014-07-28
Filing date: 2015-02-11
Publication date: 2015-08-13
Also published as: CN105323501A

Abstract

A labelling method for moving objects of a concentrated video, and a playing method and device. The method in the embodiments of the present invention comprises: acquiring labelling information about all moving objects in a concentrated video frame and relative playing time of the concentrated video frame; respectively encapsulating the concentrated video frame and the labelling information about all the moving objects in the concentrated video frame into a media data packet and a labelling information packet; and according to the relative playing time of the concentrated video frame, establishing a correlation between the media data packet and the labelling information packet of the concentrated video frame. By respectively encapsulating and writing labelling information about moving objects of a concentrated video frame and data of the concentrated video frame into a concentrated video file, the step of synthesizing the two is omitted, the processing efficiency is improved, and the labelling information can be conveniently saved and transmitted together with the video information, such that the moving objects are directly labelled in the process of playing the concentrated video, thereby improving the labelling efficiency and guaranteeing the playing efficiency.

Description

Moving target marking method, playing method and device for concentrated video

Technical field

Embodiments of the present invention relate to the field of video processing technologies, and in particular, to a moving target labeling method, a playing method, and a device for a concentrated video.

Background technique

Video concentration extracts the essence of the activity events in a video and discards footage that has no value. By analyzing and merging the footage, all activity targets can be viewed within a short time, and the moving targets can be marked with arrows, circles, and the like.

At present, there are two methods for labeling moving targets in concentrated video:

One is to synthesize the video frame data and the moving target annotation of the video frame during the video concentration analysis process. The disadvantage of this processing method is that the processing performance is expensive, and the annotation information cannot be separated from the concentrated video frame.

The other is to write the annotation information of the moving targets of each video frame into a description file during the video concentration analysis process; when the player plays the concentrated video, it marks the moving targets of each video frame according to the information in the description file. The disadvantage of this processing method is that the description file is relatively large, which makes it inconvenient to save and transmit. Moreover, marking the moving targets of each frame during playback requires searching the description file for the relevant information before the player can apply the annotation, which degrades playback performance.

Summary of the invention

In order to solve the above technical problem, an embodiment of the present invention provides a moving target labeling method, a playing method, and a device for a concentrated video, which separate the concentrated video frame data from the annotation information while allowing the annotation information to be conveniently saved and transmitted together with the video information, thereby improving the labeling efficiency and ensuring the playback performance.

In order to achieve the above technical purpose, the present invention provides a moving target labeling method for a concentrated video, comprising: acquiring annotation information of all moving targets in a concentrated video frame and relative playing time of the concentrated video frame;

Encapsulating the concentrated video frame and the annotation information of all moving targets of the concentrated video frame into a media data packet and an annotation information packet, respectively;

Establishing an association between the media data packet of the concentrated video frame and the annotation information packet according to the relative playing time of the concentrated video frame.

Optionally, it also includes:

Saving the media data packet, the annotation information packet, and the association between the media data packet and the annotation information packet respectively into the concentrated video file to obtain a target concentrated video file.

Optionally, acquiring the annotation information of all moving targets in a concentrated video frame and the relative playing time of the concentrated video frame includes:

Concentrating the original video file to obtain a concentrated video frame;

Performing a moving target analysis on the concentrated video frame to extract moving targets; wherein the moving targets include: a moving sub-target that does not overlap with other moving sub-targets in the concentrated video frame, and a plurality of moving sub-targets having an overlapping relationship;

Obtaining the annotation information of each moving target separately, and determining the annotation information of all the moving targets.

Optionally, the annotation information of the moving target includes: coordinates of the moving target, a height of the moving target, and a width of the moving target.

Optionally, the encapsulating the condensed video frame into a media data packet comprises: packaging the condensed video frame into a media data packet in a first preset format.

Optionally, encapsulating the annotation information of all moving targets of the concentrated video frame into the annotation information packet includes:

Encapsulating the annotation information of all moving targets of the concentrated video frame into an annotation information packet in a second preset format.

The invention also provides a method for playing a concentrated video, comprising:

Parsing the target concentrated video file to obtain a concentrated video frame and a relative playing time of the concentrated video frame;

Obtaining, according to the relative play time, an annotation information packet associated with all moving targets of the concentrated video frame;

Parsing the annotation information packet to determine annotation information of a moving target of the concentrated video frame;

Superimposing and displaying the concentrated video frame and the annotation information of the moving targets of the concentrated video frame according to the relative playing time, to complete the playback.

The annotation information of the moving target includes: coordinates of the moving target, height of the moving target, and width of the moving target.

The invention further provides a moving target marking device for concentrating video, comprising:

An obtaining module, configured to obtain annotation information of all moving targets in a concentrated video frame and a relative playing time of the concentrated video frame;

The encapsulation module is configured to encapsulate the concentrated video frame and the annotation information of all the moving targets of the concentrated video frame into a media data packet and an annotation information packet, respectively;

The association module is configured to establish an association between the media data packet of the concentrated video frame and the labeling information packet according to the relative playing time of the concentrated video frame.

Optionally, it also includes:

The saving module is configured to save the media data packet, the annotation information packet, and the association between the media data packet and the annotation information packet respectively into the concentrated video file to obtain the target concentrated video file.

Optionally, the obtaining module includes:

The concentrating module is configured to perform concentration processing on the original video file to obtain a concentrated video frame;

An extraction module, configured to perform a moving target analysis on the concentrated video frame and extract moving targets; wherein the moving targets include: a moving sub-target that does not overlap with other moving sub-targets in the concentrated video frame, and a plurality of moving sub-targets having an overlapping relationship;

The determining module is configured to acquire the labeling information of each moving target separately, and determine the labeling information of all the moving targets.

Optionally, the encapsulating module at least includes:

The first encapsulation submodule is configured to encapsulate the concentrated video frame into a media data packet in a first preset format.

Optionally, the encapsulating module further includes:

The second encapsulation submodule is configured to encapsulate the annotation information of all the moving targets of the concentrated video frame into the annotation information packet of the second preset format.

The present invention further provides a playback device for a concentrated video, comprising:

a first parsing module, configured to parse the target concentrated video file to obtain a concentrated video frame and a relative playing time of the concentrated video frame;

a first acquiring module, configured to acquire, according to the relative playing time, an annotation information packet associated with all moving targets of the concentrated video frame;

a second parsing module, configured to parse the label information packet, and determine labeling information of a moving target of the concentrated video frame;

The playing module is configured to superimpose and display the concentrated video frame and the annotation information of the moving targets of the concentrated video frame according to the relative playing time, to complete the playing.

In the moving target labeling method for a concentrated video according to the embodiment of the present invention, the annotation information of the moving targets of the concentrated video frame and the concentrated video frame data are separately encapsulated and written into the concentrated video file. This omits the step of synthesizing the two, improves the processing efficiency, and separates the video frame data from the annotation information. Meanwhile, the video frame and the annotation information of its moving targets are associated through the relative playing time of the video frame, so that the annotation information can be conveniently saved and transmitted together with the video information, and the moving targets are marked directly while the concentrated video is played, thereby improving the labeling efficiency and ensuring the playing efficiency.

Brief description of the drawings

The drawings described herein are intended to provide a further understanding of the invention, and are intended to be a part of the invention. In the drawing:

FIG. 1 is a flowchart of a method for marking a moving target of a concentrated video according to an embodiment of the present invention;

FIG. 2 is a flowchart of determining an association relationship of a moving target labeling method for a concentrated video according to an embodiment of the present invention;

FIG. 3 is a flowchart of a method for playing a concentrated video in an embodiment of the present invention;

FIG. 4 is a schematic diagram of a specific labeling and playing process, taking an MP4 file as an example, in the embodiment of the present invention;

FIG. 5 is a schematic structural diagram of a moving target marking device for a concentrated video according to an embodiment of the present invention;

FIG. 6 is a schematic structural diagram of a device for playing back a concentrated video according to an embodiment of the present invention.

Preferred embodiment of the invention

The embodiments of the present invention will be described in detail below with reference to the accompanying drawings. It should be noted that, in the case of no conflict, the features in the embodiments and the embodiments in the present application may be arbitrarily combined with each other.

In the prior art, labeling the moving targets of a concentrated video either consumes considerable processing performance and leaves the annotation information inseparable from the concentrated video frame, or marks the moving targets through a description file that is large and therefore inconvenient to save and transmit. To address these problems, the present invention provides a moving target labeling method, a playing method, and a device for a concentrated video. In the embodiment of the present invention, the annotation information of the moving targets of the concentrated video frame and the concentrated video frame data are separately encapsulated and written into the concentrated video file, which omits the step of synthesizing the two, improves the processing efficiency, and separates the video frame data from the annotation information. Meanwhile, the embodiment of the present invention associates the video frame with the annotation information of its moving targets through the relative playing time of the video frame, so that the annotation information can be conveniently saved and transmitted together with the video information, and the moving targets are marked directly while the concentrated video is played, thereby improving the labeling efficiency and ensuring the playback efficiency.

As shown in FIG. 1 , an embodiment of the present invention provides a method for marking a moving target of a concentrated video, including:

Step 100: Obtain annotation information of all moving targets in a concentrated video frame and relative playing time of the concentrated video frame.

Step 101: Encapsulate the obtained concentrated video frame and the labeling information of all moving targets of the concentrated video frame into a media data packet and an annotation information packet, respectively;

Step 102: Establish an association between the media data packet of the concentrated video frame and the labeled information packet according to the relative playing time of the obtained concentrated video frame.

In the above embodiment of the present invention, the moving target in step 100 refers to a moving object in a video frame, such as a moving person or vehicle; marking the moving target with an arrow, a circle, or the like helps the person watching the video take in all the activity targets within a short time.

It should be noted that the moving target labeling method of the embodiment of the present invention is performed during the concentration of the original video file: each time the original video file is concentrated to produce one concentrated video frame, steps 101 and 102 are performed on that frame. When the concentration of the original video file is complete, the extraction of the annotation information of the moving targets is also complete, and when the concentrated video is played the annotation information is called directly to mark the moving targets, which improves the labeling efficiency. At the same time, the concentrated video frame and the annotation information of its moving targets are encapsulated separately in step 101 and written separately into the concentrated video file, so that the video frame data and the annotation information remain separate; this saves the step of synthesizing the two and improves the processing efficiency.

Further, since the concentrated video frame and the annotation information of its moving targets are encapsulated separately, an association between the two must be established according to the relative playing time. When the video is played, the corresponding concentrated video frame and the annotation information of its moving targets are found through this association and are combined for playback to form a complete video.
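As a minimal sketch of steps 100 to 102, the frame data and its annotation information can be held in two separate packets that share only a relative playing time. The names `MediaPacket`, `AnnotationPacket`, and `label_frame`, the millisecond time unit, and the dictionary used for the association are illustrative assumptions and are not specified by the embodiment.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class MediaPacket:
    play_time_ms: int   # relative playing time of the concentrated frame
    frame_data: bytes   # encoded frame, never modified by the annotation


@dataclass
class AnnotationPacket:
    play_time_ms: int
    # One (x, y, width, height) tuple per moving target in the frame.
    targets: List[Tuple[int, int, int, int]] = field(default_factory=list)


def label_frame(frame_data, targets, play_time_ms, association):
    """Encapsulate frame and annotations separately, then associate them."""
    media = MediaPacket(play_time_ms, frame_data)            # step 101
    annotation = AnnotationPacket(play_time_ms, list(targets))
    # Step 102: the association is keyed by the relative playing time.
    association[play_time_ms] = (media, annotation)
    return media, annotation


association: Dict[int, Tuple[MediaPacket, AnnotationPacket]] = {}
label_frame(b"frame-0", [(10, 20, 30, 40)], 0, association)
label_frame(b"frame-1", [(12, 22, 30, 40), (100, 50, 8, 16)], 40, association)
```

Because the two packets are only linked through the playing time, either one can be stored, transmitted, or ignored without touching the other.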

In the above embodiment of the present invention, the moving target labeling method of the concentrated video further includes:

Saving the established media data packet, the annotation information packet, and the association between the media data packet and the annotation information packet respectively into the concentrated video file to obtain the target concentrated video file.

In a specific embodiment of the present invention, after the media data packet, the annotation information packet, and the association between them have been determined for one concentrated video frame, the same items are obtained for the next concentrated video frame, and so on until all concentrated video frames have been processed. The media data packet, the annotation information packet, and the association between them for each frame are saved separately to obtain the target concentrated video file. The target concentrated video file is the file obtained by labeling the moving targets with the moving target labeling method provided by the embodiment of the present invention.

It should be noted that the association is written into the metadata description part of the concentrated video file, which is generally used to store the relationships between the parts of a video file. For example, an ordinary video file generally includes at least an audio track and a video track; to ensure that audio and video are played synchronously, the relationship between them must be stored in the metadata description part, that is, which audio data should be played together with a given piece of video.

In the embodiment of the present invention, as shown in FIG. 2, step 100 specifically includes:

Step 200: Perform concentration processing on the original video file to obtain a concentrated video frame.

Step 201: Perform moving target analysis on the obtained concentrated video frame to extract moving targets, where the moving targets include: a moving sub-target that does not overlap with other moving sub-targets in the concentrated video frame, and a plurality of moving sub-targets having an overlapping relationship;

Step 202: Acquire the annotation information of each moving target separately, and determine the annotation information of all moving targets.

In the specific embodiment of the present invention, the concentration processing in step 200 applies a concentration algorithm to a preset number of frame images. For example, if the preset number is 5, then 5 frames of the original video file are input to the concentration algorithm and one frame is output; this output frame is the concentrated video frame of the embodiment of the present invention. The concentrated video frame is the essence of those 5 original frames, obtained by discarding the valueless video content and merging the valuable content.

The preset number can be set according to the needs of the user. For example, condensing a video to 10M and condensing the same video to 5M call for different preset numbers; the smaller the concentrated video is to be, the larger the preset number.
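The relationship between the preset number and the amount of output can be sketched as follows. The embodiment does not specify the concentration algorithm, so `merge_frames` below is only a placeholder (a pixel-wise maximum), and the flattened-list frame representation and the handling of leftover frames are likewise assumptions made for illustration.

```python
from typing import Iterable, Iterator, List

Frame = List[int]  # flattened grayscale pixels; a stand-in for real image data


def merge_frames(frames: List[Frame]) -> Frame:
    """Placeholder for the concentration algorithm: pixel-wise maximum."""
    return [max(pixels) for pixels in zip(*frames)]


def concentrate(frames: Iterable[Frame], preset_number: int) -> Iterator[Frame]:
    """Yield one concentrated frame for every `preset_number` input frames."""
    group: List[Frame] = []
    for frame in frames:
        group.append(frame)
        if len(group) == preset_number:
            yield merge_frames(group)
            group = []
    if group:  # trailing frames that do not fill a whole group
        yield merge_frames(group)


original = [[i, 0, 10 - i] for i in range(10)]  # 10 tiny 3-pixel frames
by_five = list(concentrate(original, 5))
by_two = list(concentrate(original, 2))
```

With ten input frames, a preset number of 5 produces two concentrated frames and a preset number of 2 produces five, which illustrates why a larger preset number gives a smaller concentrated video.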

In the specific embodiment of the present invention, moving targets are determined as follows. Each moving object appearing in the concentrated video frame is regarded as a moving sub-target. A moving sub-target that does not overlap with any other moving sub-target is determined to be a moving target on its own, and a plurality of moving sub-targets that overlap with one another are together determined to be a single moving target. For example, suppose a concentrated video frame contains two moving sub-targets, A and B. If A and B do not overlap in the concentrated video frame, A is determined to be one moving target and B another; if A and B overlap in the concentrated video frame, A and B together are regarded as one moving target.
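The grouping rule above can be sketched with axis-aligned bounding boxes and a small union-find structure. The box representation `(x, y, width, height)` with a top-left origin, and the choice to describe a merged target by the bounding box of its members, are assumptions; the text does not state how the coordinates of a merged target are computed.

```python
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height), top-left origin assumed


def overlaps(a: Box, b: Box) -> bool:
    """True when two axis-aligned boxes share a region of positive area."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah


def merge_sub_targets(boxes: List[Box]) -> List[Box]:
    """Group overlapping sub-targets and return one bounding box per group."""
    parent = list(range(len(boxes)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            if overlaps(boxes[i], boxes[j]):
                parent[find(i)] = find(j)

    groups: Dict[int, List[Box]] = {}
    for i, box in enumerate(boxes):
        groups.setdefault(find(i), []).append(box)

    merged = []
    for members in groups.values():
        left = min(x for x, _, _, _ in members)
        top = min(y for _, y, _, _ in members)
        right = max(x + w for x, _, w, _ in members)
        bottom = max(y + h for _, y, _, h in members)
        merged.append((left, top, right - left, bottom - top))
    return merged


a = (0, 0, 10, 10)
b = (5, 5, 10, 10)   # overlaps A, so A and B form one moving target
c = (50, 50, 4, 4)   # overlaps nothing, so it is a moving target on its own
targets = merge_sub_targets([a, b, c])
```

Union-find also handles chains: if A overlaps B and B overlaps C, all three end up in one moving target even when A and C do not touch.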

Further, in step 202, the same or different annotation information is determined for each moving target, and saved for subsequent calls.

Specifically, in the foregoing embodiment of the present invention, the labeling information of the moving target includes: a coordinate of the moving target, a height of the moving target, and a width of the moving target.

In the specific embodiment of the present invention, in order to mark a moving target accurately, at least the coordinates of the moving target (X-axis and Y-axis, plus the Z-axis for stereoscopic video), the height of the moving target, and the width of the moving target need to be acquired.

In the above embodiment of the present invention, step 101 further includes: encapsulating the concentrated video frame into a media data packet in a first preset format.

In the specific embodiment of the present invention, the media data packet contains the concentrated video frame. The encapsulation mainly converts the frame data into a frame format that conforms to the playback format of the concentrated video file, such as the mp4, rmvb, mtv, or wmv format; the format of the media data packet is thus determined by the format of the concentrated video file.

Further, in the foregoing embodiment of the present invention, step 102 further includes: encapsulating the annotation information of all the moving targets of the concentrated video frame into an annotation information packet in the second preset format, wherein:

The annotation information packet contains the annotation information of all the moving targets in the concentrated video frame. The encapsulation mainly packs the annotation information into a form that conforms to the annotation information format of the concentrated video file, such as the format of the annotation information track of a playback file in the mp4 format or of a playback file in the rmvb format; further formats are not listed here.
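To make the idea of an annotation information packet concrete, the following sketch packs the coordinates, width, and height of every moving target of one frame into a compact byte string. The layout (a little-endian header with the playing time in milliseconds and a target count, followed by four 16-bit fields per target) is purely hypothetical; the actual second preset format depends on the container and is not defined in the text.

```python
import struct
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)

HEADER = struct.Struct("<IH")    # hypothetical: play time in ms, target count
TARGET = struct.Struct("<HHHH")  # hypothetical: x, y, width, height


def pack_annotation_packet(play_time_ms: int, targets: List[Box]) -> bytes:
    """Serialize the annotation information of one concentrated frame."""
    body = b"".join(TARGET.pack(*target) for target in targets)
    return HEADER.pack(play_time_ms, len(targets)) + body


def unpack_annotation_packet(data: bytes) -> Tuple[int, List[Box]]:
    """Recover the playing time and the per-target annotation information."""
    play_time_ms, count = HEADER.unpack_from(data, 0)
    targets = [
        TARGET.unpack_from(data, HEADER.size + i * TARGET.size)
        for i in range(count)
    ]
    return play_time_ms, targets


packet = pack_annotation_packet(5000, [(10, 20, 30, 40), (100, 50, 8, 16)])
```

In this layout two targets occupy 22 bytes, which suggests why carrying the annotation alongside the frame is far cheaper than a separate, searchable description file.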

In the above specific embodiment of the present invention, the association is established according to the relative playing time of the concentrated video frame: for example, if the media data packet and the annotation information packet of a frame are both to be played at the preset time point of 5 s, the association between the media data packet and the annotation information packet at that relative playing time is determined.

In the specific embodiment of the present invention, since the moving targets are labeled during the concentration of the original video file, once every concentrated video frame has been processed the concentrated video file is already the target concentrated video file, with both the concentrated frames and the corresponding annotation information saved, which improves the concentration efficiency.

In order to achieve the above purpose, as shown in FIG. 3, an embodiment of the present invention further provides a method for playing a concentrated video, including:

Step 300, parsing the target concentrated video file to obtain a concentrated video frame and a relative playing time of the concentrated video frame;

Step 301: Acquire an annotation information packet associated with all moving targets of the concentrated video frame according to the relative playing time;

Step 302, parsing the label information packet, and determining labeling information of a moving target of the concentrated video frame;

Step 303: Superimpose and display the concentrated video frame and the annotation information of the moving targets of the concentrated video frame according to the relative playing time, to complete the playing.

In the above embodiment of the present invention, the playing process of the concentrated video corresponds to the moving target labeling method for the concentrated video provided by the present invention and is essentially the reverse of that method. Specifically, the annotation information of the moving target includes: coordinates of the moving target, a height of the moving target, and a width of the moving target.
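Steps 300 to 303 can be sketched as a lookup by relative playing time followed by an overlay at display time. Frames are modelled here as grids of single characters so that the example stays self-contained; the function names `overlay` and `play`, the dictionaries keyed by playing time, and the rectangle-outline marking are illustrative assumptions.

```python
from typing import Dict, Iterator, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)
Frame = List[List[str]]          # rows of single-character "pixels"


def overlay(frame: Frame, targets: List[Box], mark: str = "#") -> Frame:
    """Return a copy of the frame with a rectangle outline per moving target."""
    shown = [row[:] for row in frame]  # the stored frame is left untouched
    for x, y, w, h in targets:
        for col in range(x, x + w):
            shown[y][col] = mark
            shown[y + h - 1][col] = mark
        for row in range(y, y + h):
            shown[row][x] = mark
            shown[row][x + w - 1] = mark
    return shown


def play(frames: Dict[int, Frame],
         annotations: Dict[int, List[Box]]) -> Iterator[Tuple[int, Frame]]:
    """Yield (play_time, displayed_frame) in playing order."""
    for play_time in sorted(frames):
        targets = annotations.get(play_time, [])  # lookup by relative play time
        yield play_time, overlay(frames[play_time], targets)


blank = [["."] * 6 for _ in range(4)]
frames = {0: blank, 40: blank}
annotations = {40: [(1, 1, 4, 3)]}
shown = dict(play(frames, annotations))
```

The overlay is drawn on a copy, so the stored frame data stays free of annotations and the player could equally show the video without any marks.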

Referring to FIG. 4, the moving target labeling method and the corresponding concentrated video playing process provided by the present invention are described below, taking an MP4 file as the original video file:

Steps for generating the moving target annotation information in the MP4 concentrated video:

Step 401: Parse the original MP4 video file to extract one frame of video frame data, and pass it to the concentration algorithm;

Step 402: Run the concentration algorithm; if it produces no output data, return to step 401 and continue parsing the file; if it produces output data, output the data;

Step 403: Obtain the video frame data after the concentration processing;

Step 404: Obtain the annotation information of the moving targets of the concentrated video frame, such as the X-axis coordinate, Y-axis coordinate, height (Height), and width (Width);

Step 405: Encapsulate the outputs of step 403 and step 404 into a video track and an information track of the MP4 file, respectively, packaging them according to the relative playing time so that they can be associated during playback;

Step 406: Write the encapsulated video track and information track separately to the MP4 concentrated video file, and at the same time write the information associating the video track and the information track by relative playing time to the metadata description part of the MP4 file. Return to step 401 until the original MP4 file has been fully parsed.
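The loop formed by steps 401 to 406 can be sketched as follows. No real MP4 library is used: the "file" is a plain dictionary with a video track, an information track, and a metadata part, and the `Concentrator` class is a stand-in that emits placeholder output after every `n` input frames. All of these names and the fixed frame duration are assumptions made for illustration only.

```python
from typing import Dict, List, Optional, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)


class Concentrator:
    """Stand-in concentration algorithm: emits output every `n` input frames."""

    def __init__(self, n: int) -> None:
        self.n = n
        self.buffer: List[bytes] = []

    def feed(self, frame: bytes) -> Optional[Tuple[bytes, List[Box]]]:
        self.buffer.append(frame)
        if len(self.buffer) < self.n:
            return None                        # step 402: no output yet
        merged = b"|".join(self.buffer)        # step 403: placeholder frame data
        boxes = [(0, 0, len(self.buffer), 1)]  # step 404: placeholder annotation
        self.buffer = []
        return merged, boxes


def build_concentrated_file(original_frames: List[bytes], n: int, frame_ms: int):
    video_track: List[bytes] = []
    info_track: List[List[Box]] = []
    metadata: Dict[int, Tuple[int, int]] = {}  # play time -> (video idx, info idx)
    concentrator = Concentrator(n)
    for frame in original_frames:              # step 401: parse frame by frame
        output = concentrator.feed(frame)
        if output is None:
            continue                           # back to step 401
        frame_data, boxes = output
        play_time = len(video_track) * frame_ms
        video_track.append(frame_data)         # steps 405-406: separate tracks
        info_track.append(boxes)
        metadata[play_time] = (len(video_track) - 1, len(info_track) - 1)
    return {"video": video_track, "info": info_track, "metadata": metadata}


result = build_concentrated_file([b"a", b"b", b"c", b"d", b"e", b"f"], 3, 40)
```

The metadata part is the only place where the two tracks refer to each other, which mirrors the role of the metadata description part in step 406.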

The playing steps of the moving target annotation information in the MP4 concentrated video include:

Step 407: Parse the MP4 concentrated video file to obtain the video frame data and the relative playing time of the frame, and find the corresponding annotation information according to the relative playing time;

Step 408: Obtain the video frame data and its relative playing time;

Step 409: Obtain the annotation information in the information track (X-axis coordinate, Y-axis coordinate, height, and width) and its relative playing time;

Step 410: Associate the video frame data with the annotation information of the moving targets of the frame according to the relative playing time;

Step 411: When the player plays, superimpose the video frame data and the moving target annotation of the frame; loop back to step 407 until the MP4 concentrated video file has been fully parsed and the playing is complete.

In order to achieve the above purpose, as shown in FIG. 5, the embodiment of the present invention further provides a moving target marking device for a concentrated video, including:

The obtaining module 10 is configured to obtain annotation information of all moving targets in a concentrated video frame and a relative playing time of the concentrated video frame;

The encapsulation module 20 is configured to encapsulate the concentrated video frame and the annotation information of all moving targets of the concentrated video frame into a media data packet and an annotation information packet, respectively;

The association module 30 is configured to establish an association between the media data packet of the concentrated video frame and the annotation information packet according to the relative play time of the concentrated video frame.

In the above embodiment of the present invention, the apparatus further includes a saving module (not shown in FIG. 5) configured to save the media data packet, the annotation information packet, and the association between the media data packet and the annotation information packet respectively into the concentrated video file to obtain the target concentrated video file.

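In the foregoing embodiment of the present invention, the acquiring module 10 may specifically include:

A concentrating module, configured to perform concentration processing on the original video file to obtain a concentrated video frame;

An extraction module, configured to perform a moving target analysis on the concentrated video frame and extract moving targets; wherein the moving targets include: a moving sub-target that does not overlap with other moving sub-targets in the concentrated video frame, and a plurality of moving sub-targets having an overlapping relationship;

A determining module, configured to acquire the annotation information of each moving target separately, and determine the annotation information of all the moving targets.

In the foregoing embodiment of the present invention, the encapsulating module 20 may specifically include:

A first encapsulation submodule, configured to encapsulate the concentrated video frame into a media data packet in a first preset format.

In the above embodiment of the present invention, the encapsulating module 20 further includes:

A second encapsulation submodule, configured to encapsulate the annotation information of all the moving targets of the concentrated video frame into an annotation information packet in a second preset format.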

In the moving target labeling technical solution for a concentrated video according to the embodiment of the present invention, the annotation information of the moving targets of the concentrated video frame and the concentrated video frame data are separately encapsulated and written into the concentrated video file. This omits the step of synthesizing the two, improves the processing efficiency, and separates the video frame data from the annotation information. Meanwhile, the video frame and the annotation information of its moving targets are associated through the relative playing time of the video frame, so that the annotation information can be conveniently saved and transmitted together with the video information, and the moving targets are marked directly while the concentrated video is played, thereby improving the labeling efficiency and ensuring the playback efficiency.

In order to achieve the above objective, as shown in FIG. 6, the embodiment of the present invention further provides a playback device for a concentrated video, including:

The first parsing module 60 is configured to parse the target concentrated video file to obtain a concentrated video frame and a relative playing time of the concentrated video frame on a preset time axis;

The first obtaining module 70 is configured to acquire, according to the relative playing time, an annotated information packet associated with all moving targets of the concentrated video frame;

a second parsing module 80, configured to parse the label information packet, and determine label information of a moving target of the concentrated video frame;

The playing module 90 is configured to superimpose and display the concentrated video frame and the annotation information of the moving targets of the concentrated video frame according to the relative playing time, to complete the playing.

In the above embodiment of the present invention, the annotation information of the moving target includes: coordinates of the moving target, height of the moving target, and width of the moving target.

The above is a preferred embodiment of the present invention. It should be noted that those skilled in the art can make improvements and refinements without departing from the principles of the present invention, and these should also be regarded as falling within the scope of protection of the present invention.

Industrial applicability

In the embodiment of the present invention, the annotation information of all the moving targets in a concentrated video frame and the relative playing time of the concentrated video frame are obtained; the concentrated video frame and the annotation information of all moving targets of the concentrated video frame are encapsulated into a media data packet and an annotation information packet, respectively; and an association between the media data packet of the concentrated video frame and the annotation information packet is established according to the relative playing time of the concentrated video frame. By separately encapsulating the annotation information of the moving targets of the concentrated video frame and the concentrated video frame data and writing them into the concentrated video file, the step of synthesizing the two is omitted, the processing efficiency is improved, and the video frame data and the annotation information are kept separate. At the same time, the embodiment of the present invention associates the video frame with the annotation information of its moving targets through the relative playing time of the video frame, so that the annotation information can be conveniently saved and transmitted together with the video information, and the moving targets are marked directly while the concentrated video is played, thereby improving the labeling efficiency and ensuring the playing efficiency.

Claims

A moving target labeling method for a concentrated video, comprising:

Obtaining annotation information of all moving targets in a concentrated video frame and relative playing time of the concentrated video frame;

Encapsulating the concentrated video frame and the annotation information of all moving targets of the concentrated video frame into a media data packet and an annotation information packet, respectively;

Establishing an association between the media data packet and the annotation information packet of the concentrated video frame according to the relative playing time of the concentrated video frame.
The moving target labeling method according to claim 1, further comprising:

Saving the media data packet, the annotation information packet, and the association between the media data packet and the annotation information packet into the concentrated video file to obtain a target concentrated video file.
The moving target labeling method according to claim 1, wherein obtaining the annotation information of all moving targets in a concentrated video frame and the relative playing time of the concentrated video frame comprises:

Concentrating the original video file to obtain a concentrated video frame;

Performing moving target analysis on the concentrated video frame to extract moving targets, wherein the moving targets include: a moving sub-target that does not overlap with other moving sub-targets in the concentrated video frame, and a plurality of moving sub-targets that overlap one another;

Obtaining the annotation information of each moving target separately, thereby determining the annotation information of all the moving targets.
The moving target labeling method according to any one of claims 1 to 3, wherein the annotation information of the moving target comprises: coordinates of the moving target, a height of the moving target, and a width of the moving target.
The moving target labeling method according to claim 1, wherein encapsulating the concentrated video frame into a media data packet comprises: encapsulating the concentrated video frame into a media data packet of a first preset format.
The moving target labeling method according to claim 1, wherein encapsulating the annotation information of all moving targets of the concentrated video frame into an annotation information packet comprises:

Encapsulating the annotation information of all moving targets of the concentrated video frame into an annotation information packet of a second preset format.
A method for playing a concentrated video, comprising:

Parsing the target concentrated video file to obtain a concentrated video frame and a relative playing time of the concentrated video frame;

Obtaining, according to the relative play time, an annotation information packet associated with all moving targets of the concentrated video frame;

Parsing the annotation information packet to determine annotation information of a moving target of the concentrated video frame;

Superimposing and displaying, according to the relative playing time, the concentrated video frame and the annotation information of the moving target of the concentrated video frame, to complete the playing.
The playing method according to claim 7, wherein the annotation information of the moving target comprises: coordinates of the moving target, a height of the moving target, and a width of the moving target.
A moving target labeling device for a concentrated video, comprising:

An obtaining module, configured to obtain annotation information of all moving targets in a concentrated video frame and a relative playing time of the concentrated video frame;

An encapsulation module, configured to encapsulate the concentrated video frame and the annotation information of all moving targets of the concentrated video frame into a media data packet and an annotation information packet, respectively;

An association module, configured to establish an association between the media data packet and the annotation information packet of the concentrated video frame according to the relative playing time of the concentrated video frame.
The moving target labeling device according to claim 9, further comprising:

A saving module, configured to save the media data packet, the annotation information packet, and the association between the media data packet and the annotation information packet into the concentrated video file to obtain a target concentrated video file.
The moving target labeling device according to claim 9, wherein the obtaining module comprises:

A concentration module, configured to perform concentration processing on an original video file to obtain the concentrated video frame;

An extraction module, configured to perform moving target analysis on the concentrated video frame and extract moving targets, wherein the moving targets include: a moving sub-target that does not overlap with other moving sub-targets in the concentrated video frame, and a plurality of moving sub-targets that overlap one another;

A determining module, configured to obtain the annotation information of each moving target separately, thereby determining the annotation information of all the moving targets.
The moving target labeling device according to claim 9, wherein the encapsulation module comprises at least:

A first encapsulation submodule, configured to encapsulate the concentrated video frame into a media data packet of a first preset format.
The moving target labeling device according to claim 12, wherein the encapsulation module further comprises:

A second encapsulation submodule, configured to encapsulate the annotation information of all moving targets of the concentrated video frame into an annotation information packet of a second preset format.
A playback device for a concentrated video, comprising:

a first parsing module configured to parse the target concentrated video file to obtain a concentrated video frame and a relative playing time of the concentrated video frame;

a first acquiring module, configured to acquire, according to the relative playing time, an annotation information packet associated with all moving targets of the concentrated video frame;

a second parsing module, configured to parse the annotation information packet and determine annotation information of a moving target of the concentrated video frame;

a playing module, configured to superimpose and display, according to the relative playing time, the concentrated video frame and the annotation information of the moving target of the concentrated video frame, to complete the playing.
The playback device according to claim 14, wherein the annotation information of the moving target comprises: coordinates of the moving target, a height of the moving target, and a width of the moving target.
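
By way of illustration only, the extraction step recited above, which treats a non-overlapping moving sub-target as one moving target and a plurality of mutually overlapping moving sub-targets as another, might be sketched as follows. The sketch assumes that an overlapping group is labelled with the single rectangle enclosing all of its members, which the text does not state explicitly; the function names and the (x, y, width, height) field order are likewise hypothetical.

```python
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height), an assumed field order


def overlaps(a: Box, b: Box) -> bool:
    """True when the two rectangles share some area."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah


def enclose(boxes: List[Box]) -> Box:
    """Smallest rectangle containing every box in the list."""
    left = min(x for x, _, _, _ in boxes)
    top = min(y for _, y, _, _ in boxes)
    right = max(x + w for x, _, w, _ in boxes)
    bottom = max(y + h for _, y, _, h in boxes)
    return (left, top, right - left, bottom - top)


def moving_targets(sub_targets: List[Box]) -> List[Box]:
    """Group sub-targets that overlap, directly or through a chain of
    overlaps, and return one enclosing box per group."""
    parent = list(range(len(sub_targets)))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(sub_targets)):
        for j in range(i + 1, len(sub_targets)):
            if overlaps(sub_targets[i], sub_targets[j]):
                parent[find(j)] = find(i)

    groups: Dict[int, List[Box]] = {}
    for i, box in enumerate(sub_targets):
        groups.setdefault(find(i), []).append(box)
    return [enclose(g) for g in groups.values()]


subs = [(0, 0, 10, 10), (5, 5, 10, 10), (30, 30, 5, 5)]
targets = moving_targets(subs)
```

Here the first two sub-targets overlap and are merged into one moving target, while the third stands alone, so each moving target ends up with exactly one set of coordinates, height and width.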