CN105491386A - Format conversion method and device of video data - Google Patents

Format conversion method and device of video data Download PDF

Info

Publication number
CN105491386A
CN105491386A CN201410482570.9A CN201410482570A CN105491386A CN 105491386 A CN105491386 A CN 105491386A CN 201410482570 A CN201410482570 A CN 201410482570A CN 105491386 A CN105491386 A CN 105491386A
Authority
CN
China
Prior art keywords
video data
frame video
intelligent information
information
added
Prior art date
Application number
CN201410482570.9A
Other languages
Chinese (zh)
Other versions
CN105491386B (en
Inventor
陈祖文
陈杰
郭斌
Original Assignee
杭州海康威视数字技术股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 杭州海康威视数字技术股份有限公司 filed Critical 杭州海康威视数字技术股份有限公司
Priority to CN201410482570.9A priority Critical patent/CN105491386B/en
Publication of CN105491386A publication Critical patent/CN105491386A/en
Application granted granted Critical
Publication of CN105491386B publication Critical patent/CN105491386B/en

Links

Abstract

The invention discloses a format conversion method and device of video data. The method comprises: respectively decoding received frames of original video data into a preset format; with respect to the decoded frames of video data, when determining that the video data frames have corresponding intelligent information, superimposing the intelligent information to the video data frames, recoding the video data frames with the intelligent information superimposed according to a preset compression format, or directly recoding the video data frames according to the preset compression format; packaging the recoded video data frames into object code streams. The solution of the invention can enrich the information content displayed to a user.

Description

A kind of format conversion method of video data and device

Technical field

The present invention relates to data processing technique, particularly a kind of format conversion method of video data and device.

Background technology

The format conversion of video data, refer to and to encapsulate original video data and the conversion etc. of coded format, the image information before and after conversion is basically identical, cannot add the information outside original image information, as intelligent information, thus limit the follow-up information content showing user.

Summary of the invention

In view of this, the invention provides a kind of format conversion method and device of video data, the information content showing user can be enriched.

In order to achieve the above object, technical scheme of the present invention is achieved in that

A format conversion method for video data, comprising:

The each frame original video data received is decoded into predetermined format respectively;

For decoded every frame video data, when determining that it exists corresponding intelligent information, the intelligent information of correspondence is added on this frame video data, and this frame video data being superimposed with intelligent information is recoded according to predetermined compressed format, otherwise, directly this frame video data is recoded according to predetermined compressed format;

Each frame video data after recoding is packaged into target code stream.

A format conversion apparatus for video data, comprising: parsing module, decoder module, information fusion module, coding module and synthesis module;

Described parsing module, for obtaining each frame original video data, and sends to described decoder module;

Described decoder module, after each frame original video data received is decoded into predetermined format respectively, sends to described information fusion module;

Described information fusion module, for for the every frame video data received, when determining that it exists corresponding intelligent information, the intelligent information of correspondence is added to after on this frame video data, send to described coding module, otherwise, directly this frame video data is sent to described coding module;

Described coding module, for by receiving after each frame video data recodes according to predetermined compressed format respectively, sends to described synthesis module;

Described synthesis module, for being packaged into target code stream by each frame video data received.

Visible, adopt scheme of the present invention, for video data, intelligent information can be superposed outside original image information, thus achieve the information fusion of video data and intelligent information, and then enrich the follow-up information content showing user; And scheme of the present invention implements simple and convenient, thus be convenient to carry out popularizing and promoting.

Accompanying drawing explanation

Fig. 1 is the flow chart of the format conversion method embodiment of video data of the present invention.

Fig. 2 is the process schematic that the present invention superposes intelligent information on video data.

Fig. 3 is the component value storage format schematic diagram of each pixel under existing YV12 form.

Fig. 4 is the composition structural representation of the format conversion apparatus embodiment of video data of the present invention.

Embodiment

For problems of the prior art, provide a kind of format conversion scheme of video data in the present invention, the information content etc. showing user can be enriched.

In order to make technical scheme of the present invention clearly, understand, to develop simultaneously embodiment referring to accompanying drawing, scheme of the present invention be described in further detail.

Fig. 1 is the flow chart of the format conversion method embodiment of video data of the present invention.As shown in Figure 1,11 ~ 13 are comprised the following steps.

Step 11: each frame original video data received is decoded into predetermined format respectively.

In actual applications, can extract original video data from container, and be decoded into predetermined format respectively, described predetermined format can be yuv format or rgb format etc.

Step 12: for decoded every frame video data, when determining that it exists corresponding intelligent information, the intelligent information of correspondence is added on this frame video data, and this frame video data being superimposed with intelligent information is recoded according to predetermined compressed format, otherwise, directly this frame video data is recoded according to predetermined compressed format.

According to existing processing mode, for decoded every frame video data, directly it can be recoded according to predetermined compressed format, and after adopting scheme of the present invention, for decoded every frame video data, when determining that it exists corresponding intelligent information, first the intelligent information of correspondence can be added on this frame video data, and then this frame video data being superimposed with intelligent information is recoded according to predetermined compressed format, otherwise, namely when determining that this frame video data does not exist corresponding intelligent information, just can directly this frame video data be recoded according to predetermined compressed format.

Which kind of form is described predetermined compressed format be specially and can be decided according to the actual requirements.

Step 13: each frame video data after recoding is packaged into target code stream.

In this step, each frame video data after processing can be packaged into target code stream and export, the follow-up display can carrying out video pictures and intelligent information on ordinary playing device according to mode shown in step 12.

The specific implementation of step 11 and step 13 is prior art, recodification of how carrying out described in step 12 is similarly prior art, all repeat no more, main to how to superpose intelligent information on video data below, the information fusion namely how realizing video data and intelligent information is described in detail.

One) information fusion

Specifically, for decoded every frame video data, can determine whether respectively to receive intelligent information corresponding to this frame video data; If so, then the intelligent information received is added on this frame video data; If not, then carry out intelligent information retrieval for this frame video data, and when extracting result and not being empty, the intelligent information extracted is added on this frame video data.

That is, when needs are to a certain frame video data overlay intelligent information, if having received intelligent information corresponding to this frame video data, then can directly the intelligent information received be added on this frame video data; Otherwise, according to existing mode, intelligent information retrieval can be carried out for this frame video data, and when extracting result and not being empty, the intelligent information extracted is added on this frame video data; If both do not receive intelligent information, do not extract intelligent information, so then without the need to superposing intelligent information on this frame video data yet.

In actual applications, user prespecifiedly may need to show which intelligent information (needed for oneself), instead of show all intelligent information that are that receive or that extract, like this, for every frame video data, when determining to receive intelligent information corresponding to this frame video data, also need the intelligent information whether comprised in the intelligent information determining further to receive needed for user, if so, then by receive, intelligent information needed for user is added on this frame video data; If not, then can adopt the processing mode that the intelligent information corresponding with not receiving this frame video data is the same, namely intelligent information retrieval is carried out for this frame video data, and when extracting result and not being empty, the intelligent information extracted is added on this frame video data, specifically, the intelligent information retrieval that can carry out needed for user for this frame video data, and when extracting result and be sky, by extract, intelligent information needed for user is added on this frame video data.

Described intelligent information generally includes but is not limited to: target following frame, movement locus, warning line, temperature figure, character etc.

In addition, in scheme of the present invention, for every frame video data, mode intelligent information be added on this frame video data can be: by abstract for intelligent information for image pixel information rule; Image pixel information rule according to taking out is modified to this frame video data.

Image pixel information rule can comprise: the pixel position of needs amendment and amended value etc.

Need the pixel of amendment may be independently point, or be combined into the form of line or frame, the value of pixel is modified, the color of pixel and/or transparency etc. can be shown as and change.

Based on above-mentioned introduction, Fig. 2 is the process schematic that the present invention superposes intelligent information on video data.As shown in Figure 2,21 ~ 28 are comprised the following steps.

Step 21: obtain decoded each frame video data respectively.

Step 22: for the every frame video data got, determines whether to have received intelligent information corresponding to this frame video data respectively, if so, then performs step 23, otherwise, perform step 25.

Step 23: determine the intelligent information whether comprised in the intelligent information received needed for user, if so, then performs step 24, otherwise, perform step 25.

Step 24: using receive, intelligent information needed for user as the intelligent information of required superposition, perform step 27 afterwards.

Step 25: carry out the intelligent information retrieval needed for user for this frame video data, performs step 26 afterwards.

Step 26: when extracting result and be empty, using extract, intelligent information needed for user as the intelligent information of required superposition, execution step 27 afterwards.

Step 27: by abstract for the intelligent information of required superposition be image pixel information rule.

Step 28: the image pixel information rule according to taking out is modified to this frame video data, obtains this frame video data being superimposed with intelligent information.

In above-mentioned steps 27, how by abstract for intelligent information be image pixel information rule be prior art.Such as, detected a target following frame in picture, the positional information of each pixel so on this target following frame and the value etc. of set amended each pixel are image pixel information rule.

In addition, the image pixel information rule that in step 28, how basis takes out is modified to video data and is not restricted, and can be decided according to the actual requirements.Such as, the mode that individual element point can be adopted to modify, also can adopt the mode that other efficiency is higher.

For the YV12 form of plane (Planar) the most frequently used in coding and decoding video, this yuv format is the 4:2:0 sampled data that plane stores, each pixel correspond to a Y-component value, 4 adjacent pixels share a UV component value, and Fig. 3 is the component value storage format schematic diagram of each pixel under existing YV12 form.As shown in Figure 3, when the horizontal horizontal line of needs amendment, bulk can revise the YUV component value of each pixel on line successively, amended component value is determined by the image pixel information rule that intelligent information is abstract; For vertical horizontal line, the YUV component value due to each pixel on line is not Coutinuous store, can revise each pixel on line to scheme mode that image width is increment stepping; For oblique line, for ensureing that pixel is intensive, when the slope value of oblique line is between [-1,1], can along each pixel of horizontal direction (x-axis) traversal amendment, other slope value is then along each pixel of vertical direction (y-axis) traversal amendment.Frame is formed by connecting by line, and technological essence does not have difference, repeats no more.

The present invention discloses a kind of format conversion apparatus of video data.

Fig. 4 is the composition structural representation of the format conversion apparatus embodiment of video data of the present invention.As shown in Figure 4, comprising: parsing module, decoder module, information fusion module, coding module and synthesis module.

Parsing module, for obtaining each frame original video data, and sends to decoder module;

Decoder module, after each frame original video data received is decoded into predetermined format respectively, sends to information fusion module;

Information fusion module, for for the every frame video data received, when determining that it exists corresponding intelligent information, the intelligent information of correspondence is added to after on this frame video data, send to coding module, otherwise, directly this frame video data is sent to coding module;

Coding module, for by receiving after each frame video data recodes according to predetermined compressed format respectively, sends to synthesis module;

Synthesis module, for being packaged into target code stream by each frame video data received.

Particularly,

Information fusion module can, for the every frame video data received, determine whether to have received intelligent information corresponding to this frame video data respectively; If so, then the intelligent information received is added on this frame video data; If not, then carry out intelligent information retrieval for this frame video data, and when extracting result and not being empty, the intelligent information extracted is added on this frame video data.

The intelligent information that information fusion module receives comes from parsing module usually, after namely parsing module receives intelligent information, can directly send it to information fusion module, without the need to the process through decoder module.

In addition,

Information fusion module also can be further used for, for the every frame video data received, determine whether respectively to have received intelligent information corresponding to this frame video data, if, then determine the intelligent information whether comprised in the intelligent information received needed for user further, if so, then by receive, intelligent information needed for user is added on this frame video data; If do not receive the intelligent information that this frame video data is corresponding, or the intelligent information do not comprised in the intelligent information received needed for user, then carry out the intelligent information retrieval needed for user for this frame video data, and when extracting result and be sky, by extract, intelligent information needed for user is added on this frame video data.

Further,

Information fusion module can for needing the every frame video data superposing intelligent information, respectively by abstract for the intelligent information of required superposition be image pixel information rule, and according to image pixel information rule, this frame video data is modified, obtain this frame video data being superimposed with intelligent information.

Wherein,

Image pixel information rule can comprise: the pixel position of needs amendment and amended value.

The specific works flow process of Fig. 4 shown device embodiment please refer to the respective description in preceding method embodiment, repeats no more herein.

In a word, adopt scheme of the present invention, for video data, intelligent information can be superposed outside original image information, thus achieve the information fusion of video data and intelligent information, and then enrich the follow-up information content showing user; And scheme of the present invention implements simple and convenient, thus be convenient to carry out popularizing and promoting.

In sum, these are only preferred embodiment of the present invention, be not intended to limit protection scope of the present invention.Within the spirit and principles in the present invention all, any amendment done, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (10)

1. a format conversion method for video data, is characterized in that, comprising:
The each frame original video data received is decoded into predetermined format respectively;
For decoded every frame video data, when determining that it exists corresponding intelligent information, the intelligent information of correspondence is added on this frame video data, and this frame video data being superimposed with intelligent information is recoded according to predetermined compressed format, otherwise, directly this frame video data is recoded according to predetermined compressed format;
Each frame video data after recoding is packaged into target code stream.
2. method according to claim 1, is characterized in that,
Described for decoded every frame video data, when determining that it exists corresponding intelligent information, this frame video data that the intelligent information of correspondence is added to comprises:
For decoded every frame video data, determine whether respectively to have received intelligent information corresponding to this frame video data;
If so, then the intelligent information received is added on this frame video data;
If not, then carry out intelligent information retrieval for this frame video data, and when extracting result and not being empty, the intelligent information extracted is added on this frame video data.
3. method according to claim 2, is characterized in that,
Described this frame video data that the intelligent information received is added to comprises: determine the intelligent information whether comprised in the intelligent information received needed for user, if so, then by receive, intelligent information needed for user is added on this frame video data;
Describedly carry out intelligent information retrieval for this frame video data, and when extracting result and not being empty, the intelligent information extracted is added on this frame video data and comprises: carry out the intelligent information retrieval needed for user for this frame video data, and when extracting result and be sky, by extract, intelligent information needed for user is added on this frame video data;
The method comprises further: when not comprising the intelligent information needed for user in the intelligent information received, the intelligent information retrieval needed for user is carried out for this frame video data, and when extracting result and be sky, by extract, intelligent information needed for user is added on this frame video data.
4. the method according to claim 1,2 or 3, is characterized in that,
Described this frame video data that intelligent information is added to comprises:
By abstract for intelligent information be image pixel information rule;
According to described image pixel information rule, this frame video data is modified.
5. method according to claim 4, is characterized in that,
Described image pixel information rule comprises: the pixel position of needs amendment and amended value.
6. a format conversion apparatus for video data, is characterized in that, comprising: parsing module, decoder module, information fusion module, coding module and synthesis module;
Described parsing module, for obtaining each frame original video data, and sends to described decoder module;
Described decoder module, after each frame original video data received is decoded into predetermined format respectively, sends to described information fusion module;
Described information fusion module, for for the every frame video data received, when determining that it exists corresponding intelligent information, the intelligent information of correspondence is added to after on this frame video data, send to described coding module, otherwise, directly this frame video data is sent to described coding module;
Described coding module, for by receiving after each frame video data recodes according to predetermined compressed format respectively, sends to described synthesis module;
Described synthesis module, for being packaged into target code stream by each frame video data received.
7. device according to claim 6, is characterized in that,
Described information fusion module, for the every frame video data received, determines whether to have received intelligent information corresponding to this frame video data respectively; If so, then the intelligent information received is added on this frame video data; If not, then carry out intelligent information retrieval for this frame video data, and when extracting result and not being empty, the intelligent information extracted is added on this frame video data.
8. device according to claim 7, is characterized in that,
Described information fusion module is further used for, for the every frame video data received, determine whether respectively to have received intelligent information corresponding to this frame video data, if, then determine the intelligent information whether comprised in the intelligent information received needed for user further, if so, then by receive, intelligent information needed for user is added on this frame video data; If do not receive the intelligent information that this frame video data is corresponding, or the intelligent information do not comprised in the intelligent information received needed for user, then carry out the intelligent information retrieval needed for user for this frame video data, and when extracting result and be sky, by extract, intelligent information needed for user is added on this frame video data.
9. the device according to claim 6,7 or 8, is characterized in that,
Described information fusion module is for needing the every frame video data superposing intelligent information, respectively by abstract for the intelligent information of required superposition be image pixel information rule, and according to described image pixel information rule, this frame video data is modified, obtain this frame video data being superimposed with intelligent information.
10. device according to claim 9, is characterized in that,
Described image pixel information rule comprises: the pixel position of needs amendment and amended value.
CN201410482570.9A 2014-09-19 2014-09-19 A kind of format conversion method and device of video data CN105491386B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410482570.9A CN105491386B (en) 2014-09-19 2014-09-19 A kind of format conversion method and device of video data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410482570.9A CN105491386B (en) 2014-09-19 2014-09-19 A kind of format conversion method and device of video data

Publications (2)

Publication Number Publication Date
CN105491386A true CN105491386A (en) 2016-04-13
CN105491386B CN105491386B (en) 2019-05-28

Family

ID=55678053

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410482570.9A CN105491386B (en) 2014-09-19 2014-09-19 A kind of format conversion method and device of video data

Country Status (1)

Country Link
CN (1) CN105491386B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090310865A1 (en) * 2008-06-13 2009-12-17 Jenn Hwan Tarng Video Surveillance System, Annotation And De-Annotation Modules Thereof
CN102694985A (en) * 2011-03-22 2012-09-26 杭州普维光电技术有限公司 Information superposition method, information extraction method, apparatus and system of video images
CN103402100A (en) * 2013-08-23 2013-11-20 北京奇艺世纪科技有限公司 Video processing method and mobile terminal
CN103888840A (en) * 2014-03-27 2014-06-25 电子科技大学 Method and device for dragging and zooming video mobile terminal in real time

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090310865A1 (en) * 2008-06-13 2009-12-17 Jenn Hwan Tarng Video Surveillance System, Annotation And De-Annotation Modules Thereof
CN102694985A (en) * 2011-03-22 2012-09-26 杭州普维光电技术有限公司 Information superposition method, information extraction method, apparatus and system of video images
CN103402100A (en) * 2013-08-23 2013-11-20 北京奇艺世纪科技有限公司 Video processing method and mobile terminal
CN103888840A (en) * 2014-03-27 2014-06-25 电子科技大学 Method and device for dragging and zooming video mobile terminal in real time

Also Published As

Publication number Publication date
CN105491386B (en) 2019-05-28

Similar Documents

Publication Publication Date Title
US10275676B2 (en) Systems and methods for encoding image files containing depth maps stored as metadata
TWI449431B (en) Method,apparatus and computer program products for using parallelly decodable slices for multi-view video coding
KR101359381B1 (en) Data search, parser, and synchronization of video and telemetry data
US10200667B2 (en) Creating three dimensional graphics data
KR101288932B1 (en) Format for encoded stereoscopic image data file
EP2338278A2 (en) Systems and methods for video/multimedia rendering, composition, and user-interactivity
CN105263031A (en) System and method for distributing auxiliary data embedded in video data
US10339701B2 (en) Method, system and apparatus for generation and playback of virtual reality multimedia
WO2001084846A3 (en) Method and apparatus for transcoding an object-based coded picture signal into a block-based coded picture signal
NO20065381L (en) The process feed for encoding moving image data, the process feed for decoding terminal for performing this and two-way interactive system.
RU2014147445A (en) Data coding and decoding
WO2008054100A1 (en) Method and apparatus for decoding metadata used for playing stereoscopic contents
US9723317B2 (en) Method of generating media file and storage medium storing media file generation program
JP5372687B2 (en) Transmitting apparatus, transmitting method, receiving apparatus, and receiving method
RU2009141712A (en) Mosaic location of the displayed elements in the coding and decoding of the video
CN103098462A (en) Encoding method, display device, and decoding method
TW201642656A (en) Specifying visual dynamic range coding operations and parameters
US20090066785A1 (en) System and method for generating and reproducing 3d stereoscopic image file including 2d image
US9179124B2 (en) Method and apparatus for generating stereoscopic image data stream by using camera parameter, and method and apparatus for restoring stereoscopic image by using camera parameter
RU2011135541A (en) Video encoding packaging
KR101366091B1 (en) Method and apparatus for encoding and decoding image
AU2010231805B2 (en) Image signal decoding device, image signal decoding method, image signal encoding device, image signal encoding method, and program
KR101924662B1 (en) Display control method, recording medium, and display control device
WO2008081810A1 (en) Video encoding method, decoding method, device thereof, program thereof, and storage medium containing the program
WO2008088752A3 (en) System and method for encoding scrolling raster images

Legal Events

Date Code Title Description
PB01 Publication
C06 Publication
SE01 Entry into force of request for substantive examination
C10 Entry into substantive examination
GR01 Patent grant
GR01 Patent grant