CN102404577A

CN102404577A - Memory method for 3D (three-dimensional) video code

Info

Publication number: CN102404577A
Application number: CN2011103923999A
Authority: CN
Inventors: 林晓森; 李艳华
Original assignee: WUXI TAIHANG ELECTRONIC TECHNOLOGY Co Ltd
Current assignee: WUXI TAIHANG ELECTRONIC TECHNOLOGY Co Ltd
Priority date: 2011-12-01
Filing date: 2011-12-01
Publication date: 2012-04-04

Abstract

The invention provides a memory method for a 3D (three-dimensional) video code. For a section of 3D video with k viewpoints, the frame sequence of each of the viewpoints is supposed to be S1, S2 until Sk; the length of each frame sequence is M frame; the Fn-m represents the m[th] frame in the Sn frame sequences; the memory method comprises the following steps of: step 1, re-ordering all frames of the section of the 3D video; ordering the frames based on the priority of frame sequences and sub-priority of viewpoint sequences to acquire a full range of frames; step 2, inputting the full range of frames acquired by ordering in step 1 into a video coder to acquire a frame sequence F'; and step 3, packing the frame sequence F' acquired by coding in the step 2 by a media exchanging file format, namely an MXF (Media Exchange Format) format. The memory method disclosed by the invention has the advantages that: the method retains information of each viewpoint to the largest degree; the video of each viewpoint can be accurately synchronized and located in the playing process; and hard disk type memory media can be fully used due to the sequential memory manner.

Description

3D stereo scopic video coding storage means

Technical field

The present invention relates to a kind of 3D stereo scopic video coding storage means, belong to video coding and field of storage.

Background technology

The 3D three-dimensional video-frequency is a multi-view point video, describes the scene picture from time and space two angles.Therefore preserve the 3D three-dimensional video-frequency and need more data and more complicated structure.

The store method of at present common 3D three-dimensional video-frequency is that multi-view point video is stitched together at Spatial Dimension, and remaking is that a common planar video carries out code storage, abbreviates " picture mosaic " method as.For example, each frame of left and right sides viewpoint video is combined into one " about figure " side by side, perhaps " going up figure below " is a kind of common method.

This type " picture mosaic " mode has very big limitation.Resolution reduces greatly, and for example " about figure " picture mosaic mode resolution on level has reduced by 50%; " 3 * 3 " picture mosaic mode has all reduced by 2/3 in level and vertical direction resolution.If keep resolution on certain level, then the resolution of full figure possibly too highly cause the encoder encodes failure.In addition, " picture mosaic " method makes that the displaying principle of method for video coding and 3D display device is closely relevant, and the 3D three-dimensional video-frequency of making according to certain " picture mosaic " method can only be fit to certain type of 3D display device, and the 3D display device that is not suitable for other types uses.For example the video produced of " about figure " method is fit to the 3D display of vertical raster type, rather than the 3D display device of horizontal grating type or time division way.

Summary of the invention

The objective of the invention is to overcome the deficiency that exists in the prior art, a kind of coding and storing method of 3D three-dimensional video-frequency is provided, be used for the coding and the storage of many viewpoints 3D three-dimensional video-frequency.

According to technical scheme provided by the invention, said 3D stereo scopic video coding storage means is:

For one section 3D three-dimensional video-frequency with k viewpoint, suppose that the frame sequence of each viewpoint is respectively S1, S2 ..., Sk, the length of each frame sequence all is the M frame; Fn-m representes the m frame in the Sn frame sequence, and n is called the viewpoint sequence number, and m is called frame number; Operate as follows:

Step 1, with all frames rearrangement of this section 3D three-dimensional video-frequency, next mode of, viewpoint sequence number preferential according to frame number sorts:

F1-1,?F2-1,?…?,?Fk-1,

F1-2,?F2-2,?…?,?Fk-2,

F1-3,?F2-3,?…?,?Fk-3,

……

F1-M,?F2-M,?…?,?Fk-M,

The collating sequence of all frames is called " complete series frame ";

Step 2, the complete series frame input video encoder that the step 1 ordering is obtained obtain frame sequence F ', and the video encoder parameter selects to be made as one of following five kinds of patterns:

A) I frame coding is only done in setting, does not do P/B frame coding;

B) I/P frame coding is done in setting; For each P frame, appointment and its former frame in same view frame sequence be frame as a reference, and the I frame period that occurs in each view frame sequence is in full accord;

C) I/P frame coding is done in setting; For each P frame, specify in the complete series frame former frame frame as a reference; And the I frame only appears in the first view frame sequence, and other view frame sequences are all encoded as the P frame;

D) I/P/B frame coding is done in setting; For each P frame, appointment and its former frame in same view frame sequence be frame as a reference; For each B frame, each I/P frame of appointment and its front and back in same view frame sequence is frame as a reference; And the I/P/B frame period that occurs in each view frame sequence is in full accord;

E) I/P/B frame coding is done in setting; For each P frame, specify in the complete series frame former frame frame as a reference; For each B frame, each I/P frame frame as a reference before and after specifying in the complete series frame; And the I frame only appears in the first view frame sequence, and other view frame sequences are all encoded as the P/B frame;

Step 3, with the frame sequence F ' that step 2 coding obtains, adopt the exchange of media file format, i.e. MXF form packing, and this MXF form is followed following several rule:

F) in the MXF head, adding this MXF of sign is the metadata of " three-dimensional video-frequency "; The UUID of said metadata is the unique value of appointment;

G) in the MXF head, add a store video parameter metadata, the video parameter of being stored comprises: wide, high, frame per second, sample rate, picture ratio, frame length, code check, wherein sample rate=viewpoint number * frame per second;

H) at the concordance list section of MXF afterbody, to the frame sequence F ' 1-1 of first viewpoint, F ' 1-2, F ' 1-3 ... F ' 1-M indexes; Frame sequence to other viewpoints does not index.

Described video encoder can be for MFEG2, MFEG4, H.264 or video coding algorithm such as VC1.

Advantage of the present invention is: keep the information of each viewpoint to greatest extent, and can be accurately when playing synchronously with the video of each viewpoint of location, and sequential storage makes hard disk class storage medium can bring into play maximum performance.

Embodiment

Below in conjunction with accompanying drawing and embodiment the present invention is described further.

For one section 3D three-dimensional video-frequency with 2 viewpoints, frame per second is 24pfs, supposes that the frame sequence of each viewpoint is respectively S1, S2, and the length of each frame sequence all is 1000 frames.Set video encoder and adopt the MFEG-2 video coding algorithm.3D stereo scopic video coding storage means of the present invention is following.

1. with all frames rearrangement of this section 3D three-dimensional video-frequency, next mode of, viewpoint sequence number preferential according to frame number sorts:

F1-1,?F2-1,?F1-2,?F2-2,?F1-3,?F2-3,?…,?F1-1000,?F2-1000，

Above collating sequence is called " complete series frame ";

2. the complete series frame input MFEG-2 video encoder that the step 1 ordering is obtained, the video encoder parameter can be made as one of following five kinds of patterns:

A) I frame coding is only done in setting, does not do P/B frame coding;

E) I/P/B frame coding is done in setting; For each P frame, specify in the complete series frame former frame frame as a reference; For each B frame, each I/P frame frame as a reference before and after specifying in the complete series frame; And the I frame only appears in the first view frame sequence, and other view frame sequences are all encoded as the P/B frame."/" be or the meaning, be the conventional literary style in present technique field.

Here, the video encoder parameter setting is for doing I/P frame coding.For each P frame, appointment and its former frame in same view frame sequence be frame as a reference, and the reference frame of F1-2 is F1-1, and the reference frame of F2-2 is F2-1, and the reference frame of F1-3 is F1-2.And the I frame period that occurs in each view frame sequence is 12, that is to say F1-1, and F2-1, F1-13, F2-13 etc. are the I frames.

3. the frame sequence F ' that step 2 is obtained through video encoder encodes, adopt exchange of media file format (MXF form) packing, and this MXF follows following several rule:

A) in the MXF head, adding this MXF of sign is the metadata of " three-dimensional video-frequency ".The UUID of this metadata (general unique identifier) is: 0x060e2b340253010c0d01010101016300.

B) in the MXF head, add a store video parameter metadata, the video parameter of being stored comprises: key message such as wide, high, frame per second, sample rate, picture ratio, frame length, code check.Sample rate=38 wherein.

C) at the concordance list section of MXF afterbody, to the frame sequence of first viewpoint

F’1-1,?F’1-2,?F’1-3,?…，F’1-1000

Index.Frame sequence to other viewpoints does not index.

So just, accomplished the code storage of 3D three-dimensional video-frequency.The advantage of doing like this is:

1. the frame with each viewpoint adds frame sequence to video encoder encodes as an independent frame, rather than with giving video encoder encodes after the frame picture mosaic of a plurality of viewpoints, can preserve the image information of each viewpoint to greatest extent.

2. next method ordering of, viewpoint sequence number preferential according to frame number can be read when playing in proper order, can improve the reading efficiency and the reading speed of hard disk.

3. the pattern of step 2 a) is only done I frame coding, eliminates inter-frame dependencies, can random position arrive any frame and independent decoding.

4. the pattern b of step 2) and pattern d) do inter prediction in same view frame sequence, can improve code efficiency.And stipulated the synchronous appearance of the I/P/B frame in each view frame sequence, can improve the search and positioning efficient of video.

5. the pattern c of step 3) and pattern e) in full frame series, do inter prediction and coding, can be when decoding, and do not need a plurality of Video Decoder entities only with a Video Decoder entity.

6. the metadata that in the MXF top of file, adds " three-dimensional video-frequency " can be separated the quick identification file type with other MXF file areas of storing the 2D video with this MXF.

7. at concordance list section a frame sequence of MXF afterbody, reduced the number of index to first viewpoint.

Claims

1.3D the stereo scopic video coding storage means is characterized in that:

F1-1,?F2-1,?…?,?Fk-1,

F1-2,?F2-2,?…?,?Fk-2,

F1-3,?F2-3,?…?,?Fk-3,

……

F1-M,?F2-M,?…?,?Fk-M,

The collating sequence of all frames is called " complete series frame ";

Set and only do I frame coding, do not do P/B frame coding;

I/P frame coding is done in setting; For each P frame, appointment and its former frame in same view frame sequence be frame as a reference, and the I frame period that occurs in each view frame sequence is in full accord;

I/P frame coding is done in setting; For each P frame, specify in the complete series frame former frame frame as a reference; And the I frame only appears in the first view frame sequence, and other view frame sequences are all encoded as the P frame;

I/P/B frame coding is done in setting; For each P frame, appointment and its former frame in same view frame sequence be frame as a reference; For each B frame, each I/P frame of appointment and its front and back in same view frame sequence is frame as a reference; And the I/P/B frame period that occurs in each view frame sequence is in full accord;

I/P/B frame coding is done in setting; For each P frame, specify in the complete series frame former frame frame as a reference; For each B frame, each I/P frame frame as a reference before and after specifying in the complete series frame; And the I frame only appears in the first view frame sequence, and other view frame sequences are all encoded as the P/B frame;

In the MXF head, adding this MXF of sign is the metadata of " three-dimensional video-frequency "; The UUID of said metadata is the unique value of appointment;

In the MXF head, add a store video parameter metadata, the video parameter of being stored comprises: wide, high, frame per second, sample rate, picture ratio, frame length, code check, wherein sample rate=viewpoint number * frame per second;

At the concordance list section of MXF afterbody, to the frame sequence F ' 1-1 of first viewpoint, F ' 1-2, F ' 1-3 ... F ' 1-M indexes; Frame sequence to other viewpoints does not index.

H.264 or the VC1 video coding algorithm 2. 3D stereo scopic video coding storage means as claimed in claim 1 is characterized in that, said video encoder adopts MFEG2, MFEG4.