CN102404577A - Memory method for 3D (three-dimensional) video code - Google Patents

Info

Publication number
CN102404577A
Authority
CN
China
Prior art keywords
frame
sequence
video
coding
viewpoint
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011103923999A
Other languages
Chinese (zh)
Inventor
林晓森
李艳华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WUXI TAIHANG ELECTRONIC TECHNOLOGY Co Ltd
Original Assignee
WUXI TAIHANG ELECTRONIC TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WUXI TAIHANG ELECTRONIC TECHNOLOGY Co Ltd

Abstract

The invention provides a method for coding and storing 3D (three-dimensional) stereoscopic video. For a 3D video segment with k viewpoints, let the frame sequences of the viewpoints be S1, S2, ..., Sk, each M frames long, and let Fn-m denote the m-th frame of sequence Sn. The method comprises the following steps: step 1, reorder all frames of the segment by frame number first and viewpoint number second to obtain a complete series of frames; step 2, feed the complete series of frames obtained in step 1 into a video encoder to obtain a frame sequence F'; and step 3, package the frame sequence F' obtained in step 2 in the Material Exchange Format (MXF). The method has the following advantages: the information of every viewpoint is preserved to the greatest extent; the video of every viewpoint can be synchronized and located accurately during playback; and sequential storage lets hard-disk storage media deliver their full performance.

Description

Method for coding and storing 3D stereoscopic video
Technical field
The present invention relates to a method for coding and storing 3D stereoscopic video, and belongs to the field of video coding and storage.
Background
3D stereoscopic video is multi-view video: it describes a scene along both the temporal and the spatial dimension. Storing it therefore requires more data and a more complex structure than ordinary 2D video.
The common way to store 3D stereoscopic video today is to stitch the views together in the spatial dimension and then encode and store the result as an ordinary 2D video. This is referred to here as the "mosaic" method. A typical example places each pair of frames from the left and right views side by side in a "left-right" image, or stacks them in a "top-bottom" image.
The mosaic approach has serious limitations. Resolution drops sharply: a "left-right" mosaic halves the horizontal resolution of each view, and a "3 × 3" mosaic cuts the resolution of each view by 2/3 in both the horizontal and the vertical direction. If the per-view resolution is to be kept at a given level, the resolution of the full mosaic image may become so high that the encoder fails. In addition, the mosaic method ties the video coding method closely to the display principle of the 3D display device: a 3D stereoscopic video produced with a particular mosaic layout suits only one type of 3D display device and cannot be used on others. For example, video produced with the "left-right" layout suits vertical-grating 3D displays, but not horizontal-grating or time-division 3D displays.
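The resolution loss described above can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming a 1920×1080 mosaic frame (a size chosen only for illustration, not taken from the source) and a hypothetical helper named `per_view_resolution`:

```python
def per_view_resolution(full_width, full_height, cols, rows):
    """Resolution left for each view when cols x rows views share one frame."""
    return full_width // cols, full_height // rows


# "Left-right" mosaic: two views side by side in one frame (assumed 1920x1080).
print(per_view_resolution(1920, 1080, cols=2, rows=1))  # (960, 1080)

# "3 x 3" mosaic: nine views tiled in the same frame.
print(per_view_resolution(1920, 1080, cols=3, rows=3))  # (640, 360)
```

In the left-right case each view keeps half of the horizontal resolution; in the 3 × 3 case each view keeps one third of the resolution in each direction, which is the 2/3 reduction mentioned above.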
Summary of the invention
The object of the invention is to overcome the deficiencies of the prior art by providing a method for coding and storing multi-view 3D stereoscopic video.
According to the technical solution provided by the invention, the method for coding and storing 3D stereoscopic video is as follows:
For a 3D stereoscopic video segment with k viewpoints, let the frame sequences of the viewpoints be S1, S2, ..., Sk, each M frames long. Fn-m denotes the m-th frame of sequence Sn, where n is called the viewpoint number and m is called the frame number. The method proceeds as follows:
Step 1: reorder all frames of the segment by frame number first and viewpoint number second:
F1-1, F2-1, …, Fk-1,
F1-2, F2-2, …, Fk-2,
F1-3, F2-3, …, Fk-3,
……
F1-M, F2-M, …, Fk-M,
The resulting ordering of all frames is called the "complete series of frames";
Step 2: feed the complete series of frames obtained in step 1 into a video encoder to obtain the frame sequence F'. The encoder parameters are set to one of the following five modes:
A) I-frame coding only, with no P or B frames;
B) I/P-frame coding. Each P frame uses the preceding frame of its own viewpoint's frame sequence as its reference frame, and the I-frame period is identical in every viewpoint's frame sequence;
C) I/P-frame coding. Each P frame uses the preceding frame in the complete series of frames as its reference frame. I frames occur only in the first viewpoint's frame sequence, and all frames of the other viewpoints are coded as P frames;
D) I/P/B-frame coding. Each P frame uses the preceding frame of its own viewpoint's frame sequence as its reference frame. Each B frame uses the I/P frames immediately before and after it in its own viewpoint's frame sequence as its reference frames. The I/P/B-frame period is identical in every viewpoint's frame sequence;
E) I/P/B-frame coding. Each P frame uses the preceding frame in the complete series of frames as its reference frame. Each B frame uses the I/P frames immediately before and after it in the complete series of frames as its reference frames. I frames occur only in the first viewpoint's frame sequence, and all frames of the other viewpoints are coded as P or B frames;
Step 3: package the frame sequence F' obtained in step 2 in the Material Exchange Format (MXF), subject to the following rules:
F) the MXF header contains a metadata item identifying the file as "stereoscopic video"; the UUID of this metadata item is a designated unique value;
G) the MXF header contains a metadata item that stores the video parameters: width, height, frame rate, sample rate, aspect ratio, frame length, and bit rate, where sample rate = number of viewpoints × frame rate;
H) the index table segment in the MXF footer indexes the first viewpoint's frames F'1-1, F'1-2, F'1-3, …, F'1-M; the frames of the other viewpoints are not indexed.
The video encoder may use a video coding algorithm such as MPEG-2, MPEG-4, H.264, or VC-1.
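The frame-number-first, viewpoint-number-second ordering of step 1 can be sketched in a few lines. This is only an illustration under the document's notation (k viewpoints, M frames per viewpoint, Fn-m for frame m of viewpoint n); the function names `interleaved_order`, `position_of`, and `frame_at` are hypothetical and do not come from the source.

```python
def interleaved_order(k, M):
    """Return (n, m) pairs sorted by frame number m first, viewpoint n second."""
    return [(n, m) for m in range(1, M + 1) for n in range(1, k + 1)]


def position_of(n, m, k):
    """Zero-based position of frame Fn-m in the interleaved ordering."""
    return (m - 1) * k + (n - 1)


def frame_at(pos, k):
    """Inverse of position_of: the (n, m) pair found at a zero-based position."""
    q, r = divmod(pos, k)
    return r + 1, q + 1


order = interleaved_order(k=3, M=2)
print([f"F{n}-{m}" for n, m in order])
# ['F1-1', 'F2-1', 'F3-1', 'F1-2', 'F2-2', 'F3-2']
```

Because the position depends only on n, m, and k, a player can compute where any frame sits in the ordering without a lookup table, and all k viewpoints of one frame number are always adjacent.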
The advantages of the invention are that the information of every viewpoint is preserved to the greatest extent, the video of every viewpoint can be synchronized and located accurately during playback, and sequential storage lets hard-disk storage media deliver their full performance.
Embodiment
The invention is further described below with reference to the accompanying drawing and an embodiment.
Consider a 3D stereoscopic video segment with 2 viewpoints and a frame rate of 24 fps. Let the frame sequences of the viewpoints be S1 and S2, each 1000 frames long, and let the video encoder use the MPEG-2 video coding algorithm. The method of the invention then proceeds as follows.
1. Reorder all frames of the segment by frame number first and viewpoint number second:
F1-1, F2-1, F1-2, F2-2, F1-3, F2-3, …, F1-1000, F2-1000,
This ordering is called the "complete series of frames";
2. Feed the complete series of frames obtained in step 1 into the MPEG-2 video encoder. The encoder parameters can be set to one of the following five modes:
A) I-frame coding only, with no P or B frames;
B) I/P-frame coding. Each P frame uses the preceding frame of its own viewpoint's frame sequence as its reference frame, and the I-frame period is identical in every viewpoint's frame sequence;
C) I/P-frame coding. Each P frame uses the preceding frame in the complete series of frames as its reference frame. I frames occur only in the first viewpoint's frame sequence, and all frames of the other viewpoints are coded as P frames;
D) I/P/B-frame coding. Each P frame uses the preceding frame of its own viewpoint's frame sequence as its reference frame. Each B frame uses the I/P frames immediately before and after it in its own viewpoint's frame sequence as its reference frames. The I/P/B-frame period is identical in every viewpoint's frame sequence;
E) I/P/B-frame coding. Each P frame uses the preceding frame in the complete series of frames as its reference frame. Each B frame uses the I/P frames immediately before and after it in the complete series of frames as its reference frames. I frames occur only in the first viewpoint's frame sequence, and all frames of the other viewpoints are coded as P or B frames. Here "/" means "or", as is conventional in this technical field.
In this embodiment the encoder is set to I/P-frame coding (mode B). Each P frame uses the preceding frame of its own viewpoint's frame sequence as its reference frame: the reference frame of F1-2 is F1-1, that of F2-2 is F2-1, and that of F1-3 is F1-2. The I-frame period in each viewpoint's frame sequence is 12, so F1-1, F2-1, F1-13, F2-13, and so on are I frames.
3. Package the frame sequence F' produced by the video encoder in step 2 in the Material Exchange Format (MXF), subject to the following rules:
A) The MXF header contains a metadata item identifying the file as "stereoscopic video". The UUID (universally unique identifier) of this metadata item is 0x060e2b340253010c0d01010101016300.
B) The MXF header contains a metadata item that stores the key video parameters: width, height, frame rate, sample rate, aspect ratio, frame length, bit rate, and so on. Here sample rate = number of viewpoints × frame rate = 2 × 24 = 48.
C) The index table segment in the MXF footer indexes the first viewpoint's frame sequence
F'1-1, F'1-2, F'1-3, …, F'1-1000.
The frame sequences of the other viewpoints are not indexed.
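The I/P configuration chosen in step 2 of this embodiment can be illustrated as follows. This is a minimal sketch, assuming only what the embodiment states (2 viewpoints, an I-frame period of 12 within each viewpoint, each P frame referencing the preceding frame of the same viewpoint); the helper names `frame_type`, `reference_of`, and `label` are hypothetical, and no real encoder API is involved.

```python
K = 2           # number of viewpoints in the embodiment
I_PERIOD = 12   # I-frame period within each viewpoint, from the embodiment


def frame_type(n, m, i_period=I_PERIOD):
    """I frames fall on the same frame numbers in every viewpoint; the rest are P."""
    return "I" if (m - 1) % i_period == 0 else "P"


def reference_of(n, m, i_period=I_PERIOD):
    """A P frame references the preceding frame of the same viewpoint."""
    if frame_type(n, m, i_period) == "I":
        return None
    return (n, m - 1)


def label(frame):
    n, m = frame
    return f"F{n}-{m}"


for n, m in [(1, 1), (2, 1), (1, 2), (2, 2), (1, 3), (1, 13), (2, 13)]:
    ref = reference_of(n, m)
    print(label((n, m)), frame_type(n, m), "ref:", label(ref) if ref else "-")
```

Since the frames are interleaved by viewpoint, the reference frame of a P frame in this configuration sits exactly K positions earlier in the reordered sequence rather than immediately before it.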
This completes the coding and storage of the 3D stereoscopic video. The advantages of this approach are:
1. Each viewpoint's frame is fed to the video encoder as a separate frame, instead of stitching the frames of several viewpoints together before encoding, so the image information of every viewpoint is preserved to the greatest extent.
2. Because frames are ordered by frame number first and viewpoint number second, they can be read sequentially during playback, which improves hard-disk read efficiency and read speed.
3. Mode A of step 2 uses I-frame coding only, which removes inter-frame dependencies, so playback can seek to any frame and decode it independently.
4. Modes B and D of step 2 perform inter-frame prediction within each viewpoint's frame sequence, which improves coding efficiency. They also require the I/P/B frames to occur in step across all viewpoints' frame sequences, which makes seeking within the video more efficient.
5. Modes C and E of step 2 perform inter-frame prediction and coding across the complete series of frames, so decoding needs only one video decoder instance rather than several.
6. The "stereoscopic video" metadata in the MXF header distinguishes the file from MXF files that store 2D video, so the file type can be identified quickly.
7. The index table segment in the MXF footer indexes only the first viewpoint's frame sequence, which reduces the number of index entries.
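The effect of indexing only the first viewpoint can be sketched with a toy container. The code below is not MXF: it assumes a hypothetical layout in which every encoded frame is stored with a 4-byte length prefix, purely to show how one index entry per frame number is enough to reach all viewpoints of that frame number by reading sequentially. The function names and dummy payloads are illustrative assumptions.

```python
import io
import struct


def pack(frames_in_stored_order, k):
    """Write length-prefixed frames; index only the first viewpoint's frames."""
    buf = io.BytesIO()
    index = []  # index[m - 1] = byte offset of the first viewpoint's frame m
    for pos, payload in enumerate(frames_in_stored_order):
        if pos % k == 0:  # first viewpoint of a new frame number
            index.append(buf.tell())
        buf.write(struct.pack(">I", len(payload)))
        buf.write(payload)
    return buf.getvalue(), index


def read_frame_number(data, index, m, k):
    """Seek to frame number m through the index, then read k viewpoints in order."""
    stream = io.BytesIO(data)
    stream.seek(index[m - 1])
    views = []
    for _ in range(k):
        (length,) = struct.unpack(">I", stream.read(4))
        views.append(stream.read(length))
    return views


# Two viewpoints, three frame numbers, dummy payloads of varying size.
k = 2
frames = [b"L1", b"R1-longer", b"L2", b"R2", b"L3-long", b"R3"]
data, index = pack(frames, k)
print(len(index))                            # 3 entries for 6 stored frames
print(read_frame_number(data, index, 2, k))  # [b'L2', b'R2']
```

With k viewpoints the index holds one entry per frame number instead of k, and the frames of the remaining viewpoints are reached by a short sequential read from the indexed offset.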

Claims (2)

1. A method for coding and storing 3D stereoscopic video, characterized in that:
For a 3D stereoscopic video segment with k viewpoints, let the frame sequences of the viewpoints be S1, S2, ..., Sk, each M frames long. Fn-m denotes the m-th frame of sequence Sn, where n is called the viewpoint number and m is called the frame number. The method proceeds as follows:
Step 1: reorder all frames of the segment by frame number first and viewpoint number second:
F1-1, F2-1, …, Fk-1,
F1-2, F2-2, …, Fk-2,
F1-3, F2-3, …, Fk-3,
……
F1-M, F2-M, …, Fk-M,
The resulting ordering of all frames is called the "complete series of frames";
Step 2: feed the complete series of frames obtained in step 1 into a video encoder to obtain the frame sequence F'. The encoder parameters are set to one of the following five modes:
I-frame coding only, with no P or B frames;
I/P-frame coding. Each P frame uses the preceding frame of its own viewpoint's frame sequence as its reference frame, and the I-frame period is identical in every viewpoint's frame sequence;
I/P-frame coding. Each P frame uses the preceding frame in the complete series of frames as its reference frame. I frames occur only in the first viewpoint's frame sequence, and all frames of the other viewpoints are coded as P frames;
I/P/B-frame coding. Each P frame uses the preceding frame of its own viewpoint's frame sequence as its reference frame. Each B frame uses the I/P frames immediately before and after it in its own viewpoint's frame sequence as its reference frames. The I/P/B-frame period is identical in every viewpoint's frame sequence;
I/P/B-frame coding. Each P frame uses the preceding frame in the complete series of frames as its reference frame. Each B frame uses the I/P frames immediately before and after it in the complete series of frames as its reference frames. I frames occur only in the first viewpoint's frame sequence, and all frames of the other viewpoints are coded as P or B frames;
Step 3: package the frame sequence F' obtained in step 2 in the Material Exchange Format (MXF), subject to the following rules:
The MXF header contains a metadata item identifying the file as "stereoscopic video"; the UUID of this metadata item is a designated unique value;
The MXF header contains a metadata item that stores the video parameters: width, height, frame rate, sample rate, aspect ratio, frame length, and bit rate, where sample rate = number of viewpoints × frame rate;
The index table segment in the MXF footer indexes the first viewpoint's frames F'1-1, F'1-2, F'1-3, …, F'1-M; the frames of the other viewpoints are not indexed.
2. The method for coding and storing 3D stereoscopic video according to claim 1, characterized in that the video encoder uses the MPEG-2, MPEG-4, H.264, or VC-1 video coding algorithm.
CN2011103923999A 2011-12-01 2011-12-01 Memory method for 3D (three-dimensional) video code Pending CN102404577A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011103923999A CN102404577A (en) 2011-12-01 2011-12-01 Memory method for 3D (three-dimensional) video code

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011103923999A CN102404577A (en) 2011-12-01 2011-12-01 Memory method for 3D (three-dimensional) video code

Publications (1)

Publication Number Publication Date
CN102404577A true CN102404577A (en) 2012-04-04

Family

ID=45886297

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011103923999A Pending CN102404577A (en) 2011-12-01 2011-12-01 Memory method for 3D (three-dimensional) video code

Country Status (1)

Country Link
CN (1) CN102404577A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101243692A (en) * 2005-08-22 2008-08-13 三星电子株式会社 Method and apparatus for encoding multiview video
CN101711483A (en) * 2007-06-07 2010-05-19 升级芯片技术公司 The form that is used for encoded stereoscopic image data file
JP2010130690A (en) * 2008-11-28 2010-06-10 Korea Electronics Telecommun Apparatus for transceiving multi-view video image, and method therefor
CN101867816A (en) * 2010-05-10 2010-10-20 南京邮电大学 Stereoscopic video asymmetric compression coding method based on human-eye visual characteristic
CN101978699A (en) * 2008-01-25 2011-02-16 电子部品研究院 Stereoscopic video file format and computer readable recording medium in which stereoscopic video file is recorded according thereto
US20110234763A1 (en) * 2010-03-29 2011-09-29 Electronics And Telecommunications Research Institute Apparatus and method for transmitting/receiving multi-view stereoscopic video

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
吴小璇: "《DCI数字电影关键技术研究》", 31 December 2010 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107071301A (en) * 2017-04-17 2017-08-18 宁波视睿迪光电有限公司 The storage method and read method of a kind of many mesh imaging sequences
CN107071301B (en) * 2017-04-17 2019-11-22 宁波视睿迪光电有限公司 A kind of storage method and read method of more mesh imaging sequences
CN109274955A (en) * 2018-09-11 2019-01-25 曜科智能科技(上海)有限公司 The compression of light field video and depth map and synchronous method, system and electronic equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120404