US20100289876A1 - Stereoscopic video file format and computer readable recording medium in which stereoscopic video file is recorded according thereto - Google Patents
- Publication number
- US20100289876A1 (application US12/864,404)
- Authority
- US
- United States
- Prior art keywords
- box
- stereoscopic video
- stereoscopic
- information
- stores
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/10—Processing, recording or transmission of stereoscopic or multi-view image signals
- H04N13/106—Processing image signals
- H04N13/161—Encoding, multiplexing or demultiplexing different image signal components
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/10—Processing, recording or transmission of stereoscopic or multi-view image signals
- H04N13/106—Processing image signals
- H04N13/172—Processing image signals image signals comprising non-image signal components, e.g. headers or format information
- H04N13/178—Metadata, e.g. disparity information
Definitions
- the present invention relates to stereoscopic video, and more particularly, to a stereoscopic video file format capable of improving bit efficiency and processing efficiency and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded.
- JTC 1/SC 29/WG11 that is an image related standardization group organized by ISO/IEC has proposed a file format of a stereoscopic video for storing and reproducing stereoscopic video and has discussed a method of effectively optimizing the file format.
- the known stereoscopic video file format relates to an image of binocular disparity, which includes a variety of information such as information showing an arrangement form of a left image and a right image, information informing whether or not to include a mono image such as intermediate insertion advertisement, recommendation display information, and camera parameter information, information on whether or not to refer to other images and information indicating what image is referred to.
- the known stereoscopic video file format generally includes recommendation display information and camera parameter information for each image, so overlapping portions are not excluded; this increases the data amount and degrades efficiency.
- ES element stream
- FIG. 1 is a structural diagram showing stereoscopic video stream structure of 2ES according to the related art.
- the stereoscopic video stream of 2ES includes two ESs: a left ES and a right ES.
- Each ES may include a plurality of stereoscopic image frames S 1 to S n and intermediate inserted mono image frames M 1 to M n .
- Each stereoscopic video stream is stored in a place called a trak box together with the stereoscopic camera and display information (SCDI) that shows the camera parameter and recommendation display information.
- SCDI stereoscopic camera and display information
- the stereoscopic camera and display information overlaps almost entirely or completely between the left ES and the right ES. Since one trak box cannot refer to another trak box, the structure according to the related art cannot eliminate this overlap.
- the present invention has been made in an effort to provide a stereoscopic video file format capable of minimizing overlapping data and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded.
- the present invention has been made in an effort to provide a stereoscopic video file format capable of optimizing a configuration of a file format in a simple format and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded.
- the present invention has been made in an effort to provide a stereoscopic video file format capable of facilitating a file format design of a specific image by enabling initial map configuration and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded.
- An exemplary embodiment of the present invention provides stereoscopic video file format, including: a file type box that stores file format information and information indicating whether or not to include monoscopic data; a movie box that stores a plurality of trak boxes configuring stereoscopic video streams; a media data box that stores multimedia resources; a stereoscopic video media information box that stores at least one common stereoscopic video stream arrangement information; a stereoscopic camera and display information reference box that stores camera parameter and recommendation display information referenced by the plurality of trak boxes; a stereoscopic camera and display information box that exists at each stereoscopic image frame of the stereoscopic video stream and stores reference information of the stereoscopic camera and display information reference box; and a meta box that stores metadata.
- the stereoscopic video file format further includes a stereoscopic video media information extension box that designates redefinition of the stereoscopic video stream arrangement information for expansion through the redefinition.
- a stereoscopic video file format including: a first area that stores a plurality of stereoscopic image frames configuring stereoscopic video streams; a second area that stores common stream arrangement information on the plurality of stereoscopic image frames; a third area that stores camera parameter and recommendation display information to be referenced by at least one stereoscopic image frame; and a fourth area that exists at each of the stereoscopic image frames and stores the reference information of the third area.
- Yet another exemplary embodiment of the present invention provides a stereoscopic video file format, including: a trak box that stores stereoscopic video streams; a stereoscopic video media information (svmi) box that stores arrangement information on the stereoscopic video stream; and a stereoscopic camera and display information (scdi) box that stores stereoscopic camera and recommendation display information, wherein the trak box refers to at least one of the svmi box and the scdi box.
- the trak box may include a reference box that stores identification information on the svmi box and the scdi box for referring to at least one of the svmi box and the scdi box.
- Still yet another exemplary embodiment of the present invention provides a computer readable recording medium recording a stereoscopic video file, including: a file type box that stores file format information and information indicating whether or not to include monoscopic data; a movie box that stores a plurality of trak boxes configuring stereoscopic video streams; a media data box that stores multimedia resources; a stereoscopic video media information box that stores at least one common stereoscopic video stream arrangement information; a stereoscopic camera and display information reference box that stores camera parameter and recommendation display information referenced by the plurality of trak boxes; a stereoscopic camera and display information box that exists at each stereoscopic image frame of the stereoscopic video stream and stores reference information of the stereoscopic camera and display information reference box; and a meta box that stores metadata.
- a stereoscopic video file format and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded collect and store the overlapping information existing at each stereoscopic image frame in one area and refer to that information through a link at decoding time, such that the overlapping information is removed from the stereoscopic video file, thereby making it possible to improve the bit efficiency and the processing efficiency.
- the present invention simplifies the sophisticated structure of the stereoscopic video file and provides the optimized stereoscopic video file format, thereby making it possible to contribute to standardizing the art.
- FIG. 1 is a structural diagram of a stereoscopic video stream structure in a 2ES mode according to the related art.
- FIG. 2 is a structural diagram showing a structure of a stereoscopic video file format according to an exemplary embodiment of the present invention.
- the stereoscopic video file format may be configured to include a file type (ftyp) box 100 that stores file format information and information indicating whether or not to include monoscopic data; a movie (moov) box 200 that stores a plurality of trak boxes 200 b configuring stereoscopic video streams; a media data (mdat) box 300 that stores multimedia resources; and a first meta box 400 that is configured of meta data.
- ftyp file type
- moov movie
- the stereoscopic video stream may be configured of at least one element stream (ES).
- the file type box 100 may be configured in the following [Table 1].
- the file type box 100 shows the file type and may include an identifier such as “ss01” and “ss02” in order to indicate whether the monoscopic data are included in the stereoscopic contents.
- the monoscopic data may be data of advertisement images.
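As a minimal sketch of how a reader might act on the brand identifier, the Python below extracts the major brand from an ftyp box and maps it to a flag. It assumes the usual ISO base media layout of an ftyp box (32-bit size, 4-character type, 4-character major brand) and assumes that "ss01" denotes content without partial monoscopic data and "ss02" content with it; the function names are hypothetical and not part of the described format.

```python
import struct

# Assumed mapping of brand identifier to "contains partial monoscopic data".
BRAND_HAS_MONO = {
    b"ss01": False,  # stereoscopic content without partial monoscopic data
    b"ss02": True,   # stereoscopic content with partial monoscopic data
}

def read_major_brand(box: bytes) -> bytes:
    """Return the 4-byte major brand of an ftyp box (size, type, major_brand, ...)."""
    if len(box) < 12:
        raise ValueError("ftyp box too short")
    _size, box_type = struct.unpack_from(">I4s", box, 0)
    if box_type != b"ftyp":
        raise ValueError(f"expected ftyp box, got {box_type!r}")
    return box[8:12]

def has_monoscopic_data(major_brand: bytes) -> bool:
    """Return True if the brand signals that monoscopic data are included."""
    if major_brand not in BRAND_HAS_MONO:
        raise ValueError(f"not a known stereoscopic brand: {major_brand!r}")
    return BRAND_HAS_MONO[major_brand]
```

A player could call `has_monoscopic_data(read_major_brand(data))` once at open time to decide whether it needs to handle stereo/mono switching at all.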
- the movie box 200 may further include a movie header (mvhd) box 200 a that is header information on images.
- mvhd movie header
- the movie box 200 may further include at least one mono data that are inserted into the trak boxes.
- the trak box 200 b may include a trak header (trhd) box 210 that stores the overall information on the corresponding trak box 200 b and a media (mdia) box 220 that is a container storing media information in the trak box 200 b .
- the trak header box 210 may include a trak reference (tref) box 212 that is a trak reference container.
- the media box 220 may include a media header (mdhd) box 222 that stores the overall information on media; a handler (hdlr) box 224 that shows a media type such as audio data or video data; and a media information (minf) box 226 that is a media information container.
- the stereoscopic video file format may further include: a stereoscopic video media information (svmi) box 410 that stores at least one stereoscopic video stream arrangement information; and at least one stereoscopic camera and display information reference (scdr) boxes 420 - 1 to 420 - n that store camera parameter and recommendation display information referenced by the plurality of trak boxes 200 b.
- the svmi box 410 and the scdr boxes 420 - 1 to 420 - n may be included in the first meta box 400 .
- the first meta box 400 may further include an item location (iloc) box 430 that stores item locations and an item information (iinf) box 440 that stores the item information.
- the stereoscopic video file format may further include a stereoscopic video media information extension (svi2) box 232 that designates redefinition of the stereoscopic video stream arrangement information that facilitates expansion through the redefinition; and a stereoscopic camera and display information box (scdi box) that may link one of the scdr boxes 420 - 1 to 420 - n or designate the redefinition of the camera parameter and recommendation display information.
- scdi box stereoscopic camera and display information box
- the svi2 box 232 and the scdi box 234 may be included in the second meta box 230 that is stored in the trak box 200 b.
- the figure shows the svmi box 410 and the scdr boxes 420 - 1 to 420 - n in the first meta box 400 , and the svi2 box 232 and the scdi box 234 in the second meta box 230 stored in the trak box 200 b . However, the svmi box 410 , the scdr boxes 420 - 1 to 420 - n , the svi2 box 232 , and the scdi box 234 need not be included in a specific meta box and may be placed flexibly according to the situation.
- the svmi box 410 and the scdr boxes 420 - 1 to 420 - n may be included in the movie box 200 , and the svmi box 410 , the scdr boxes 420 - 1 to 420 - n , the svi2 box 232 , and the scdi box 234 may exist independently at the outside rather than being included in any box.
- the information included in the boxes may be separated into a separate box form by a predetermined unit or reference.
- a portion related to the camera parameter information of the scdr box or the scdi box and a portion related to the recommendation display information may be separated into a separate box form.
- the svmi box storing the arrangement information of the stereoscopic video stream or the scdi box storing the stereo camera and recommendation display information may exist outside the trak box, such that the trak box can refer to the svmi box or the scdi box at the outside.
- the svmi box or the scdi box may be included in another trak box.
- the svmi box or the scdi box may exist at the outside for the left image and in the trak box for the right image.
- the svmi box 410 may be configured in a format such as the syntax of the following [Table 2].
- the bit size allocated to each field is by way of example only and therefore, is not limited thereto. It can be apparent to those skilled in the art that the bit size can be variously changed if necessary.
- the svmi box 410 may include common information on each content.
- a field value of “stereoscopic_composition_type” showing the element (ES) stream structure of the stereoscopic video file format may be defined by the following [Table 3].
- when the value of “stereoscopic_composition_type” is 0, 1, or 2, “is_left_first”, which indicates whether the left image frame comes first, and “stereo_mono_change”, which is identification information on the stereoscopic image frame and the mono image frame, may be commonly used.
- when the value of “stereoscopic_composition_type” is 3, “is_left_first” cannot be commonly used, such that the information on the svi2 box 232 of the corresponding trak may be used.
- the “stereo_mono_change” may be commonly used or may not be commonly used. If it is determined that the “stereo_mono_change” is not commonly used, the information on the svi2 box 232 of the corresponding trak may be used.
- in the case of 1ES, all the trak boxes 200 b refer to the information on the svmi box 410 , and in the case of 2ES, each trak box 200 b may use the information on the svi2 box 232 that is stored in that trak box 200 b.
- the svi2 box 232 may be configured in a format such as the syntax of the following [Table 4].
- the svi2 box 232 stores only the information specific to each trak, based on the information on the svmi box 410 declared at the uppermost level. This information may be updated if necessary. For example, if “is_update_flag”, a field indicating whether the “stereo_mono_count” is updated, is set, the “stereo_mono_count” information of the corresponding trak is written. Further, in the case of 2ES, the value of “is_left_first” may be designated as in [Table 5].
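The fallback between the common svmi information and the per-trak svi2 information can be sketched as follows. This is only an illustration under stated assumptions: the `Svmi` and `Svi2` classes are hypothetical in-memory records, not the box layouts of [Table 2] or [Table 4], and the rule implemented is the one described above (common value for composition types 0 to 2, per-trak value for type 3).

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Svmi:
    """Hypothetical record of the common (file-level) svmi fields."""
    composition_type: int
    is_left_first: Optional[bool]  # not carried when composition_type == 3

@dataclass
class Svi2:
    """Hypothetical record of the per-trak svi2 fields."""
    is_left_first: Optional[bool] = None

def effective_is_left_first(svmi: Svmi, svi2: Optional[Svi2]) -> bool:
    """Pick the is_left_first value that applies to one trak."""
    if svmi.composition_type != 3:
        # 1ES: every trak shares the value declared in the svmi box.
        if svmi.is_left_first is None:
            raise ValueError("svmi box lacks is_left_first for a 1ES stream")
        return svmi.is_left_first
    # 2ES: the common value cannot be used, so the trak's svi2 box decides.
    if svi2 is None or svi2.is_left_first is None:
        raise ValueError("2ES trak needs is_left_first in its svi2 box")
    return svi2.is_left_first
```

Keeping the decision in one function means a decoder consults the trak-level box only when the common value cannot apply.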
- the scdr boxes 420 - 1 to 420 - n may be configured in a format such as the syntax of the following [Table 6].
- the scdr boxes 420 - 1 to 420 - n may be generated by the number of combinations of displays and cameras used in the video sequence. Therefore, the scdr boxes 420 - 1 to 420 - n are a reference of the scdi box 234 to be used in the entire file and may include the corresponding identification information scdr_ID.
- the scdi box 234 may be configured in a format such as the syntax of the following [Table 7].
- is_scdr_ID_ref indicates whether or not to use the display and camera information of the scdr boxes 420 - 1 to 420 - n and “ref_scdr_ID” indicates the ID of the scdr boxes 420 - 1 to 420 - n that are referenced.
- the scdi box 234 uses the display and camera information of one 420 - i of the scdr boxes 420 - 1 to 420 - n and thus, can search the scdr box 420 - i having the identification information on “ref_scdr_ID” and use the information.
- the svi2 box 232 and the scdi box 234 existing in each stereoscopic image frame may link to the svmi box 410 and the scdr boxes 420 - 1 to 420 - n at the meta box level and redefine the information therein.
- the overlapping information of the trak box level is collected in the meta box level and is linked or redefined in the trak box level, thereby making it possible to minimize the overlapping data.
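The link-or-redefine behavior of the scdi box can be sketched as a lookup by identifier. The Python below is a hedged illustration: the `Scdr` and `Scdi` classes and the `camera_display_info` dictionary are hypothetical stand-ins for the contents of [Table 6] and [Table 7]; only the names `scdr_ID`, `is_scdr_ID_ref`, and `ref_scdr_ID` come from the text.

```python
from dataclasses import dataclass
from typing import Dict, Optional

@dataclass
class Scdr:
    """Shared reference box: one per display/camera combination."""
    scdr_id: int
    camera_display_info: dict  # placeholder for camera and display parameters

@dataclass
class Scdi:
    """Per-trak box that either links an scdr box or redefines the data."""
    is_scdr_id_ref: bool
    ref_scdr_id: Optional[int] = None
    camera_display_info: Optional[dict] = None  # used only when redefined

def resolve_scdi(scdi: Scdi, scdr_by_id: Dict[int, Scdr]) -> dict:
    """Return the camera/display information that applies to this scdi box."""
    if scdi.is_scdr_id_ref:
        scdr = scdr_by_id.get(scdi.ref_scdr_id)
        if scdr is None:
            raise ValueError(f"no scdr box with ID {scdi.ref_scdr_id}")
        return scdr.camera_display_info
    if scdi.camera_display_info is None:
        raise ValueError("scdi box neither references an scdr box nor redefines data")
    return scdi.camera_display_info
```

With this arrangement the left and right traks of a 2ES file can both carry `Scdi(True, 1)` and resolve to the same stored parameters, which is the duplication the format aims to remove.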
- the stereoscopic video file according to the stereoscopic video file format according to the present invention may be stored in a recording medium (for example, CD-ROM, RAM, floppy disk, hard disk, a magneto-optical disk, flash memory, etc.) in a computer-readable type.
- the present invention relates to a stereoscopic video file format capable of improving bit efficiency and processing efficiency and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded, which can be used in an image technology industry or a digital technology industry.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Library & Information Science (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
Abstract
Description
- The present invention relates to stereoscopic video, and more particularly, to a stereoscopic video file format capable of improving bit efficiency and processing efficiency and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded.
- With the development of image technology and digital technology, stereoscopic video technology, which makes images more lifelike and heightens the viewer's sense of realism, has been developing.
- In particular, ‘JTC 1/SC 29/WG11’ that is an image related standardization group organized by ISO/IEC has proposed a file format of a stereoscopic video for storing and reproducing stereoscopic video and has discussed a method of effectively optimizing the file format.
- The known stereoscopic video file format relates to an image of binocular disparity, which includes a variety of information such as information showing an arrangement form of a left image and a right image, information informing whether or not to include a mono image such as intermediate insertion advertisement, recommendation display information, and camera parameter information, information on whether or not to refer to other images and information indicating what image is referred to.
- However, the known stereoscopic video file format generally includes recommendation display information and camera parameter information for each image, so overlapping portions are not excluded; this increases the data amount and degrades efficiency. In particular, when a video stream, called an element stream (ES), is of the 2ES type, in which the left image and the right image are each encoded separately, almost all or all of the corresponding information of the left image and the right image overlaps.
- FIG. 1 is a structural diagram showing stereoscopic video stream structure of 2ES according to the related art.
- As shown in FIG. 1, the stereoscopic video stream of 2ES includes two ESs: a left ES and a right ES. Each ES may include a plurality of stereoscopic image frames S1 to Sn and intermediate inserted mono image frames M1 to Mn.
- Each stereoscopic video stream is stored in a place called a trak box together with the stereoscopic camera and display information (SCDI) that shows the camera parameter and recommendation display information. In this case, the stereoscopic camera and display information overlaps almost entirely or completely between the left ES and the right ES. Since one trak box cannot refer to another trak box, the structure according to the related art cannot eliminate this overlap.
- In addition, when one image within one ES refers to other images, when too many images refer to one image, or when the reference relations are entangled, the complexity of the file format increases.
- In this case, in order to see what image refers to what image during a process of analyzing the file by the computer, the reference information on all the images should be analyzed, such that it is impossible to configure an initial map.
- The present invention has been made in an effort to provide a stereoscopic video file format capable of minimizing overlapping data and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded.
- Further, the present invention has been made in an effort to provide a stereoscopic video file format capable of optimizing a configuration of a file format in a simple format and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded.
- In addition, the present invention has been made in an effort to provide a stereoscopic video file format capable of facilitating a file format design of a specific image by enabling initial map configuration and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded.
- An exemplary embodiment of the present invention provides stereoscopic video file format, including: a file type box that stores file format information and information indicating whether or not to include monoscopic data; a movie box that stores a plurality of trak boxes configuring stereoscopic video streams; a media data box that stores multimedia resources; a stereoscopic video media information box that stores at least one common stereoscopic video stream arrangement information; a stereoscopic camera and display information reference box that stores camera parameter and recommendation display information referenced by the plurality of trak boxes; a stereoscopic camera and display information box that exists at each stereoscopic image frame of the stereoscopic video stream and stores reference information of the stereoscopic camera and display information reference box; and a meta box that stores metadata.
- The stereoscopic video file format further includes a stereoscopic video media information extension box that designates redefinition of the stereoscopic video stream arrangement information for expansion through the redefinition.
- Another exemplary embodiment of the present invention provides a stereoscopic video file format, including: a first area that stores a plurality of stereoscopic image frames configuring stereoscopic video streams; a second area that stores common stream arrangement information on the plurality of stereoscopic image frames; a third area that stores camera parameter and recommendation display information to be referenced by at least one stereoscopic image frame; and a fourth area that exists at each of the stereoscopic image frames and stores the reference information of the third area.
- Yet another exemplary embodiment of the present invention provides a stereoscopic video file format, including: a trak box that stores stereoscopic video streams; a stereoscopic video media information (svmi) box that stores arrangement information on the stereoscopic video stream; and a stereoscopic camera and display information (scdi) box that stores stereoscopic camera and recommendation display information, wherein the trak box refers to at least one of the svmi box and the scdi box.
- The trak box may include a reference box that stores identification information on the svmi box and the scdi box for referring to at least one of the svmi box and the scdi box.
- Still yet another exemplary embodiment of the present invention provides a computer readable recording medium recording a stereoscopic video file, including: a file type box that stores file format information and information indicating whether or not to include monoscopic data; a movie box that stores a plurality of trak boxes configuring stereoscopic video streams; a media data box that stores multimedia resources; a stereoscopic video media information box that stores at least one common stereoscopic video stream arrangement information; a stereoscopic camera and display information reference box that stores camera parameter and recommendation display information referenced by the plurality of trak boxes; a stereoscopic camera and display information box that exists at each stereoscopic image frame of the stereoscopic video stream and stores reference information of the stereoscopic camera and display information reference box; and a meta box that stores metadata.
- According to the exemplary embodiments of the present invention, a stereoscopic video file format and a computer-readable recording medium in which the corresponding stereoscopic video file is recorded collect and store the overlapping information existing at each stereoscopic image frame in one area and refer to that information through a link at decoding time, such that the overlapping information is removed from the stereoscopic video file, thereby making it possible to improve the bit efficiency and the processing efficiency.
- Further, the present invention simplifies the sophisticated structure of the stereoscopic video file and provides the optimized stereoscopic video file format, thereby making it possible to contribute to standardizing the art.
- FIG. 1 is a structural diagram of a stereoscopic video stream structure in a 2ES mode according to the related art; and
- FIG. 2 is a structural diagram showing a structure of a stereoscopic video file format according to an exemplary embodiment of the present invention.
- Exemplary embodiments of the present invention will now be described in detail with reference to the accompanying drawings. It is to be noted that like components are denoted by like reference numerals throughout the drawings. Moreover, detailed descriptions of well-known functions or configurations will be omitted so as not to unnecessarily obscure the subject matter of the present invention.
- FIG. 2 is a structural diagram showing a structure of a stereoscopic video file format according to an exemplary embodiment of the present invention.
- As shown in FIG. 2, the stereoscopic video file format according to an exemplary embodiment of the present invention may be configured to include a file type (ftyp) box 100 that stores file format information and information indicating whether or not to include monoscopic data; a movie (moov) box 200 that stores a plurality of trak boxes 200 b configuring stereoscopic video streams; a media data (mdat) box 300 that stores multimedia resources; and a first meta box 400 that is configured of metadata.
- The stereoscopic video stream may be configured of at least one element stream (ES).
- The file type box 100 may be configured as in the following [Table 1].

TABLE 1
Types   Specifications
ss01    Stereoscopic content without partial monoscopic data
ss02    Stereoscopic content with partial monoscopic data

- In other words, the file type box 100 shows the file type and may include an identifier such as “ss01” or “ss02” in order to indicate whether monoscopic data are included in the stereoscopic contents. The monoscopic data may be data of advertisement images.
- The movie box 200 may further include a movie header (mvhd) box 200 a that is header information on images.
- The movie box 200 may further include at least one mono data that are inserted into the trak boxes.
- The trak box 200 b may include a trak header (trhd) box 210 that stores the overall information on the corresponding trak box 200 b and a media (mdia) box 220 that is a container storing media information in the trak box 200 b. The trak header box 210 may include a trak reference (tref) box 212 that is a trak reference container. The media box 220 may include a media header (mdhd) box 222 that stores the overall information on media; a handler (hdlr) box 224 that shows a media type such as audio data or video data; and a media information (minf) box 226 that is a media information container.
- The stereoscopic video file format according to the exemplary embodiment of the present invention may further include: a stereoscopic video media information (svmi) box 410 that stores at least one stereoscopic video stream arrangement information; and at least one stereoscopic camera and display information reference (scdr) boxes 420-1 to 420-n that store camera parameter and recommendation display information referenced by the plurality of trak boxes 200 b.
- The svmi box 410 and the scdr boxes 420-1 to 420-n may be included in the first meta box 400. The first meta box 400 may further include an item location (iloc) box 430 that stores item locations and an item information (iinf) box 440 that stores the item information.
- The stereoscopic video file format according to an exemplary embodiment of the present invention may further include a stereoscopic video media information extension (svi2) box 232 that designates redefinition of the stereoscopic video stream arrangement information that facilitates expansion through the redefinition; and a stereoscopic camera and display information box (scdi box) that may link one of the scdr boxes 420-1 to 420-n or designate the redefinition of the camera parameter and recommendation display information.
- The svi2 box 232 and the scdi box 234 may be included in the second meta box 230 that is stored in the trak box 200 b.
- The figure shows the svmi box 410 and the scdr boxes 420-1 to 420-n in the first meta box 400, and the svi2 box 232 and the scdi box 234 in the second meta box 230 stored in the trak box 200 b. However, the svmi box 410, the scdr boxes 420-1 to 420-n, the svi2 box 232, and the scdi box 234 need not be included in a specific meta box and may be placed flexibly according to the situation. For example, the svmi box 410 and the scdr boxes 420-1 to 420-n may be included in the movie box 200, and the svmi box 410, the scdr boxes 420-1 to 420-n, the svi2 box 232, and the scdi box 234 may exist independently at the outside rather than being included in any box.
- Alternatively, when a significant amount of information is included in the boxes and the processing time is therefore delayed, the information included in the boxes may be separated into separate boxes by a predetermined unit or reference. For example, the portion of the scdr box or the scdi box related to the camera parameter information and the portion related to the recommendation display information may be separated into separate boxes. Alternatively, the svmi box storing the arrangement information of the stereoscopic video stream or the scdi box storing the stereo camera and recommendation display information may exist outside the trak box, such that the trak box can refer to the svmi box or the scdi box at the outside. In this case, the svmi box or the scdi box may be included in another trak box. For example, the svmi box or the scdi box may exist at the outside for the left image and in the trak box for the right image.
- The svmi box 410 may be configured in a format such as the syntax of the following [Table 2]. The bit size allocated to each field is given by way of example only and is not limited thereto. It will be apparent to those skilled in the art that the bit sizes can be changed as necessary.
TABLE 2 <Syntax of svmi box>

    aligned(8) class StereoscopicVideoMediaInformationBox extends FullBox('svmi', version = 0, 0) {
        // stereoscopic visual type information
        unsigned int(8) stereoscopic_composition_type;
        if (stereoscopic_composition_type != 3) {
            unsigned int(1) is_left_first;
            unsigned int(7) reserved;
        } else
            unsigned int(8) reserved;
        // stereoscopic contents information
        unsigned int(32) stereo_mono_change_count;
        for (i=0; i<stereo_mono_change_count; i++) {
            unsigned int(32) sample_count;
            unsigned int(1) stereo_flag;
            unsigned int(7) reserved;
        }
    }

- Reviewing the above [Table 2], the svmi box 410 may include information that is common to each content.
- A field value of "stereoscopic_composition_type", which indicates the elementary stream (ES) structure of the stereoscopic video file format, may be defined by the following [Table 3].
TABLE 3

stereoscopic_composition_type | Identification |
---|---|
0 | Only 1 ES exists; side-by-side format |
1 | Only 1 ES exists; vertical line integrated format |
2 | Only 1 ES exists; frame sequential format |
3 | 2 ES exist; monoscopic image |

- When the value of "stereoscopic_composition_type" is 0, 1, or 2, "is_left_first", which indicates whether the left image frame comes first, and "stereo_mono_change", which is identification information on the stereoscopic image frames and the mono image frames, may be commonly used.
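The svmi syntax of [Table 2] and the type values of [Table 3] can be read together as a small parser. The sketch below is illustrative only: it assumes the payload starts right after the box size and type (so it begins with the 8-bit version and 24-bit flags of the FullBox), that multi-byte fields are big-endian, and that each 1-bit field occupies the most significant bit of its byte. The names `SvmiInfo`, `parse_svmi`, and `COMPOSITION_TYPES` are hypothetical, and the field widths are the example widths of [Table 2].

```python
import struct
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Labels taken from [Table 3]; the dictionary itself is illustrative.
COMPOSITION_TYPES = {
    0: "side-by-side format (1 ES)",
    1: "vertical line integrated format (1 ES)",
    2: "frame sequential format (1 ES)",
    3: "monoscopic images (2 ES)",
}


@dataclass
class SvmiInfo:
    composition_type: int
    is_left_first: Optional[bool]      # None when composition_type == 3
    changes: List[Tuple[int, bool]]    # (sample_count, stereo_flag) entries


def parse_svmi(payload: bytes) -> SvmiInfo:
    """Parse an svmi payload (the bytes after the size/type header) per [Table 2]."""
    pos = 4  # skip FullBox version (8 bits) and flags (24 bits)
    composition_type, flags_byte = struct.unpack_from(">BB", payload, pos)
    pos += 2
    is_left_first = None
    if composition_type != 3:
        # Assumed bit order: the 1-bit field is the top bit of the byte.
        is_left_first = bool(flags_byte >> 7)
    (count,) = struct.unpack_from(">I", payload, pos)
    pos += 4
    changes = []
    for _ in range(count):
        sample_count, flag_byte = struct.unpack_from(">IB", payload, pos)
        pos += 5
        changes.append((sample_count, bool(flag_byte >> 7)))
    return SvmiInfo(composition_type, is_left_first, changes)


# Usage with a hand-built payload: side-by-side, left image first, two entries.
example = (
    bytes(4)
    + bytes([0, 0x80])
    + struct.pack(">I", 2)
    + struct.pack(">IB", 300, 0x80)
    + struct.pack(">IB", 120, 0x00)
)
info = parse_svmi(example)
```

When the composition type is 3, the parser leaves `is_left_first` unset, which mirrors the syntax: in that case the byte after the type is entirely reserved.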
- Meanwhile, when the value of "stereoscopic_composition_type" is 3, "is_left_first" cannot be commonly used, so the information in the svi2 box 232 of the corresponding trak may be used. In addition, the "stereo_mono_change" may or may not be commonly used. If it is determined that the "stereo_mono_change" is not commonly used, the information in the svi2 box 232 of the corresponding trak may be used.
- In other words, in the case of 1ES, all the trak boxes 200 b refer to the information in the svmi box 410, and in the case of 2ES, each trak box 200 b may use the information in the svi2 box 232 that is stored in that trak box 200 b.
- The svi2 box 232 may be configured in a format such as the syntax of the following [Table 4].
TABLE 4 <Syntax of svi2 box>

    aligned(8) class StereoscopicVideoMediaInformationBoxEx extends FullBox('svi2', version = 0, 0) {
        // stereoscopic visual type information
        unsigned int(1) is_left_first;
        unsigned int(1) is_update_flag;
        unsigned int(6) reserved;
        if (is_update_flag) {
            // stereoscopic fragment information
            unsigned int(32) stereo_mono_change_count;
            for (i=0; i<stereo_mono_change_count; i++) {
                unsigned int(32) sample_count;
                unsigned int(1) stereo_flag;
                unsigned int(7) reserved;
            }
        }
    }

- Referring to the above [Table 4], the svi2 box 232 stores only the information specific to the characteristics of each trak, based on the information in the svmi box 410 declared at the uppermost level. This information may be updated if necessary. For example, if "is_update_flag", the field indicating whether the "stereo_mono_change_count" information is updated, is set, the "stereo_mono_change_count" information of the corresponding trak is written. Further, in the case of 2ES, the value of "is_left_first" may be designated as in [Table 5].
TABLE 5

is_left_first | Identification |
---|---|
1 | Monoscopic left image |
0 | Monoscopic right image |

- The scdr boxes 420-1 to 420-n may be configured in a format such as the syntax of the following [Table 6].
TABLE 6 <Syntax of scdr box>

    aligned(8) class StereoscopicCameraAndDisplayInformationBox extends FullBox('scdr', version = 0, 0) {
        unsigned int(16) scdr_ID;
        // stereoscopic display information
        unsigned int(16) expected_display_width;
        unsigned int(16) expected_display_height;
        unsigned int(16) expected_viewing_distance;
        int(16) min_of_disparity;
        int(16) max_of_disparity;
        // stereoscopic camera information
        unsigned int(32) baseline;
        unsigned int(32) focal_length;
        unsigned int(32) convergence_distance;
        unsigned int(1) is_camera_cross;
        unsigned int(7) reserved;
        if (is_camera_cross) {
            unsigned int(32) rotation;
        }
    }

- As many scdr boxes 420-1 to 420-n may be generated as there are combinations of displays and cameras used in the video sequence. The scdr boxes 420-1 to 420-n therefore serve as references for the scdi box 234 throughout the entire file and may include the corresponding identification information scdr_ID.
- The scdi box 234 may be configured in a format such as the syntax of the following [Table 7].
TABLE 7 <Syntax of scdi box>

    aligned(8) class StereoscopicCameraAndDisplayInformationBox extends FullBox('scdi', version = 0, 0) {
        unsigned int(16) item_count;
        for (i=0; i<item_count; i++) {
            unsigned int(16) item_ID;
            unsigned int(1) is_item_ID_ref;
            unsigned int(1) is_scdr_ID_ref;
            unsigned int(6) reserved;
            if (is_scdr_ID_ref) {
                unsigned int(16) ref_scdr_ID;
            } else {
                if (is_item_ID_ref) {
                    unsigned int(16) ref_item_ID;
                } else {
                    // stereoscopic display information
                    unsigned int(1) is_display_safety_info;
                    unsigned int(7) reserved;
                    if (is_display_safety_info) {
                        unsigned int(16) expected_display_width;
                        unsigned int(16) expected_display_height;
                        unsigned int(16) expected_viewing_distance;
                        int(16) min_of_disparity;
                        int(16) max_of_disparity;
                    }
                    // stereoscopic camera information
                    unsigned int(1) is_cam_params;
                    unsigned int(7) reserved;
                    if (is_cam_params) {
                        unsigned int(32) baseline;
                        unsigned int(32) focal_length;
                        unsigned int(32) convergence_distance;
                        unsigned int(1) is_camera_cross;
                        unsigned int(7) reserved;
                        if (is_camera_cross) {
                            unsigned int(32) rotation;
                        }
                    }
                }
            }
        }
    }

- In the above [Table 7], "is_scdr_ID_ref" indicates whether or not to use the display and camera information of the scdr boxes 420-1 to 420-n, and "ref_scdr_ID" indicates the ID of the scdr box that is referenced. For example, if "is_scdr_ID_ref" is 1, the scdi box 234 uses the display and camera information of one scdr box 420-i among the scdr boxes 420-1 to 420-n, and thus can search for the scdr box 420-i whose identification information matches "ref_scdr_ID" and use its information.
- In other words, the svi2 box 232 and the scdi box 234 existing in each stereoscopic image frame may link to the svmi box 410 and the scdr boxes 420-1 to 420-n at the meta box level and redefine the information therein.
- Through this, the overlapping information of the trak box level is collected at the meta box level and is linked or redefined at the trak box level, thereby making it possible to minimize redundant data.
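The link-or-redefine behaviour described above can be sketched on already-parsed values. This is an illustration under stated assumptions rather than a definitive implementation: the class and function names are hypothetical, the parameter dictionaries and their values are made up for the example, and "ref_item_ID" is assumed to point at another item of the same scdi box, which the text does not state explicitly. The precedence (scdr reference first, then item reference, then inline fields) follows the nesting of [Table 7].

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class Svmi:                        # file-level information ([Table 2])
    is_left_first: Optional[bool]
    changes: List[Tuple[int, bool]]


@dataclass
class Svi2:                        # per-trak information ([Table 4])
    is_left_first: bool
    is_update_flag: bool
    changes: List[Tuple[int, bool]]


@dataclass
class ScdiItem:                    # one loop entry of the scdi box ([Table 7])
    item_id: int
    ref_scdr_id: Optional[int] = None   # set when is_scdr_ID_ref is 1
    ref_item_id: Optional[int] = None   # set when is_item_ID_ref is 1
    params: Optional[dict] = None       # inline display/camera fields


def resolve_changes(svmi: Svmi, svi2: Optional[Svi2]):
    """Use the trak's own change list only when the svi2 box redefines it."""
    if svi2 is not None and svi2.is_update_flag:
        return svi2.changes
    return svmi.changes


def resolve_scdi_item(item: ScdiItem,
                      items_by_id: Dict[int, ScdiItem],
                      scdr_by_id: Dict[int, dict],
                      _seen=None):
    """Follow ref_scdr_ID / ref_item_ID links until concrete parameters are found."""
    seen = set() if _seen is None else _seen
    if item.item_id in seen:
        raise ValueError("circular ref_item_ID chain")
    seen.add(item.item_id)
    if item.ref_scdr_id is not None:
        return scdr_by_id[item.ref_scdr_id]        # linked scdr box
    if item.ref_item_id is not None:
        target = items_by_id[item.ref_item_id]     # assumed: same scdi box
        return resolve_scdi_item(target, items_by_id, scdr_by_id, seen)
    return item.params                             # redefined in place
```

Keeping the shared values in one place and storing only references or overrides per trak is what lets the format avoid repeating the same display and camera parameters in every track.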
- As described above, a stereoscopic video file conforming to the stereoscopic video file format of the present invention may be stored in computer-readable form in a recording medium (for example, a CD-ROM, RAM, a floppy disk, a hard disk, a magneto-optical disk, flash memory, etc.).
- A number of exemplary embodiments have been described above. Nevertheless, it will be understood that various modifications may be made. For example, suitable results may be achieved if the described techniques are performed in a different order and/or if components in a described system, architecture, device, or circuit are combined in a different manner and/or replaced or supplemented by other components or their equivalents.
- The present invention relates to a stereoscopic video file format capable of improving bit efficiency and processing efficiency and to a computer-readable recording medium in which the corresponding stereoscopic video file is recorded, and can be used in the image technology industry or the digital technology industry.
Claims (22)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020080008118A KR100924757B1 (en) | 2008-01-25 | 2008-01-25 | Stereoscopic video file format and computer readable recording medium for recording a stereoscopic video file therefore |
KR10-2008-0008118 | 2008-01-25 | ||
PCT/KR2009/000408 WO2009093881A1 (en) | 2008-01-25 | 2009-01-28 | Stereoscopic video file format and computer readable recording medium in which stereoscopic video file is recorded according thereto |
Publications (2)
Publication Number | Publication Date |
---|---|
US20100289876A1 (en) | 2010-11-18 |
US8659642B2 (en) | 2014-02-25 |
Family
ID=40901291
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/864,404 Expired - Fee Related US8659642B2 (en) | 2008-01-25 | 2009-01-28 | Stereoscopic video file format and computer readable recording medium in which stereoscopic video file is recorded according thereto |
Country Status (5)
Country | Link |
---|---|
US (1) | US8659642B2 (en) |
EP (1) | EP2247115A4 (en) |
KR (1) | KR100924757B1 (en) |
CN (1) | CN101978699A (en) |
WO (1) | WO2009093881A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090284583A1 (en) * | 2008-05-19 | 2009-11-19 | Samsung Electronics Co., Ltd. | Apparatus and method for creatihng and displaying media file |
US20110276662A1 (en) * | 2010-05-07 | 2011-11-10 | Samsung Electronics Co., Ltd. | Method of constructing multimedia streaming file format, and method and apparatus for servicing multimedia streaming using the multimedia streaming file format |
US9049414B2 (en) | 2010-08-19 | 2015-06-02 | Samsung Electronics Co., Ltd. | Device for recording and reproducing image, method for recording and reproducing image, and recording medium |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009075495A1 (en) * | 2007-12-10 | 2009-06-18 | Samsung Electronics Co., Ltd. | System and method for generating and reproducing image file including 2d image and 3d stereoscopic image |
CN102340681A (en) * | 2010-07-26 | 2012-02-01 | 深圳市锐取软件技术有限公司 | 3D (three-dimensional) stereo video single-file double-video stream recording method |
CN102404577A (en) * | 2011-12-01 | 2012-04-04 | 无锡太行电子技术有限公司 | Memory method for 3D (three-dimensional) video code |
CN102780897A (en) * | 2012-05-31 | 2012-11-14 | 新奥特(北京)视频技术有限公司 | Method for enabling single file video material to support 3D (three-dimensional) technology |
CN103179421B (en) * | 2013-01-25 | 2015-08-19 | 成都索贝数码科技股份有限公司 | A kind of description of stereoscopic video file and management method |
CN109600601A (en) * | 2018-11-23 | 2019-04-09 | 维沃移动通信有限公司 | A kind of method and terminal device storing 3D rendering |
CN113542907B (en) * | 2020-04-16 | 2022-09-23 | 上海交通大学 | Multimedia data transceiving method, system, processor and player |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5576950A (en) * | 1993-07-28 | 1996-11-19 | Nippon Telegraph And Telephone Corporation | Video image search method and system using the same |
US5642171A (en) * | 1994-06-08 | 1997-06-24 | Dell Usa, L.P. | Method and apparatus for synchronizing audio and video data streams in a multimedia system |
US6005607A (en) * | 1995-06-29 | 1999-12-21 | Matsushita Electric Industrial Co., Ltd. | Stereoscopic computer graphics image generating apparatus and stereoscopic TV apparatus |
US20030156188A1 (en) * | 2002-01-28 | 2003-08-21 | Abrams Thomas Algie | Stereoscopic video |
US20060051057A1 (en) * | 2004-09-03 | 2006-03-09 | Toshiyuki Nakagawa | Data recording/reproducing apparatus and method |
US20070014219A1 (en) * | 2003-10-29 | 2007-01-18 | Sony Corporation | File processing device, file processing method, file processing method program, recording medium containing the file processing method program, imaging device, and recording medium containing file |
US20070078898A1 (en) * | 2005-09-30 | 2007-04-05 | Yahoo! Inc. | Server-based system and method for retrieving tagged portions of media files |
US20080303813A1 (en) * | 2007-06-08 | 2008-12-11 | Do-Young Joung | Method for recording three-dimensional video data and medium recording the same |
US20080303832A1 (en) * | 2007-06-11 | 2008-12-11 | Samsung Electronics Co., Ltd. | Method of generating two-dimensional/three-dimensional convertible stereoscopic image bitstream and method and apparatus for displaying the same |
US20080303893A1 (en) * | 2007-06-11 | 2008-12-11 | Samsung Electronics Co., Ltd. | Method and apparatus for generating header information of stereoscopic image data |
US20090066783A1 (en) * | 2007-09-07 | 2009-03-12 | Samsung Electronics Co., Ltd. | Method and apparatus for generating stereoscopic file |
US20090148070A1 (en) * | 2007-12-10 | 2009-06-11 | Samsung Electronics Co., Ltd. | System and method for generating and reproducing image file including 2d image and 3d stereoscopic image |
US20090199100A1 (en) * | 2008-02-05 | 2009-08-06 | Samsung Electronics Co., Ltd. | Apparatus and method for generating and displaying media files |
US20090208119A1 (en) * | 2008-02-15 | 2009-08-20 | Samsung Electronics Co., Ltd. | Method for generating and playing image files for slideshows |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1501317A4 (en) * | 2002-04-25 | 2006-06-21 | Sharp Kk | Image data creation device, image data reproduction device, and image data recording medium |
JP4529556B2 (en) * | 2004-06-24 | 2010-08-25 | パナソニック株式会社 | Electronic device for generating stereoscopic image file, electronic device for generating three-dimensional image data, image file generation method, three-dimensional image data generation method, and file structure of image file |
KR20050092688A (en) * | 2005-08-31 | 2005-09-22 | 한국정보통신대학교 산학협력단 | Integrated multimedia file format structure, its based multimedia service offer system and method |
KR100871740B1 (en) * | 2006-08-31 | 2008-12-05 | 한국정보통신대학교 산학협력단 | File for multimedia broadcasting contents and system/method for servicing multimedia broadcasting contents by using same |
KR100716142B1 (en) | 2006-09-04 | 2007-05-11 | 주식회사 이시티 | Method for transferring stereoscopic image data |
2008
- 2008-01-25 KR KR1020080008118A patent/KR100924757B1/en active IP Right Grant
2009
- 2009-01-28 CN CN2009801093208A patent/CN101978699A/en active Pending
- 2009-01-28 US US12/864,404 patent/US8659642B2/en not_active Expired - Fee Related
- 2009-01-28 WO PCT/KR2009/000408 patent/WO2009093881A1/en active Application Filing
- 2009-01-28 EP EP09703099A patent/EP2247115A4/en not_active Ceased
Patent Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5576950A (en) * | 1993-07-28 | 1996-11-19 | Nippon Telegraph And Telephone Corporation | Video image search method and system using the same |
US5642171A (en) * | 1994-06-08 | 1997-06-24 | Dell Usa, L.P. | Method and apparatus for synchronizing audio and video data streams in a multimedia system |
US6005607A (en) * | 1995-06-29 | 1999-12-21 | Matsushita Electric Industrial Co., Ltd. | Stereoscopic computer graphics image generating apparatus and stereoscopic TV apparatus |
US6175379B1 (en) * | 1995-06-29 | 2001-01-16 | Matsushita Electric Industrial Co., Ltd. | Stereoscopic CG image generating apparatus and stereoscopic TV apparatus |
US6268880B1 (en) * | 1995-06-29 | 2001-07-31 | Matsushita Electric Industrial Co., Ltd. | Stereoscopic CG image generating apparatus and stereoscopic TV apparatus |
US20030156188A1 (en) * | 2002-01-28 | 2003-08-21 | Abrams Thomas Algie | Stereoscopic video |
US20070014219A1 (en) * | 2003-10-29 | 2007-01-18 | Sony Corporation | File processing device, file processing method, file processing method program, recording medium containing the file processing method program, imaging device, and recording medium containing file |
US20060051057A1 (en) * | 2004-09-03 | 2006-03-09 | Toshiyuki Nakagawa | Data recording/reproducing apparatus and method |
US20070078898A1 (en) * | 2005-09-30 | 2007-04-05 | Yahoo! Inc. | Server-based system and method for retrieving tagged portions of media files |
US20080303813A1 (en) * | 2007-06-08 | 2008-12-11 | Do-Young Joung | Method for recording three-dimensional video data and medium recording the same |
US20080303832A1 (en) * | 2007-06-11 | 2008-12-11 | Samsung Electronics Co., Ltd. | Method of generating two-dimensional/three-dimensional convertible stereoscopic image bitstream and method and apparatus for displaying the same |
US20080303893A1 (en) * | 2007-06-11 | 2008-12-11 | Samsung Electronics Co., Ltd. | Method and apparatus for generating header information of stereoscopic image data |
US20090066783A1 (en) * | 2007-09-07 | 2009-03-12 | Samsung Electronics Co., Ltd. | Method and apparatus for generating stereoscopic file |
US20090148070A1 (en) * | 2007-12-10 | 2009-06-11 | Samsung Electronics Co., Ltd. | System and method for generating and reproducing image file including 2d image and 3d stereoscopic image |
US20090199100A1 (en) * | 2008-02-05 | 2009-08-06 | Samsung Electronics Co., Ltd. | Apparatus and method for generating and displaying media files |
US20090208119A1 (en) * | 2008-02-15 | 2009-08-20 | Samsung Electronics Co., Ltd. | Method for generating and playing image files for slideshows |
Non-Patent Citations (2)
Title |
---|
ISO, ISO/IEC 14496-1, 2001-10-01, ISO, 2nd Edition, pp. 684 * |
ISO, ISO/IEC 14496-12, 2005-10-01, ISO, 2nd Edition, pp. 94 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090284583A1 (en) * | 2008-05-19 | 2009-11-19 | Samsung Electronics Co., Ltd. | Apparatus and method for creatihng and displaying media file |
US8749616B2 (en) * | 2008-05-19 | 2014-06-10 | Samsung Electronics Co., Ltd. | Apparatus and method for creating and displaying media file |
US20110276662A1 (en) * | 2010-05-07 | 2011-11-10 | Samsung Electronics Co., Ltd. | Method of constructing multimedia streaming file format, and method and apparatus for servicing multimedia streaming using the multimedia streaming file format |
US9049414B2 (en) | 2010-08-19 | 2015-06-02 | Samsung Electronics Co., Ltd. | Device for recording and reproducing image, method for recording and reproducing image, and recording medium |
Also Published As
Publication number | Publication date |
---|---|
WO2009093881A1 (en) | 2009-07-30 |
EP2247115A1 (en) | 2010-11-03 |
CN101978699A (en) | 2011-02-16 |
KR20090081933A (en) | 2009-07-29 |
US8659642B2 (en) | 2014-02-25 |
KR100924757B1 (en) | 2009-11-05 |
EP2247115A4 (en) | 2012-11-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8659642B2 (en) | Stereoscopic video file format and computer readable recording medium in which stereoscopic video file is recorded according thereto | |
US8396906B2 (en) | Metadata structure for storing and playing stereoscopic data, and method for storing stereoscopic content file using this metadata | |
US9781403B2 (en) | Method and apparatus for generating stereoscopic file | |
US20100161686A1 (en) | Metadata structure for storing and playing stereoscopic data, and method for storing stereoscopic content file using this metadata | |
US20090199100A1 (en) | Apparatus and method for generating and displaying media files | |
US20120033039A1 (en) | Encoding method, display device, and decoding method | |
CA2713857C (en) | Apparatus and method for generating and displaying media files | |
US8749616B2 (en) | Apparatus and method for creating and displaying media file | |
KR101390810B1 (en) | Method and apparatus for receiving image data stream comprising parameters for displaying local three dimensional image, and method and apparatus for generating image data stream comprising parameters for displaying local three dimensional image | |
KR100939641B1 (en) | Stereoscopic video file format and computer readable recording medium for recording a stereoscopic video file therefore | |
KR100921686B1 (en) | Stereoscopic video file format and computer readable recording medium for recording a stereoscopic video file therefore | |
KR20090034707A (en) | Method and appratus for generating multiview image data stream, and method and apparatus for decoding multiview image data stream |
Legal Events
Code | Title | Description |
---|---|---|
AS | Assignment | Owner name: KOREA ELECTRONICS TECHNOLOGY INSTITUTE, KOREA, REP; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SHIN, HWA SEON;MYUNG, JIN SU;KIM, JE WOO;AND OTHERS;REEL/FRAME:024734/0216; Effective date: 20100722 |
STCF | Information on status: patent grant | Free format text: PATENTED CASE |
FEPP | Fee payment procedure | Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
FPAY | Fee payment | Year of fee payment: 4 |
FEPP | Fee payment procedure | Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
LAPS | Lapse for failure to pay maintenance fees | Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
STCH | Information on status: patent discontinuation | Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
FP | Lapsed due to failure to pay maintenance fee | Effective date: 20220225 |