WO2013136373A1 - Three-dimensional image processing device and three-dimensional image processing method - Google Patents
- Publication number
- WO2013136373A1 (PCT/JP2012/001846)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- viewpoint image
- file
- viewpoint
- unit
- Prior art date
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
- H04N5/765—Interface circuits between an apparatus for recording and another apparatus
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/10—Processing, recording or transmission of stereoscopic or multi-view image signals
- H04N13/106—Processing image signals
- H04N13/172—Processing image signals image signals comprising non-image signal components, e.g. headers or format information
- H04N13/178—Metadata, e.g. disparity information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/10—Processing, recording or transmission of stereoscopic or multi-view image signals
- H04N13/189—Recording image signals; Reproducing recorded image signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N9/00—Details of colour television systems
- H04N9/79—Processing of colour television signals in connection with recording
- H04N9/80—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback
- H04N9/804—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback involving pulse code modulation of the colour picture signal components
- H04N9/8042—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback involving pulse code modulation of the colour picture signal components involving data reduction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N9/00—Details of colour television systems
- H04N9/79—Processing of colour television signals in connection with recording
- H04N9/80—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback
- H04N9/82—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback the individual colour picture signal components being recorded simultaneously only
- H04N9/8205—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback the individual colour picture signal components being recorded simultaneously only involving the multiplexing of an additional signal and the colour video signal
- H04N9/8227—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback the individual colour picture signal components being recorded simultaneously only involving the multiplexing of an additional signal and the colour video signal the additional signal being at least another television signal
Definitions
- the present invention relates to a 3D image processing apparatus and 3D image processing method, and more particularly to a 3D image processing apparatus and 3D image processing method for extracting a 3D still image from a 3D image.
- Patent Document 1 discloses a technique for capturing a screen image of a two-dimensional video and storing the captured screen image as a JPEG file.
- Patent Document 2 discloses a technique for storing a stereoscopic video as a stereoscopic still image file.
- Patent Literatures 1 and 2 have a problem in that they cannot capture a screen image of a stereoscopic video and store it in a form that can be viewed as a stereoscopic still image later.
- the present invention has been made in view of the above problems, and it is an object of the present invention to provide a three-dimensional video processing apparatus and a three-dimensional video processing method capable of storing a screen image of a three-dimensional video so that it can be viewed as a stereoscopic still image later.
- a stereoscopic video processing apparatus includes: a display unit that alternately displays a first viewpoint image and a second viewpoint image that form a stereoscopic video; an input reception unit that receives an input of a capture request from a user; and a file generation unit that generates a multi-picture format compliant file in which the first viewpoint image displayed on the display unit at the timing when the capture request is received by the input reception unit and the second viewpoint image corresponding to that first viewpoint image are stored as a pair of still images.
- the screen image of the stereoscopic video can be stored in a mode that can be viewed as a stereoscopic still image later.
- when a plurality of stereoscopic videos are simultaneously displayed on the display unit, the file generation unit may store, in the file, the pairs of still images acquired from each of the plurality of stereoscopic videos.
- the file generation unit may store, in the file, a combined first viewpoint image obtained by combining the plurality of first viewpoint images stored in the file according to the layout displayed on the display unit, and a combined second viewpoint image obtained by combining the plurality of second viewpoint images according to the layout displayed on the display unit.
- the file generation unit may store, in the file, information specifying a combination of a first viewpoint image and a second viewpoint image constituting the pair of still images.
- the three-dimensional video processing apparatus may further include a file reproduction unit that reads out the first viewpoint image and the second viewpoint image constituting the pair of still images from the file and alternately displays them on the display unit.
- the first viewpoint image may be one of a left viewpoint image and a right viewpoint image having parallax.
- the second viewpoint image may be the other of the left viewpoint image and the right viewpoint image.
- the present invention can be realized not only as such a 3D image processing apparatus and 3D image processing method, but also as an integrated circuit that realizes the functions of the 3D image processing apparatus, or as a program that causes a computer to execute each step of the 3D image processing method. Needless to say, such a program can be distributed via a recording medium such as a CD-ROM or a transmission medium such as the Internet.
- According to the present invention, it is possible to provide a three-dimensional video processing apparatus capable of storing a screen image of a three-dimensional video so that it can be viewed as a three-dimensional still image later.
- FIG. 1 is a block diagram of a 3D image processing apparatus according to the first embodiment.
- FIG. 2 is a diagram for explaining the outline of the operation of the 3D image processing apparatus.
- FIG. 3 is a flowchart of file generation processing according to the first embodiment.
- FIG. 4 is a flowchart of left viewpoint image generation processing according to the first embodiment.
- FIG. 5 is a flowchart of right viewpoint image generation processing according to the first embodiment.
- FIG. 6 is a diagram showing an example of data of parameter 1.
- FIG. 7 is a diagram showing an example of data of parameter 2.
- FIG. 8 is a diagram showing an example of data of parameter 3.
- FIG. 9 is a diagram showing an example of the data layout of the mpo file according to the first embodiment.
- FIG. 10 is a diagram for explaining the outline of the operation of the 3D image processing apparatus.
- FIG. 11 is a flowchart of file generation processing according to the second embodiment.
- FIG. 12 is a flowchart of left viewpoint image generation processing according to the second embodiment.
- FIG. 13 is a flowchart of right viewpoint image generation processing according to the second embodiment.
- FIG. 14 is a flowchart of combined left viewpoint image generation processing according to the second embodiment.
- FIG. 15 is a flowchart of combined right-viewpoint image generation processing according to the second embodiment.
- FIG. 16A is a diagram showing an example of data of parameter 4.
- FIG. 16B is a diagram showing a continuation of the data example of parameter 4 in FIG. 16A.
- FIG. 16C is a diagram showing a continuation of FIG. 16B.
- FIG. 17 is a diagram showing an example of data of parameter 5.
- FIG. 18 is a diagram showing an example of data of parameter 6.
- FIG. 19 is a diagram showing an example of data of parameter 7.
- FIG. 20 is a diagram showing an example of data of parameter 8.
- FIG. 21 is a diagram showing an example of the data layout of the mpo file according to the second embodiment.
- FIG. 1 is a block diagram of a 3D image processing apparatus 100 according to the first embodiment.
- FIG. 2 is a diagram for explaining an outline of the operation of the stereoscopic video processing device 100.
- the stereoscopic video processing apparatus 100 includes a video acquisition unit 110, a video decoding unit 120, a display unit 130, an input reception unit 140, a 2D/3D detection unit 150, a file generation unit 160, a storage unit 170, and a file reproduction unit 180.
- the video acquisition unit 110 acquires a video signal from the outside of the apparatus, and outputs the acquired video signal to the video decoding unit 120.
- the acquisition destination of the video signal is not particularly limited; for example, the video acquisition unit 110 may acquire the video signal through a broadcast wave or a communication network, or may read the video signal from a recording medium. That is, the stereoscopic video processing apparatus 100 according to the first embodiment can be applied to a television receiver or the like. Further, the video acquisition unit 110 may include an imaging device (not shown) and acquire a video signal from the imaging device. That is, the three-dimensional video processing apparatus 100 according to the first embodiment can also be applied to a video camera capable of capturing a three-dimensional video.
- the broadcast wave is not particularly limited; for example, analog broadcast, terrestrial digital broadcast, BS (Broadcast Satellite) broadcast, and CS (Communication Satellite) broadcast are applicable.
- specific examples of the recording medium are not particularly limited, but include, for example, a DVD (Digital Versatile Disc), a BD (Blu-ray Disc), and an SD (Secure Digital) card.
- the video signal acquired by the video acquisition unit 110 may be a signal of a two-dimensional video, or may be a signal of a three-dimensional video (stereoscopic video).
- a stereoscopic video is composed of a left viewpoint video and a right viewpoint video.
- the left viewpoint video and the right viewpoint video shown in FIG. 2 are videos obtained by capturing an object from different viewpoints.
- the left viewpoint video is composed of a plurality of images (left viewpoint images) L1, L2, L3, L4, L5.
- the right viewpoint video is composed of a plurality of images (right viewpoint images) R1, R2, R3, R4, R5.
- the image L1 of the left viewpoint video and the image R1 of the right viewpoint video show the object as seen at (captured at) the same time; such images are referred to as "corresponding images".
- likewise, the images L2 and R2, the images L3 and R3, the images L4 and R4, and the images L5 and R5 are corresponding images. Horizontal parallax is added to each corresponding pair of images.
- the video decoding unit 120 decodes the video signal acquired from the video acquisition unit 110, and outputs the decoded video to the display unit 130.
- the video signal acquired from the video acquisition unit 110 is encoded by a moving picture coding method such as H.264/AVC. That is, at least some of the images included in the video are encoded with reference to other images. Therefore, such an image can only be decoded after the referenced image has been decoded.
- the display unit 130 has a display screen on which the video acquired from the video decoding unit 120 is displayed.
- the display unit is not particularly limited; for example, a liquid crystal display, a plasma display, or an organic EL (ElectroLuminescence) display can be employed.
- the display unit 130 alternately displays the left viewpoint image and the right viewpoint image.
- the stereoscopic video shown in FIG. 2 is displayed in the order of images L1, R1, L2, R2, L3, R3, L4, R4, L5, R5, as indicated by the arrows.
- the viewer can perceive the depth of the stereoscopic video by opening and closing the shutters of the left-eye lens and the right-eye lens of the glasses worn by the viewer (user) in synchronization with the image displayed on the display unit 130.
- the display order of corresponding left viewpoint and right viewpoint images is not limited to left → right as described above, but may be right → left.
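As a minimal sketch (not part of the patent; frames are treated as opaque objects), the frame-sequential display order described above can be expressed as:

```python
def frame_sequential_order(left_frames, right_frames, left_first=True):
    """Interleave corresponding left/right viewpoint frames for
    frame-sequential stereoscopic display (L1, R1, L2, R2, ...).

    Setting left_first=False yields the right -> left order that is
    also permitted (R1, L1, R2, L2, ...).
    """
    if len(left_frames) != len(right_frames):
        raise ValueError("viewpoint videos must have the same frame count")
    order = []
    for l, r in zip(left_frames, right_frames):
        order.extend((l, r) if left_first else (r, l))
    return order
```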
- the input receiving unit 140 is a user interface that receives input of various instructions (requests) from the viewer.
- the input receiving unit 140 according to the first embodiment receives an input of a capture request requesting acquisition of a screen image of a video displayed on the display unit 130 as a still image, and notifies the file generation unit 160 of the input.
- the 2D / 3D detection unit 150 detects whether the image displayed on the display unit 130 is a two-dimensional (2D) image or a three-dimensional (3D) image.
- the detection method is not particularly limited; for example, the 2D/3D detection unit 150 may detect the video as a stereoscopic video when a flag indicating a stereoscopic video (typically, a flag indicating a side-by-side or top-and-bottom format) is added to the video signal.
- the file generation unit 160 acquires an image displayed on the display unit 130 as a still image, generates a multi-picture format compliant file (mpo file) storing the acquired still image, and stores the generated mpo file in the storage unit 170.
- the file generation unit 160 may obtain the still image from, for example, the video decoding unit 120. For this purpose, the video decoding unit 120 needs to hold each decoded image for a predetermined period after outputting it to the display unit 130.
- the file generation unit 160 stores the image displayed on the display unit 130 in the mpo file in such a manner that the image can be viewed as a stereoscopic still image later. For example, as shown in FIG. 2, suppose that the capture request is received by the input reception unit 140 at the timing when the image L3 of the left viewpoint video is displayed on the display unit 130. In this case, the file generation unit 160 stores, in the mpo file, the image L3 displayed on the display unit 130 and the image R3 of the right viewpoint video corresponding to the image L3.
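A sketch of this pairing step; the timestamp-keyed dictionaries are an assumption made for illustration, not the apparatus's actual internal data structure:

```python
def capture_pair(decoded_left, decoded_right, displayed_ts):
    """Return the left viewpoint image being displayed when the capture
    request arrives and the corresponding right viewpoint image, i.e.
    the one captured at the same time. Both dictionaries map timestamps
    to decoded images held by the video decoding unit for a short period."""
    return decoded_left[displayed_ts], decoded_right[displayed_ts]
```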
- the storage unit 170 stores the file generated by the file generation unit 160.
- the specific configuration of the storage unit is not particularly limited; any means capable of storing data can be used, for example, a dynamic random access memory (DRAM), a synchronous dynamic random access memory (SDRAM), a flash memory, a ferroelectric memory, or a hard disk drive (HDD).
- the file reproduction unit 180 reads the file stored in the storage unit 170 and reproduces a still image included in the read file.
- the file reproduction unit 180 reads the mpo file shown in FIG. 2 and causes the display unit 130 to alternately display the images L3 and R3 included in the read mpo file.
- the viewer can view the stereoscopic still image.
- when the video acquisition unit 110 acquires a video that has already been decoded, the video decoding unit 120 can be omitted. Further, when the video input to the video acquisition unit 110 is only stereoscopic video, the 2D/3D detection unit 150 can be omitted.
- the storage unit 170 may be an external storage device provided separately from the stereoscopic video processing device 100.
- the file reproduction unit 180 may be a component not of the 3D image processing apparatus 100 but of a playback apparatus that reads an mpo file stored in the storage unit 170 and reproduces a 3D still image.
- FIG. 3 is a flowchart of file generation processing according to the first embodiment.
- FIG. 4 is a flowchart of left viewpoint image generation processing according to the first embodiment.
- FIG. 5 is a flowchart of right viewpoint image generation processing according to the first embodiment.
- FIGS. 6 to 8 are diagrams showing examples of data stored in the header of the mpo file.
- FIG. 9 is a diagram showing an example of the data layout of the mpo file.
- Hereinafter, the case shown in FIG. 2 will be described as an example.
- the stereoscopic video processing apparatus 100 displays the stereoscopic video acquired by the video acquisition unit 110 and decoded by the video decoding unit 120 on the display unit 130.
- the input receiving unit 140 accepts a capture request from a user (S11).
- the 2D / 3D detection unit 150 detects whether the video displayed on the display unit 130 is a stereoscopic video (S12).
- when a stereoscopic video is displayed on the display unit 130 (YES in S12), the file generation unit 160 executes left viewpoint image generation processing (S13) and right viewpoint image generation processing (S14).
- the file generation unit 160 first compresses the left viewpoint image L3 in JPEG format (S21).
- the compression format (encoding format) is not limited to JPEG; however, unlike the case of video, it must be a format in which a single image can be compressed and expanded (encoded and decoded) on its own.
- the file generation unit 160 adds an APP2 marker to the image data compressed in step S21 (S22).
- the APP2 marker indicates MP format attached information defined by the multi-picture format of the Camera & Imaging Products Industry Association standard.
- the file generation unit 160 stores the parameter 1 shown in FIG. 6 in the APP2 marker added in step S22 (S23). Details of data set in parameter 1 will be described later.
- the file generation unit 160 stores the parameter 2 shown in FIG. 7 in the APP2 marker added in step S22 (S24). Details of data set in parameter 2 will be described later.
- the file generation unit 160 first compresses the right viewpoint image R3 in JPEG format (S31).
- the compression format (coding format) is not limited to JPEG, but it is desirably the same format as that of the left viewpoint image.
- the file generation unit 160 adds an APP2 marker to the image data compressed in step S31 (S32).
- the APP2 marker indicates MP format attached information defined by the multi-picture format of the Camera & Imaging Products Industry Association standard.
- the file generation unit 160 stores the parameter 3 shown in FIG. 8 in the APP2 marker added in step S32 (S33). Details of the data set in the parameter 3 will be described later.
- the file generation unit 160 links the compressed image data generated in the left viewpoint image generation processing (S13) and the right viewpoint image generation processing (S14) (S15). Then, the file generation unit 160 stores the data obtained in step S15 in the storage unit 170 as an mpo file (S16). The data layout of the mpo file generated in step S16 will be described later.
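The linking step (S15) can be sketched as concatenating the two compressed streams; a real mpo file additionally relies on the APP2 markers written earlier to record each image's offset, which this sketch only computes:

```python
def link_images(left_jpeg: bytes, right_jpeg: bytes):
    """Link the compressed left and right viewpoint image data (S15):
    an mpo file stores the individual JPEG streams back to back.
    Returns the linked data and the offset at which the right viewpoint
    image starts (the value recorded in the MP entries)."""
    mpo_data = left_jpeg + right_jpeg
    return mpo_data, len(left_jpeg)  # second image begins where the first ends
```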
- when a stereoscopic video is not displayed (NO in S12), the file generation unit 160 compresses the image L3 displayed on the display unit 130 in JPEG format (S18), and stores the compressed image data in the storage unit 170 as a JPEG file (S19).
- the processes in steps S18 to S19 are the same as those in the related art, and thus detailed description will be omitted.
- the mpo file generated in step S16 has, for example, the data layout shown in FIG. 9. Specifically, in the mpo file of FIG. 9, an APP1 marker, an APP2 marker storing parameters 1 and 2, the compressed image L3, an APP1 marker, an APP2 marker storing parameter 3, and the compressed image R3 are stored in this order.
- the APP1 marker indicates Exif attached information defined by the multi-picture format of the Camera & Imaging Products Industry Association standard.
- Parameter 1 shown in FIG. 6 includes common items and information (a plurality of MP entries) for each image.
- the number of MP entries corresponds to the number of images (two in the example of FIG. 6) stored in the mpo file.
- the common items include the MP format version ("0100" in the example of FIG. 6) indicating the version of the mpo file, the number of recorded images ("2" in the example of FIG. 6) indicating the number of images stored in the mpo file, and an offset specifying the position of the first MP entry (MP entry 1) in the mpo file.
- in MP entry 1, information about the left viewpoint image L3 is stored. Specifically, MP entry 1 includes individual image type management information (comprising a dependent parent image flag, a dependent child image flag, and an MP type), an individual image data offset, a dependent image 1 entry number, and a dependent image 2 entry number.
- in the MP type of MP entry 1, a value ("0x020002") indicating that the image L3 is stored as a three-dimensional still image is set.
- in the individual image data offset, an offset value indicating the position of the image L3 in the mpo file ("0x0", since the image L3 is stored at the head) is set.
- "0x” indicates that the subsequent numerical value is a hexadecimal number.
- the values set in the other tags of MP entry 1 are not particularly different from conventional ones, so their description will be omitted.
- in MP entry 2, information about the right viewpoint image R3 is stored.
- the data format of MP entry 2 is the same as that of MP entry 1. An offset value indicating the position of the image R3 in the mpo file is set in the individual image data offset, and "0x0" is set in the dependent image 1 entry number; otherwise, the same values as in MP entry 1 are set.
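Each MP entry described above can be sketched as a fixed 16-byte record; the field widths and flag bit positions follow the CIPA multi-picture format as best understood here and should be treated as assumptions rather than a normative encoding:

```python
import struct

def pack_mp_entry(mp_type, image_size, data_offset, dep1=0, dep2=0,
                  parent=False, child=False):
    """Pack one MP entry: a 4-byte individual image type management word
    (dependent parent/child flags in the top bits, MP type in the low
    24 bits), the individual image size, the individual image data
    offset, and the dependent image 1 and 2 entry numbers. MP type
    0x020002 marks a stereoscopic image, as in MP entries 1 and 2."""
    attr = mp_type & 0x00FFFFFF
    if parent:
        attr |= 0x80000000  # dependent parent image flag (assumed bit 31)
    if child:
        attr |= 0x40000000  # dependent child image flag (assumed bit 30)
    return struct.pack(">IIIHH", attr, image_size, data_offset, dep1, dep2)
```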
- Parameter 2 shown in FIG. 7 includes an individual image number and a reference viewpoint number.
- the individual image number is a number specifying, in the mpo file, the image to which the APP2 marker including parameter 2 is added (the image L3 in the example of FIG. 9); "0x1" is set in the example of FIG. 7.
- the reference viewpoint number is a number identifying the viewpoint serving as the reference of the pair of still images (images L3 and R3) stored in the mpo file.
- here, the individual image number "0x1" of the image L3 is set. That is, in this mpo file, the left viewpoint is used as the reference viewpoint.
- Parameter 3 shown in FIG. 8 includes an MP format version, an individual image number, and a reference viewpoint number.
- in the MP format version, the same value "0100" as the MP format version included in the common items of FIG. 6 is set.
- the individual image number is a number specifying, in the mpo file, the image to which the APP2 marker including parameter 3 is added (the image R3 in the example of FIG. 9); "0x2" is set in the example of FIG. 8.
- in the reference viewpoint number, the individual image number "0x1" of the image L3 is set.
- in the above description, the left viewpoint image is the first viewpoint image (reference viewpoint image) and the right viewpoint image is the second viewpoint image; however, the right viewpoint image may be used as the first viewpoint image, and the left viewpoint image may be used as the second viewpoint image.
- as described above, the file generation unit 160 stores, in one mpo file, a pair of still images (images L3, R3) captured at the same time (typically, to which the same time stamp is added), sets in the mpo file a value ("0x020002" for the MP type) indicating that the pair of still images is stored as a stereoscopic still image and the same reference viewpoint number ("0x1") for both images, and stores the file in the storage unit 170.
- the file reproduction unit 180 thereby recognizes that the pair of still images included in the mpo file read from the storage unit 170 is a three-dimensional still image, and causes the display unit 130 to alternately display the images L3 and R3. As a result, the user can view a stereoscopic still image.
- in the above description, an example has been described in which the left viewpoint image is displayed on the display unit 130 at the timing when the capture request is input, and the file generation unit 160 acquires the left viewpoint image being displayed and the right viewpoint image displayed immediately thereafter. Conversely, the file generation unit 160 may acquire the right viewpoint image being displayed and the left viewpoint image displayed immediately before it.
- Next, an operation according to Embodiment 2 of the stereoscopic video processing device 100 will be described.
- the configuration of the three-dimensional video processing apparatus 100 is the same as that shown in FIG. 1. Detailed description of operations common to the first embodiment will be omitted, and differences will be mainly described.
- the first three-dimensional video includes a left viewpoint video composed of images L11, L12, L13, ... and a right viewpoint video composed of images R11, R12, R13, ....
- the second three-dimensional video includes a left viewpoint video composed of images L21, L22, L23, ... and a right viewpoint video composed of images R21, R22, R23, ....
- the stereoscopic video processing apparatus 100 acquires the first stereoscopic video and the second stereoscopic video with the video acquisition unit 110, decodes them with the video decoding unit 120, and simultaneously displays the first stereoscopic video and the second stereoscopic video on the display unit 130. That is, the display unit 130 can display a plurality of child screens on its display screen and simultaneously display a different video on each child screen. For example, on the display unit 130 shown in FIG. 10, the left viewpoint image L12 of the first stereoscopic video is displayed on a relatively small child screen on the left side, and the left viewpoint image L22 of the second stereoscopic video is displayed on a relatively large child screen on the right side.
- the file generation unit 160 acquires a pair of still images from each of the first stereoscopic video and the second stereoscopic video displayed on the display unit 130, and stores in the storage unit 170 an mpo file storing those pairs together with a combined left viewpoint image obtained by combining the left viewpoint images of the acquired still images and a combined right viewpoint image obtained by combining the right viewpoint images of the acquired still images. That is, when the number of stereoscopic videos displayed on the display unit 130 is n, 2(n + 1) images are stored in the mpo file according to the second embodiment.
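The image count stated above can be captured in a one-line sketch:

```python
def mpo_image_count(n_videos: int) -> int:
    """Images stored in the Embodiment 2 mpo file: one left and one
    right viewpoint still image per displayed stereoscopic video (2n),
    plus one combined left and one combined right viewpoint image (2),
    i.e. 2(n + 1)."""
    return 2 * (n_videos + 1)
```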
- FIG. 11 is a flowchart of file generation processing according to the second embodiment.
- FIG. 12 is a flowchart of left viewpoint image generation processing according to the second embodiment.
- FIG. 13 is a flowchart of right viewpoint image generation processing according to the second embodiment.
- FIG. 14 is a flowchart of combined left viewpoint image generation processing according to the second embodiment.
- FIG. 15 is a flowchart of combined right-viewpoint image generation processing according to the second embodiment.
- FIGS. 16A to 20 are diagrams showing examples of data stored in the header of the mpo file.
- FIG. 21 is a diagram showing an example of the data layout of the mpo file.
- Hereinafter, the case shown in FIG. 10 will be described as an example.
- the flowchart of FIG. 11 corresponds to the process of steps S13 to S15 of FIG.
- the file generation unit 160 first compresses the left viewpoint image L12 in JPEG format (S51). Next, the file generation unit 160 adds an APP2 marker to the image data compressed in step S51 (S52). Steps S51 to S52 correspond to steps S21 to S22 in FIG. 4.
- when n = 1, that is, when processing the first stereoscopic video (YES in S53), the file generation unit 160 stores the parameter 4 shown in FIGS. 16A to 16C in the APP2 marker added in step S52 (S54). Details of the data set in the parameter 4 will be described later. Otherwise, the file generation unit 160 skips step S54.
- the file generation unit 160 stores the parameter 5 in the APP2 marker added in step S52 (S55).
- the data layout and setting values of the parameter 5 added to the APP2 marker of the left viewpoint image of the first three-dimensional video are the same as those of the parameter 2 shown in FIG. 7.
- the data layout and setting values of the parameter 5 added to the APP2 marker of the left viewpoint image of the second three-dimensional video will be described later with reference to FIG. 17.
- the file generation unit 160 first compresses the right viewpoint image R12 in JPEG format (S61). Next, the file generation unit 160 adds an APP2 marker to the image data compressed in step S61 (S62). Steps S61 to S62 correspond to steps S31 to S32 in FIG. 5.
- the file generation unit 160 stores the parameter 6 in the APP2 marker added in step S62 (S63). Note that the data layout and setting values of the parameter 6 added to the APP2 marker of the right viewpoint image of the first stereoscopic video are the same as those of the parameter 3 shown in FIG. 8. On the other hand, the data layout and setting values of the parameter 6 added to the APP2 marker of the right viewpoint image of the second stereoscopic video will be described later with reference to FIG. 18.
- the file generation unit 160 uses the still images generated in steps S42 to S45 to perform combined left viewpoint image generation processing (S46) and combined right viewpoint image generation processing (S47).
- the file generation unit 160 obtains the position and size, on the display unit 130, of each image displayed on the display unit 130 (S71). In other words, the file generation unit 160 obtains the coordinates and size of the child screen on which the first three-dimensional video is displayed and the coordinates and size of the child screen on which the second three-dimensional video is displayed.
- for example, the file generation unit 160 may use a two-dimensional coordinate system in which the coordinates (x, y) of the upper left corner of the display unit 130 are the origin (0, 0), the horizontal rightward direction of the display unit 130 is the positive x direction, and the vertical downward direction of the display unit 130 is the positive y direction, acquiring the coordinates of the upper left corner of each child screen as the coordinates of that child screen and the height and width of each child screen as the size of that child screen.
- note that the information specifying the layout of each child screen is not limited to the above.
- for example, the file generation unit 160 may acquire the coordinates of the upper-left corner and the lower-right corner of each child screen instead of the above information.
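For illustration, the child-screen layout information described above can be modeled as follows. This is a minimal Python sketch; the class name `ChildScreen` and the concrete coordinate values are assumptions for illustration, not part of the disclosure:

```python
from dataclasses import dataclass

@dataclass
class ChildScreen:
    """Layout of one child screen on the display unit.

    Coordinate system as described above: origin (0, 0) at the
    upper-left corner of the display, x grows rightward, y downward.
    """
    x: int       # x coordinate of the upper-left corner
    y: int       # y coordinate of the upper-left corner
    width: int
    height: int

    @classmethod
    def from_corners(cls, x1: int, y1: int, x2: int, y2: int) -> "ChildScreen":
        # Alternative representation: upper-left and lower-right corners.
        return cls(x1, y1, x2 - x1, y2 - y1)

    def lower_right(self) -> tuple:
        return (self.x + self.width, self.y + self.height)

# Hypothetical child screen of the first stereoscopic video.
screen1 = ChildScreen(0, 0, 960, 540)
# The same layout expressed with the two-corner representation.
screen1b = ChildScreen.from_corners(0, 0, 960, 540)
```

Either representation carries the same layout information, which is why the text treats them as interchangeable.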
- the file generation unit 160 generates a combined left viewpoint image in which image 1 (the left-viewpoint image L12 of the first stereoscopic video) and image 3 (the left-viewpoint image L22 of the second stereoscopic video) are combined according to the child-screen coordinates and sizes acquired in step S71.
- the file generation unit 160 compresses the generated combined left viewpoint image in JPEG format (S72).
- the combined left viewpoint image is a screen image actually displayed on the display unit 130.
- the file generation unit 160 adds the APP2 marker to the image data compressed in step S72 (S73). Then, the file generation unit 160 stores the parameter 7 shown in FIG. 19 in the APP2 marker added in step S73 (S74). Details of the data set in the parameter 7 will be described later.
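The combined left viewpoint image generation of steps S71 to S72 amounts to pasting each child image onto a screen-sized canvas at its child-screen position. A minimal Python sketch follows; the pixel values and image sizes are hypothetical, and a real implementation would operate on the decoded frames and then JPEG-compress the result:

```python
def combine_images(canvas_w, canvas_h, placed):
    """Paste each child image onto a canvas at its child-screen
    coordinates. Images are 2-D lists of pixel values; `placed` is a
    list of (image, x, y) tuples."""
    canvas = [[0] * canvas_w for _ in range(canvas_h)]
    for img, x, y in placed:
        for row, line in enumerate(img):
            for col, px in enumerate(line):
                canvas[y + row][x + col] = px
    return canvas

# Hypothetical 2x2 "images" standing in for the two left viewpoint images.
img1 = [[1, 1], [1, 1]]   # left view of the first stereoscopic video
img3 = [[3, 3], [3, 3]]   # left view of the second stereoscopic video
combined_left = combine_images(4, 2, [(img1, 0, 0), (img3, 2, 0)])
```

The same compositing is then repeated for the right viewpoint images, which is why the combined image equals the screen image actually shown.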
- file generation unit 160 acquires the position and size on display unit 130 of each image displayed on display unit 130 (S81).
- in step S81, the information acquired in step S71 described above may be reused.
- the file generation unit 160 generates a combined right viewpoint image in which image 2 (the right-viewpoint image R12 of the first stereoscopic video) and image 4 (the right-viewpoint image R22 of the second stereoscopic video) are combined according to the child-screen coordinates and sizes. Then, the file generation unit 160 compresses the generated combined right viewpoint image in JPEG format (S82).
- the combined right viewpoint image is a screen image that is actually displayed on the display unit 130.
- the file generation unit 160 adds an APP2 marker to the image data compressed in step S82 (S83). Then, the file generation unit 160 stores parameter 8 shown in FIG. 20 in the APP2 marker added in step S83 (S84). Details of the data set in parameter 8 will be described later.
- the file generation unit 160 links the compressed image data generated in the left viewpoint image generation processing (S42), the right viewpoint image generation processing (S43), the combined left viewpoint image generation processing (S46), and the combined right viewpoint image generation processing (S47) (S48).
- the file generation unit 160 stores the data obtained in step S48 in the storage unit 170 as an mpo file.
- the mpo file generated by the process of FIG. 11 has, for example, the data layout shown in FIG. 21. Specifically, in the mpo file of FIG. 21, an APP1 marker, an APP2 marker storing parameters 4 and 2, the compressed image L12, an APP1 marker, an APP2 marker storing parameter 3, the compressed image R12, an APP1 marker, an APP2 marker storing parameter 5, the compressed image L22, an APP1 marker, an APP2 marker storing parameter 6, the compressed image R22, an APP1 marker, an APP2 marker storing parameter 7, the compressed combined left viewpoint image, an APP1 marker, an APP2 marker storing parameter 8, and the compressed combined right viewpoint image are stored in this order.
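The segment ordering described above can be sketched as a simple concatenation. This is a toy byte-level illustration of the ordering only; the placeholder tags are not the real marker encodings:

```python
def build_mpo(entries):
    """Concatenate (APP1, APP2, image-data) triples in file order,
    mirroring the segment order of FIG. 21."""
    return b"".join(app1 + app2 + jpeg for app1, app2, jpeg in entries)

segments = [
    (b"<APP1>", b"<APP2:params4,2>", b"<L12>"),
    (b"<APP1>", b"<APP2:param3>",    b"<R12>"),
    (b"<APP1>", b"<APP2:param5>",    b"<L22>"),
    (b"<APP1>", b"<APP2:param6>",    b"<R22>"),
    (b"<APP1>", b"<APP2:param7>",    b"<combined_L>"),
    (b"<APP1>", b"<APP2:param8>",    b"<combined_R>"),
]
mpo_bytes = build_mpo(segments)
```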
- parameter 4 shown in FIGS. 16A, 16B, and 16C includes common items and per-image information (a plurality of MP entries).
- the data layout of the common items in FIG. 16A is the same as that in FIG. 6, except that the number of recorded images is set to "6".
- in MP entry 1 shown in FIG. 16A, information on the left-viewpoint image L12 of the first stereoscopic video is stored.
- in MP entry 2, information on the right-viewpoint image R12 of the first stereoscopic video is stored.
- the data layout and setting values of MP entries 1 and 2 are almost the same as those in FIG. However, the same value ("0x1") is set in the subordinate parent image flag of MP entry 1 and in the subordinate child image flag of MP entry 2. That is, two images for which the same value is set in these flags constitute a pair of still images.
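The pairing rule just described — two MP entries carrying the same value in the subordinate parent image flag and the subordinate child image flag form one still-image pair — can be sketched as follows (the dictionary-based entry representation is an assumption for illustration):

```python
def pair_by_flag(entries):
    """Group MP entries into still-image pairs by their subordinate
    flag value: the parent entry and the child entry with the same
    value belong to the same pair."""
    pairs = {}
    for e in entries:
        pairs.setdefault(e["flag"], []).append(e["image"])
    return pairs

mp_entries = [
    {"image": "L12", "flag": 0x1},  # subordinate parent image flag
    {"image": "R12", "flag": 0x1},  # subordinate child image flag
    {"image": "L22", "flag": 0x2},
    {"image": "R22", "flag": 0x2},
]
pairs = pair_by_flag(mp_entries)
```

A reproducing unit that walks these flags recovers each left/right pair without any other metadata.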
- in MP entry 3 shown in FIG. 16B, information on the left-viewpoint image L22 of the second stereoscopic video is stored.
- the data layout of MP entry 3 is common to that of MP entry 1, and the same values as in MP entry 1 are set, except that "0x2" is set in the subordinate parent image flag, the offset to image 3 is set in the individual image data offset, and "0x4" (pointing to MP entry 4) is set in the subordinate image 1 entry number.
- in MP entry 4 shown in FIG. 16B, information on the right-viewpoint image R22 of the second stereoscopic video is stored.
- the data layout of MP entry 4 is common to that of MP entry 2, and the same values as in MP entry 2 are set, except that "0x2" is set in the subordinate child image flag and the offset to image 4 is set in the individual image data offset.
- in MP entry 5, information on the combined left viewpoint image is stored.
- the data layout of MP entry 5 is the same as that of MP entry 1, and the same values as in MP entry 1 are set, except that "0x3" is set in the subordinate parent image flag, the offset to image 5 is set in the individual image data offset, and "0x6" (pointing to MP entry 6) is set in the subordinate image 1 entry number.
- in MP entry 6 shown in FIG. 16C, information on the combined right viewpoint image is stored.
- the data layout of MP entry 6 is common to that of MP entry 2, and the same values as in MP entry 2 are set, except that "0x3" is set in the subordinate child image flag and the offset to image 6 is set in the individual image data offset.
- parameter 5 shown in FIG. 17 is similar to parameter 3 shown in FIG. 8; the same values as in parameter 3 are set, except that the number "0x3" specifying image 3 (the left-viewpoint image L22 of the second stereoscopic video) is set in the individual image number, and the individual image number "0x3" of the image L22 is set in the reference viewpoint number.
- parameter 6 shown in FIG. 18 is similar to parameter 3 shown in FIG. 8; the same values as in parameter 3 are set, except that the number "0x4" specifying image 4 (the right-viewpoint image R22 of the second stereoscopic video) is set in the individual image number, and the individual image number "0x3" of the image L22 is set in the reference viewpoint number.
- as described above, the file generation unit 160 extends Embodiment 1 by acquiring a pair of still images from each of the plurality of stereoscopic videos simultaneously displayed on the display unit 130, and stores the acquired sets of still images in the mpo file. The file generation unit 160 also stores, in the mpo file, information specifying the combination of the left viewpoint image and the right viewpoint image constituting each pair of still images (the subordinate parent image flag and the subordinate child image flag of the MP entries).
- the file reproducing unit 180 then reads out each pair of still images for which the same value is set in the subordinate parent image flag and the subordinate child image flag of the MP entries, and alternately displays the read pair of images on the display unit 130. As a result, the user can view a stereoscopic still image.
- furthermore, the file generation unit 160 stores in the mpo file a combined left viewpoint image in which the plurality of left viewpoint images stored in the mpo file are combined according to the layout displayed on the display unit 130, and a combined right viewpoint image in which the plurality of right viewpoint images stored in the mpo file are combined according to that layout.
- then, by reading out the combined left viewpoint image and the combined right viewpoint image from the mpo file and displaying them alternately on the display unit 130, the file reproducing unit 180 can reproduce, as a stereoscopic still image, the contents displayed on the display unit 130 at the time when the input reception unit 140 received the capture request.
- the file generation unit 160 may execute the processing for NO in step S12 of FIG. 3 for 2D video and the processing for YES in step S12 of FIG. 3 for 3D video.
- each of the above-described devices is a computer system including a microprocessor, a ROM, a RAM, a hard disk unit, a display unit, a keyboard, a mouse and the like.
- a computer program is stored in the RAM or the hard disk unit.
- Each device achieves its function by the microprocessor operating according to the computer program.
- the computer program is configured by combining a plurality of instruction codes indicating instructions to the computer in order to achieve a predetermined function.
- the system LSI is a super multifunctional LSI manufactured by integrating a plurality of components on one chip; more specifically, it is a computer system configured to include a microprocessor, a ROM, a RAM, and the like.
- the RAM stores a computer program.
- the system LSI achieves its functions by the microprocessor operating according to the computer program.
- the IC card or module is a computer system including a microprocessor, a ROM, a RAM, and the like.
- the IC card or module may include the above-described ultra-multifunctional LSI.
- the IC card or module achieves its function by the microprocessor operating according to the computer program. This IC card or this module may have tamper resistance.
- the present invention may be the methods described above.
- it may be a computer program that realizes these methods by a computer, or a digital signal composed of the computer program.
- the present invention may be a computer-readable recording medium on which the computer program or the digital signal is recorded, such as a flexible disk, a hard disk, a CD-ROM, an MO, a DVD, a DVD-ROM, a DVD-RAM, a BD (Blu-ray Disc), or a semiconductor memory. In addition, the digital signal may be recorded on these recording media.
- the present invention may transmit the computer program or the digital signal via a telecommunication line, a wireless or wired communication line, a network typified by the Internet, data broadcasting, or the like.
- the present invention may be a computer system including a microprocessor and a memory, in which the memory stores the computer program and the microprocessor operates according to the computer program.
- the present invention is advantageously used in a stereoscopic video processing apparatus and a stereoscopic video processing method.
- Reference Signs List 100 stereoscopic video processing device 110 video acquisition unit 120 video decoding unit 130 display unit 140 input reception unit 150 2D / 3D detection unit 160 file generation unit 170 storage unit 180 file reproduction unit
Abstract
A three-dimensional image processing device (100) comprises: a display unit (130) for alternately displaying a first viewpoint image and a second viewpoint image that constitute a three-dimensional image; an input receiving unit (140) for receiving a capture request input from the user; and a file generation unit (160) for generating a file conforming to the multi-picture format, in which a first viewpoint image and a second viewpoint image corresponding to the first viewpoint image are stored as a pair of still images, said first and second viewpoint images being displayed on the display unit (130) at a timing when the capture request input is received by the input receiving unit (140).
Description
The present invention relates to a stereoscopic video processing apparatus and a stereoscopic video processing method, and more particularly to a stereoscopic video processing apparatus and a stereoscopic video processing method for extracting a stereoscopic still image from a stereoscopic video.
Patent Document 1 discloses a technique for capturing a screen image of a two-dimensional video and storing the captured screen image as a JPEG file. Patent Document 2 discloses a technique for storing a stereoscopic video as a stereoscopic still image file.
However, the conventional techniques disclosed in Patent Documents 1 and 2 cannot capture a screen image of a stereoscopic video and store it in a form that can later be viewed as a stereoscopic still image.
The present invention has been made in view of the above problem, and an object thereof is to provide a stereoscopic video processing apparatus and a stereoscopic video processing method capable of storing a screen image of a stereoscopic video in a form that can later be viewed as a stereoscopic still image.
A stereoscopic video processing apparatus according to an aspect of the present invention includes: a display unit that alternately displays a first viewpoint image and a second viewpoint image constituting a stereoscopic video; an input reception unit that receives an input of a capture request from a user; and a file generation unit that generates a multi-picture format compliant file in which the first viewpoint image displayed on the display unit at the timing when the capture request is received by the input reception unit, and the second viewpoint image corresponding to that first viewpoint image, are stored as a pair of still images.
With the above configuration, the screen image of the stereoscopic video can be stored in a form that can later be viewed as a stereoscopic still image.
When a plurality of stereoscopic videos are simultaneously displayed on the display unit, the file generation unit may store the pair of still images acquired from each of the plurality of stereoscopic videos in the file.
Furthermore, the file generation unit may store in the file a combined first viewpoint image in which the plurality of first viewpoint images stored in the file are combined according to the layout displayed on the display unit, and a combined second viewpoint image in which the plurality of second viewpoint images stored in the file are combined according to that layout.
This makes it possible to later reproduce, as a stereoscopic still image, the contents displayed on the display unit at the time when the capture request was received by the input reception unit.
Furthermore, the file generation unit may store, in the file, information specifying the combination of the first viewpoint image and the second viewpoint image constituting the pair of still images.
Furthermore, the stereoscopic video processing apparatus may include a file reproduction unit that reads out the first viewpoint image and the second viewpoint image constituting the pair of still images from the file and alternately displays them on the display unit.
As an example, the first viewpoint image may be one of a left viewpoint image and a right viewpoint image having parallax with respect to each other, and the second viewpoint image may be the other of the left viewpoint image and the right viewpoint image.
A stereoscopic video processing method according to an aspect of the present invention includes: a display step of alternately displaying a first viewpoint image and a second viewpoint image constituting a stereoscopic video; an input reception step of receiving an input of a capture request from a user; and a file generation step of generating a multi-picture format compliant file in which the first viewpoint image displayed in the display step at the timing when the capture request is received in the input reception step, and the second viewpoint image corresponding to that first viewpoint image, are stored as a pair of still images.
The present invention can be realized not only as such a stereoscopic video processing apparatus and stereoscopic video processing method, but also as an integrated circuit that realizes the functions of the stereoscopic video processing apparatus, or as a program that causes a computer to execute each step of the stereoscopic video processing method. Needless to say, such a program can be distributed via a recording medium such as a CD-ROM or a transmission medium such as the Internet.
According to the present invention, it is possible to obtain a stereoscopic video processing apparatus capable of storing a screen image of a stereoscopic video in a form that can later be viewed as a stereoscopic still image.
Hereinafter, a stereoscopic video processing apparatus and a stereoscopic video processing method according to the present invention will be described with reference to the drawings. The present invention is specified based on the description of the claims. Therefore, among the components in the following embodiments, components not described in the claims are not necessarily required to achieve the object of the present invention. That is, the following embodiments describe more preferable forms of the present invention. Each drawing is a schematic view and is not necessarily drawn strictly.
(Embodiment 1)
A stereoscopic video processing apparatus according to Embodiment 1 of the present invention will be described with reference to FIGS. 1 and 2. FIG. 1 is a block diagram of the stereoscopic video processing apparatus 100 according to Embodiment 1. FIG. 2 is a diagram for explaining an outline of the operation of the stereoscopic video processing apparatus 100.
As shown in FIG. 1, the stereoscopic video processing apparatus 100 includes a video acquisition unit 110, a video decoding unit 120, a display unit 130, an input reception unit 140, a 2D/3D detection unit 150, a file generation unit 160, a storage unit 170, and a file reproduction unit 180.
The video acquisition unit 110 acquires a video signal from outside the apparatus and outputs the acquired video signal to the video decoding unit 120. The source of the video signal is not particularly limited; for example, the video acquisition unit 110 may acquire the video signal through a broadcast wave or a communication network, or may read the video signal from a recording medium. That is, the stereoscopic video processing apparatus 100 according to Embodiment 1 can be applied to a television receiver or the like. The video acquisition unit 110 may also include an imaging device (not shown) and acquire the video signal from the imaging device. That is, the stereoscopic video processing apparatus 100 according to Embodiment 1 can be applied to a video camera capable of capturing stereoscopic video.
Specific examples of the broadcast wave are not particularly limited; for example, analog broadcasting, terrestrial digital broadcasting, BS (Broadcast Satellite) broadcasting, and CS (Communication Satellite) broadcasting are applicable. Specific examples of the recording medium are also not particularly limited; for example, a DVD (Digital Versatile Disc), a BD (Blu-ray Disc), and an SD (Secure Digital) card are applicable.
The video signal acquired by the video acquisition unit 110 may be a two-dimensional video signal or a three-dimensional (stereoscopic) video signal. As shown in FIG. 2, for example, a stereoscopic video is composed of a left viewpoint video and a right viewpoint video.
The left viewpoint video and the right viewpoint video shown in FIG. 2 are videos obtained by capturing a subject from mutually different viewpoints. The left viewpoint video is composed of a plurality of images (left viewpoint images) L1, L2, L3, L4, L5, and so on. Similarly, the right viewpoint video is composed of a plurality of images (right viewpoint images) R1, R2, R3, R4, R5, and so on. The image L1 of the left viewpoint video and the image R1 of the right viewpoint video are images of the subject viewed (captured) at the same time, and are called "corresponding images". Similarly, the images L2 and R2, L3 and R3, L4 and R4, and L5 and R5 are corresponding images. Horizontal parallax is added to each corresponding pair of images.
The video decoding unit 120 decodes the video signal acquired from the video acquisition unit 110 and outputs the decoded video to the display unit 130. The video signal acquired from the video acquisition unit 110 is encoded by a moving picture coding method such as H.264/AVC. That is, at least some of the images included in the video are encoded with reference to other images. Therefore, an image constituting the video can be decoded only after its reference image has been decoded.
The display unit 130 has a display screen that displays the video acquired from the video decoding unit 120. The specific configuration of the display unit is not particularly limited; for example, a liquid crystal display, a plasma display, or an organic EL (ElectroLuminescence) display can be adopted.
When the video output from the video decoding unit 120 is a stereoscopic video, the display unit 130 alternately displays the left viewpoint images and the right viewpoint images. The stereoscopic video shown in FIG. 2 is displayed in the order of the images L1, R1, L2, R2, L3, R3, L4, R4, L5, R5, as indicated by the arrows. By opening and closing the shutters of the left-eye lens and the right-eye lens of the glasses worn by the viewer (user) in synchronization with the image displayed on the display unit 130, the viewer can perceive the depth of the stereoscopic video. Needless to say, the display order of corresponding left and right viewpoint images is not limited to left → right as described above, and may be right → left.
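The display order indicated by the arrows in FIG. 2 is a plain interleaving of the two viewpoint sequences, as in this sketch (the frame labels stand in for decoded images):

```python
from itertools import chain

def frame_sequence(left, right):
    """Interleave left and right viewpoint images in display order:
    L1, R1, L2, R2, ..."""
    return list(chain.from_iterable(zip(left, right)))

order = frame_sequence(["L1", "L2", "L3"], ["R1", "R2", "R3"])
```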
The input reception unit 140 is a user interface that receives input of various instructions (requests) from the viewer. The input reception unit 140 according to Embodiment 1 receives an input of a capture request that requests acquiring, as a still image, a screen image of the video displayed on the display unit 130, and notifies the file generation unit 160.
The 2D/3D detection unit 150 detects whether the video displayed on the display unit 130 is a two-dimensional (2D) video or a three-dimensional (3D) video. The detection method is not particularly limited; for example, the 2D/3D detection unit 150 may determine that the video is stereoscopic when the video signal contains a flag indicating a stereoscopic video (typically, a flag indicating a side-by-side or top-and-bottom format), or when a pair of images displayed on the display unit 130 has horizontal parallax.
When a capture request is notified from the input reception unit 140, the file generation unit 160 acquires the image displayed on the display unit 130 as a still image, generates a multi-picture format compliant file (mpo file) storing the acquired still image, and stores the generated mpo file in the storage unit 170. The file generation unit 160 may acquire the still image from, for example, the video decoding unit 120. That is, the video decoding unit 120 needs to hold a decoded image for a predetermined period after outputting it to the display unit 130.
In particular, when a stereoscopic video is displayed on the display unit 130, the file generation unit 160 stores the image displayed on the display unit 130 in the mpo file in a form that can later be viewed as a stereoscopic still image. For example, as shown in FIG. 2, suppose that a capture request is received by the input reception unit 140 at the timing when the image L3 of the left viewpoint video is displayed on the display unit 130. In this case, the file generation unit 160 stores, in the mpo file, the image L3 displayed on the display unit 130 and the image R3 of the right viewpoint video corresponding to the image L3.
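The selection performed here by the file generation unit 160 can be sketched as picking the images with the same index (i.e. the same capture time) from the two viewpoint sequences (the frame labels are placeholders for decoded images):

```python
def capture_pair(left_frames, right_frames, displayed_index):
    """Return the still-image pair for the mpo file: the left viewpoint
    image on screen when the capture request arrives, plus the
    corresponding right viewpoint image captured at the same time."""
    return (left_frames[displayed_index], right_frames[displayed_index])

left = ["L1", "L2", "L3", "L4"]
right = ["R1", "R2", "R3", "R4"]
# The capture request arrives while L3 (index 2) is on screen.
pair = capture_pair(left, right, 2)
```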
The storage unit 170 stores the file generated by the file generation unit 160. The specific configuration of the storage unit is not particularly limited; any means capable of storing data, such as a DRAM (Dynamic Random Access Memory), an SDRAM (Synchronous Dynamic Random Access Memory), a flash memory, a ferroelectric memory, or an HDD (Hard Disk Drive), may be used.
The file reproduction unit 180 reads a file stored in the storage unit 170 and reproduces the still images contained in the read file. For example, the file reproduction unit 180 reads the mpo file shown in FIG. 2 and causes the display unit 130 to alternately display the images L3 and R3 contained in the read mpo file. This allows the viewer to view a stereoscopic still image.
In the stereoscopic video processing apparatus 100 shown in FIG. 1, when the video acquisition unit 110 acquires already decoded video, the video decoding unit 120 can be omitted. When the video input to the video acquisition unit 110 is only stereoscopic video, the 2D/3D detection unit 150 can be omitted. The storage unit 170 may be an external storage device provided separately from the stereoscopic video processing apparatus 100. Furthermore, the file reproduction unit 180 may be a component not of the stereoscopic video processing apparatus 100 but of a reproduction apparatus that reads the mpo file stored in the storage unit 170 and reproduces the stereoscopic still image.
Next, the operation of the stereoscopic video processing apparatus 100 according to Embodiment 1 will be described with reference to FIGS. 3 to 9. FIG. 3 is a flowchart of the file generation processing according to Embodiment 1. FIG. 4 is a flowchart of the left viewpoint image generation processing according to Embodiment 1. FIG. 5 is a flowchart of the right viewpoint image generation processing according to Embodiment 1. FIGS. 6 to 8 show examples of data stored in the header of the mpo file. FIG. 9 shows an example of the data layout of the mpo file. The following description takes FIG. 2 as an example.
First, the stereoscopic video processing apparatus 100 displays, on the display unit 130, the stereoscopic video acquired by the video acquisition unit 110 and decoded by the video decoding unit 120. While the stereoscopic video is being displayed (at the timing when the image L3 is shown on the display unit 130), the input reception unit 140 accepts a capture request from the user (S11).
When the capture request is accepted, the 2D/3D detection unit 150 detects whether the video displayed on the display unit 130 is stereoscopic (S12). If stereoscopic video is being displayed (YES in S12), the file generation unit 160 executes the left-viewpoint image generation process (S13) and the right-viewpoint image generation process (S14).
Referring to FIG. 4, the file generation unit 160 first compresses the left-viewpoint image L3 in JPEG format (S21). The compression format (encoding format) is not limited to JPEG; however, unlike the video case, it must be a format in which a single image can be compressed and decompressed (encoded and decoded) on its own.
Next, the file generation unit 160 adds an APP2 marker to the image data compressed in step S21 (S22). Here, the APP2 marker carries the MP format attached information defined by the Multi-Picture Format of the Camera & Imaging Products Association (CIPA) standard.
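As a rough illustration of this step, inserting an APP2 application segment into a JPEG stream can be sketched as follows. This is a minimal sketch, not the embodiment's implementation: the `insert_app2` helper and the toy byte streams are hypothetical, and only the generic JPEG segment framing (marker 0xFFE2, a big-endian length that counts itself, and the "MPF\0" identifier used by the CIPA Multi-Picture Format) is assumed.

```python
import struct

def insert_app2(jpeg_bytes: bytes, payload: bytes) -> bytes:
    """Insert an APP2 (0xFFE2) segment carrying an MP-format payload
    right after the SOI marker of a JPEG stream.  Per JPEG segment
    rules, the 2-byte length field counts itself plus the body."""
    assert jpeg_bytes[:2] == b"\xff\xd8", "not a JPEG stream (missing SOI)"
    body = b"MPF\x00" + payload  # MPF identifier, then the MP data
    segment = b"\xff\xe2" + struct.pack(">H", 2 + len(body)) + body
    return jpeg_bytes[:2] + segment + jpeg_bytes[2:]

# Toy JPEG stream: SOI ... EOI (all image data elided).
tagged = insert_app2(b"\xff\xd8\xff\xd9", b"\x01\x02")
```

The segment is placed immediately after SOI here; a real writer would order it relative to the other APPn segments (e.g., after APP1/Exif, as in the layouts of FIGS. 9 and 21).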
Next, the file generation unit 160 stores parameter 1, shown in FIG. 6, in the APP2 marker added in step S22 (S23). The details of the data set in parameter 1 will be described later.
Next, the file generation unit 160 stores parameter 2, shown in FIG. 7, in the APP2 marker added in step S22 (S24). The details of the data set in parameter 2 will be described later.
Referring to FIG. 5, the file generation unit 160 first compresses the right-viewpoint image R3 in JPEG format (S31). The compression format (encoding format) is not limited to JPEG, but it is desirable that it be the same format as that of the left-viewpoint image.
Next, the file generation unit 160 adds an APP2 marker to the image data compressed in step S31 (S32). As above, the APP2 marker carries the MP format attached information defined by the Multi-Picture Format of the Camera & Imaging Products Association (CIPA) standard.
Next, the file generation unit 160 stores parameter 3, shown in FIG. 8, in the APP2 marker added in step S32 (S33). The details of the data set in parameter 3 will be described later.
Returning to the process of FIG. 3, the file generation unit 160 concatenates the compressed image data generated by the left-viewpoint image generation process (S13) and the right-viewpoint image generation process (S14) (S15). The file generation unit 160 then saves the data obtained in step S15 in the storage unit 170 as an mpo file (S16). The data layout of the mpo file generated in step S16 will be described later.
On the other hand, if a two-dimensional video is being displayed on the display unit 130 (NO in S12), the file generation unit 160 compresses the image L3 displayed on the display unit 130 in JPEG format (S18) and saves the compressed image data in the storage unit 170 as a JPEG file (S19). The processing of steps S18 and S19 does not differ from conventional processing, so a detailed description is omitted.
The mpo file generated in step S16 has, for example, the data layout shown in FIG. 9. Specifically, the mpo file of FIG. 9 stores, in this order: an APP1 marker; an APP2 marker storing parameters 1 and 2; the compressed image L3; an APP1 marker; an APP2 marker storing parameter 3; and the compressed image R3. Here, the APP1 marker carries the Exif attached information defined by the Multi-Picture Format of the Camera & Imaging Products Association (CIPA) standard.
Parameter 1, shown in FIG. 6, includes common items and per-image information (a plurality of MP entries). The number of MP entries corresponds to the number of images stored in the mpo file (two in the example of FIG. 6).
The common items include: the MP format version, which indicates the version of the mpo file ("0100" in the example of FIG. 6); the number of recorded images, which indicates how many images are stored in the mpo file ("2" in the example of FIG. 6); and an MP entry field that stores information (an offset) for locating the first MP entry (MP entry 1) within the mpo file.
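The common items above can be pictured with a small data holder. This is purely illustrative: the field names mirror the text of this description, not the exact on-disk TIFF/IFD layout of the standard, and the class itself is hypothetical.

```python
from dataclasses import dataclass

@dataclass
class MPIndexCommon:
    """Illustrative holder for the common items of parameter 1."""
    mp_format_version: str = "0100"  # version of the mpo file
    number_of_images: int = 2        # images recorded in the file
    first_mp_entry_offset: int = 0   # offset locating MP entry 1

# The values of the FIG. 6 example: version "0100", two recorded images.
common = MPIndexCommon()
```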
MP entry 1 stores information about the left-viewpoint image L3. Specifically, MP entry 1 includes: individual image type management information containing a dependent parent image flag, a dependent child image flag, and an MP type; an individual image data offset; a dependent image 1 entry number; and a dependent image 2 entry number.
The MP type of MP entry 1 is set to a value ("0x020002") indicating that the image L3 is saved as a stereoscopic still image. The individual image data offset is set to an offset value indicating the position of the image L3 within the mpo file ("0x0", since the image L3 is stored at the head of the file). Note that the prefix "0x" indicates that the following value is hexadecimal. The values set in the other tags of MP entry 1 do not differ from conventional ones, so their description is omitted.
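As a sketch, an MP entry with the values described above might be modeled as follows. The class and field names are hypothetical (they follow this description, not the standard's bit packing), and the zero flag/entry-number values are placeholders since FIG. 6 does not specify them for Embodiment 1.

```python
from dataclasses import dataclass

# Value indicating the image is saved as a stereoscopic still image.
STEREO_STILL_MP_TYPE = 0x020002

@dataclass
class MPEntry:
    """Illustrative per-image MP entry (names follow the text)."""
    dependent_parent_flag: int
    dependent_child_flag: int
    mp_type: int
    image_data_offset: int       # byte position of the image in the file
    dependent_image1_entry: int
    dependent_image2_entry: int

# MP entry 1 of FIG. 6: image L3 stored at the head of the mpo file,
# so its individual image data offset is 0x0.
entry1 = MPEntry(0, 0, STEREO_STILL_MP_TYPE, 0x0, 0, 0)
```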
MP entry 2 stores information about the right-viewpoint image R3. The data format of MP entry 2 is the same as that of MP entry 1, and the same values as in MP entry 1 are set, except that the individual image data offset is set to an offset value indicating the position of the image R3 within the mpo file and the dependent image 1 entry number is set to "0x0".
Parameter 2, shown in FIG. 7, includes an individual image number and a reference viewpoint number. The individual image number identifies, within the mpo file, the image to which the APP2 marker containing parameter 2 is attached (the image L3 in the example of FIG. 9); it is set to "0x1" in the example of FIG. 7. The reference viewpoint number identifies the viewpoint serving as the reference for the pair of still images (images L3 and R3) stored in the mpo file; in the example of FIG. 7, it is set to the individual image number of the image L3, "0x1". That is, this mpo file uses the left viewpoint as the reference viewpoint.
Parameter 3, shown in FIG. 8, includes an MP format version, an individual image number, and a reference viewpoint number. The MP format version is set to "0100", the same value as the MP format version included in the common items of FIG. 6. The individual image number identifies, within the mpo file, the image to which the APP2 marker containing parameter 3 is attached (the image R3 in the example of FIG. 9); it is set to "0x2" in the example of FIG. 8. The reference viewpoint number is set to the individual image number of the image L3, "0x1".
FIGS. 7 and 8 show an example in which the left-viewpoint image is the first viewpoint image (reference viewpoint image) and the right-viewpoint image is the second viewpoint image, but the present invention is not limited to this; the right-viewpoint image may be the first viewpoint image (reference viewpoint image) and the left-viewpoint image the second viewpoint image.
As described above, the file generation unit 160 according to Embodiment 1 stores, in a single mpo file, a pair of still images (images L3 and R3) that were captured at the same time (typically, with the same time stamp attached) and that have parallax with respect to each other. It also sets, in that mpo file, a value indicating that the pair of still images is saved as a stereoscopic still image ("0x020002" in the MP type) and the same reference viewpoint number ("0x1") for both images, and saves the file in the storage unit 170.
The file reproduction unit 180 then recognizes that the pair of still images included in the mpo file read from the storage unit 170 is a stereoscopic still image, and causes the display unit 130 to display the images L3 and R3 alternately. This allows the user to view the stereoscopic still image.
In Embodiment 1, an example was described in which the left-viewpoint image is displayed on the display unit 130 at the timing when the capture request is input, and the file generation unit 160 acquires the left-viewpoint image being displayed together with the right-viewpoint image to be displayed immediately after it. It goes without saying, however, that the present invention is also applicable when the right-viewpoint image is displayed on the display unit 130 at the timing when the capture request is input. In that case, the file generation unit 160 may acquire the right-viewpoint image being displayed together with the left-viewpoint image that was displayed immediately before it.
(Embodiment 2)
Next, the operation of the stereoscopic video processing apparatus 100 according to Embodiment 2 will be described. The configuration of the stereoscopic video processing apparatus 100 is the same as in FIG. 1, so its description is not repeated. Detailed descriptions of operations common to Embodiment 1 are also omitted, and the description focuses on the differences.
First, an overview of the operation of the stereoscopic video processing apparatus 100 according to Embodiment 2 will be described with reference to FIG. 10. In the example of FIG. 10, a plurality of stereoscopic videos are input to the stereoscopic video processing apparatus 100. The first stereoscopic video includes a left-viewpoint video composed of images L11, L12, L13, ... and a right-viewpoint video composed of images R11, R12, R13, .... Similarly, the second stereoscopic video includes a left-viewpoint video composed of images L21, L22, L23, ... and a right-viewpoint video composed of images R21, R22, R23, ....
The stereoscopic video processing apparatus 100 acquires the first and second stereoscopic videos with the video acquisition unit 110, decodes each of them with the video decoding unit 120, and displays both simultaneously on the display unit 130. That is, the display unit 130 can display a plurality of child screens on its display screen and can simultaneously display a different video on each child screen. For example, the display unit 130 shown in FIG. 10 displays two child screens of different sizes: the left-viewpoint image L12 of the first stereoscopic video is displayed on the relatively small child screen on the left, and the left-viewpoint image L22 of the second stereoscopic video is displayed on the relatively large child screen on the right.
When a capture request is accepted by the input reception unit 140, the file generation unit 160 stores in the storage unit 170 an mpo file containing: a pair of still images acquired from each of the first and second stereoscopic videos displayed on the display unit 130; a combined left-viewpoint image obtained by combining the left-viewpoint images among the acquired still images; and a combined right-viewpoint image obtained by combining the right-viewpoint images among the acquired still images. That is, if n is the number of stereoscopic videos displayed on the display unit 130, the mpo file according to Embodiment 2 stores 2(n + 1) images.
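The 2(n + 1) count follows directly from the file's contents, and can be checked with a short sketch (the enumeration function and its labels are illustrative only):

```python
def mpo_image_list(n):
    """Enumerate the images stored in the Embodiment-2 mpo file for n
    displayed stereoscopic videos: one left/right still pair per video,
    plus one combined left image and one combined right image."""
    images = []
    for i in range(1, n + 1):
        images.append(f"L{i}")  # left-viewpoint still of video i
        images.append(f"R{i}")  # right-viewpoint still of video i
    images += ["combined_L", "combined_R"]
    return images

# For the two videos of FIG. 10, the file holds 2 * (2 + 1) = 6 images,
# matching the "number of recorded images" of 6 in FIG. 16A.
count = len(mpo_image_list(2))
```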
The operation of the stereoscopic video processing apparatus 100 according to Embodiment 2 will be described with reference to FIGS. 11 to 21. FIG. 11 is a flowchart of the file generation process according to Embodiment 2. FIG. 12 is a flowchart of the left-viewpoint image generation process according to Embodiment 2. FIG. 13 is a flowchart of the right-viewpoint image generation process according to Embodiment 2. FIG. 14 is a flowchart of the combined left-viewpoint image generation process according to Embodiment 2. FIG. 15 is a flowchart of the combined right-viewpoint image generation process according to Embodiment 2. FIGS. 16A to 20 show examples of the data stored in the header of the mpo file. FIG. 21 shows an example of the data layout of the mpo file. The following description uses FIG. 10 as an example.
The flowchart of FIG. 11 corresponds to the processing of steps S13 to S15 of FIG. 3. First, the file generation unit 160 sets n = 1 (S41) and executes the left-viewpoint image generation process (S42) and the right-viewpoint image generation process (S43) for the first stereoscopic video.
Referring to FIG. 12, the file generation unit 160 first compresses the left-viewpoint image L12 in JPEG format (S51). Next, the file generation unit 160 adds an APP2 marker to the image data compressed in step S51 (S52). Steps S51 and S52 correspond to steps S21 and S22 of FIG. 4.
Next, when n = 1, that is, when the first stereoscopic video is being processed (YES in S53), the file generation unit 160 stores parameter 4, shown in FIGS. 16A to 16C, in the APP2 marker added in step S52 (S54). The details of the data set in parameter 4 will be described later. On the other hand, when the second stereoscopic video (described later) is being processed (NO in S53), the file generation unit 160 skips step S54.
Next, the file generation unit 160 stores parameter 5 in the APP2 marker added in step S52 (S55). The data layout and set values of the parameter 5 added to the APP2 marker of the left-viewpoint image of the first stereoscopic video are the same as those of parameter 2 shown in FIG. 7, so it is denoted as parameter 2 in FIG. 21. The data layout and set values of the parameter 5 added to the APP2 marker of the left-viewpoint image of the second stereoscopic video (described later) will be described later with reference to FIG. 17.
Referring to FIG. 13, the file generation unit 160 first compresses the right-viewpoint image R12 in JPEG format (S61). Next, the file generation unit 160 adds an APP2 marker to the image data compressed in step S61 (S62). Steps S61 and S62 correspond to steps S31 and S32 of FIG. 5.
Next, the file generation unit 160 stores parameter 6 in the APP2 marker added in step S62 (S63). The data layout and set values of the parameter 6 added to the APP2 marker of the first right-viewpoint image are the same as those of parameter 3 shown in FIG. 8, so it is denoted as parameter 3 in FIG. 21. The data layout and set values of the parameter 6 added to the APP2 marker of the right-viewpoint image of the second stereoscopic video (described later) will be described later with reference to FIG. 18.
Returning to the process of FIG. 11, the file generation unit 160 determines whether image generation has been completed for all videos (S44). In the above example, the second stereoscopic video has not yet been processed (NO in S44), so the file generation unit 160 sets n = n + 1 (S45) and executes the left-viewpoint image generation process (S42) and the right-viewpoint image generation process (S43) for the second stereoscopic video. That is, the file generation unit 160 executes the left-viewpoint image generation process (S42) on the left-viewpoint image L22 of the second stereoscopic video and the right-viewpoint image generation process (S43) on the right-viewpoint image R22 of the second stereoscopic video.
When processing of the second stereoscopic video is completed (YES in S44), the file generation unit 160 executes the combined left-viewpoint image generation process (S46) and the combined right-viewpoint image generation process (S47) using the still images generated in steps S42 to S45.
Referring to FIG. 14, the file generation unit 160 acquires the position and size, on the display unit 130, of each image displayed on the display unit 130 (S71). In other words, the file generation unit 160 acquires the coordinates and size of the child screen displaying the first stereoscopic video on the display unit 130 and the coordinates and size of the child screen displaying the second stereoscopic video on the display unit 130.
For example, in a two-dimensional coordinate system in which the coordinates (x, y) of the upper-left corner of the display unit 130 are the origin (0, 0), the horizontally rightward direction of the display unit 130 is the positive x direction, and the vertically downward direction of the display unit 130 is the positive y direction, the file generation unit 160 may acquire the coordinates of the upper-left corner of each child screen as that child screen's coordinates, and the height and width of each child screen as its size. However, the information specifying the layout of each child screen is not limited to the above. For example, the file generation unit 160 may instead acquire the coordinates of the upper-left corner and the coordinates of the lower-right corner of each child screen.
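The two equivalent layout representations mentioned above (corner plus size, versus two opposite corners) can be sketched as follows. The `ChildScreen` type and the pixel values are hypothetical; only the coordinate convention described in the text (origin at the display's upper-left, x rightward, y downward) is taken from the description.

```python
from typing import NamedTuple

class ChildScreen(NamedTuple):
    """Child-screen layout: upper-left corner plus width and height,
    in the display coordinate system described in the text."""
    x: int  # upper-left corner, x coordinate
    y: int  # upper-left corner, y coordinate
    w: int  # width
    h: int  # height

    def lower_right(self):
        # The equivalent alternative representation: the layout is just
        # as fully specified by the upper-left and lower-right corners.
        return (self.x + self.w, self.y + self.h)

# Hypothetical layout in the spirit of FIG. 10: a small child screen on
# the left, a larger one on the right (all numbers illustrative).
small = ChildScreen(0, 180, 640, 360)
large = ChildScreen(640, 0, 1280, 720)
```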
Next, the file generation unit 160 generates a combined left-viewpoint image by combining image 1 (the left-viewpoint image L12 of the first stereoscopic video) and image 3 (the left-viewpoint image L22 of the second stereoscopic video) according to the coordinates and size of each child screen acquired in step S71. The file generation unit 160 then compresses the generated combined left-viewpoint image in JPEG format (S72). The combined left-viewpoint image is the screen image actually displayed on the display unit 130.
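The combining step can be sketched with plain 2D arrays standing in for decoded images. This is a minimal sketch: the `composite` helper is hypothetical, and it only pastes each sub-image at its child screen's upper-left corner; a real implementation would also scale each sub-image to its child screen's size before pasting.

```python
def composite(canvas_w, canvas_h, placements):
    """Build a screen image by pasting each sub-image at its child
    screen's upper-left (x, y), as in the combined-image generation of
    step S72.  Images are 2D lists of pixel values; the background is 0."""
    canvas = [[0] * canvas_w for _ in range(canvas_h)]
    for img, (x, y) in placements:
        for row, line in enumerate(img):
            for col, px in enumerate(line):
                canvas[y + row][x + col] = px
    return canvas

# Two 1x2 "images" placed side by side on a 4x1 canvas, mimicking two
# child screens laid out left and right.
combined = composite(4, 1, [([[1, 1]], (0, 0)), ([[2, 2]], (2, 0))])
```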
Next, the file generation unit 160 adds an APP2 marker to the image data compressed in step S72 (S73). The file generation unit 160 then stores parameter 7, shown in FIG. 19, in the APP2 marker added in step S73 (S74). The details of the data set in parameter 7 will be described later.
Referring to FIG. 15, the file generation unit 160 acquires the position and size, on the display unit 130, of each image displayed on the display unit 130 (S81). In step S81, the information acquired in step S71 described above may be reused.
Next, the file generation unit 160 generates a combined right-viewpoint image by combining image 2 (the right-viewpoint image R12 of the first stereoscopic video) and image 4 (the right-viewpoint image R22 of the second stereoscopic video) according to the coordinates and size of each child screen acquired in step S81. The file generation unit 160 then compresses the generated combined right-viewpoint image in JPEG format (S82). The combined right-viewpoint image is the screen image actually displayed on the display unit 130.
Next, the file generation unit 160 adds an APP2 marker to the image data compressed in step S82 (S83). The file generation unit 160 then stores parameter 8, shown in FIG. 20, in the APP2 marker added in step S83 (S84). The details of the data set in parameter 8 will be described later.
Returning to the process of FIG. 11, the file generation unit 160 concatenates the compressed image data generated by the left-viewpoint image generation process (S42), the right-viewpoint image generation process (S43), the combined left-viewpoint image generation process (S46), and the combined right-viewpoint image generation process (S47) (S48). The file generation unit 160 then saves the data obtained in step S48 in the storage unit 170 as an mpo file.
The mpo file generated by the process of FIG. 11 has, for example, the data layout shown in FIG. 21. Specifically, the mpo file of FIG. 21 stores, in this order: an APP1 marker; an APP2 marker storing parameters 4 and 2; the compressed image L12; an APP1 marker; an APP2 marker storing parameter 3; the compressed image R12; an APP1 marker; an APP2 marker storing parameter 5; the compressed image L22; an APP1 marker; an APP2 marker storing parameter 6; the compressed image R22; an APP1 marker; an APP2 marker storing parameter 7; the compressed combined left-viewpoint image; an APP1 marker; an APP2 marker storing parameter 8; and the compressed combined right-viewpoint image.
Parameter 4, shown in FIGS. 16A, 16B, and 16C, includes common items and per-image information (a plurality of MP entries). The data layout of the common items of FIG. 16A is the same as in FIG. 6, differing only in that the number of recorded images is set to "6".
MP entry 1 shown in FIG. 16A stores information about the left-viewpoint image L12 of the first stereoscopic video. MP entry 2 stores information about the right-viewpoint image R12 of the first stereoscopic video.
The data layouts and set values of MP entries 1 and 2 are almost the same as in FIG. 6. However, the same value ("0x1") is set in the dependent parent image flag of MP entry 1 and the dependent child image flag of MP entry 2. That is, two images for which the same value is set in these flags constitute a pair of still images.
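The flag-based pairing can be sketched as follows. The `find_pairs` helper and its tuple encoding are hypothetical; only the rule from the text is implemented: a parent entry and a child entry carrying the same nonzero flag value form one stereo pair.

```python
def find_pairs(entries):
    """Match each dependent-parent entry to the dependent-child entry
    carrying the same flag value, yielding the (left, right) index
    pairs a player would display alternately.  `entries` is a list of
    (parent_flag, child_flag) tuples in MP-entry order; 0 means unset."""
    parents = {p: i for i, (p, c) in enumerate(entries) if p}
    return [(parents[c], i) for i, (p, c) in enumerate(entries) if c]

# MP entries 1-6 of FIGS. 16A-16C: flag values 0x1, 0x2, and 0x3 tie
# together L12/R12, L22/R22, and the combined left/right images.
pairs = find_pairs([(0x1, 0), (0, 0x1),
                    (0x2, 0), (0, 0x2),
                    (0x3, 0), (0, 0x3)])
```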
MP entry 3 shown in FIG. 16B stores information about the left-viewpoint image L22 of the second stereoscopic video. The data layout of MP entry 3 is the same as that of MP entry 1, and the same values as in MP entry 1 are set, except that the dependent parent image flag is set to "0x2", the individual image data offset is set to the offset to image 3, and the dependent image 1 entry number is set to "0x4" (pointing to MP entry 4).
MP entry 4 shown in FIG. 16B stores information about the right-viewpoint image R22 of the second stereoscopic video. The data layout of MP entry 4 is the same as that of MP entry 2, and the same values as in MP entry 2 are set, except that the dependent child image flag is set to "0x2" and the individual image data offset is set to the offset to image 4.
MP entry 5 shown in FIG. 16C stores information about the combined left-viewpoint image. The data layout of MP entry 5 is the same as that of MP entry 1, and the same values as in MP entry 1 are set, except that the dependent parent image flag is set to "0x3", the individual image data offset is set to the offset to image 5, and the dependent image 1 entry number is set to "0x6" (pointing to MP entry 6).
MP entry 6 shown in FIG. 16C stores information about the combined right-viewpoint image. The data layout of MP entry 6 is the same as that of MP entry 2, and the same values as in MP entry 2 are set, except that the dependent child image flag is set to "0x3" and the individual image data offset is set to the offset to image 6.
The data layout of parameter 5 shown in FIG. 17 is the same as that of parameter 3 shown in FIG. 8, and the same values as in parameter 3 are set, except that the individual image number is set to "0x3", the number identifying image 3 (the left-viewpoint image L22 of the second stereoscopic video), and the reference viewpoint number is set to the individual image number of the image L22, "0x3".
The data layout of parameter 6 shown in FIG. 18 is the same as that of parameter 3 shown in FIG. 8, and the same values as in parameter 3 are set, except that the individual image number is set to "0x4", the number identifying image 4 (the right-viewpoint image R22 of the second stereoscopic video), and the reference viewpoint number is set to the individual image number of the image L22, "0x3".
The data layouts of parameters 7 and 8 shown in FIGS. 19 and 20 are common to that of parameter 3 shown in FIG. 8, and the same values as in parameter 3 are set, except that “0xFFFFFFFF” (a value indicating a combined image) is set in the individual image number and the reference viewpoint number.
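As an illustration only (not part of the patent specification), the MP-entry fields described above could be modeled as follows; the field names, flag values, and byte offsets are hypothetical simplifications of the Multi-Picture Format index entries:

```python
from dataclasses import dataclass

# Hypothetical, simplified model of the MP index entries described above.
# The value 0x3 marks the combined (joined) images; the dependent image 1
# entry number links a parent image (left view) to its child image (right view).

@dataclass
class MPEntry:
    entry_number: int
    dependent_parent_flag: int   # e.g. 0x3 for the combined left-view image
    dependent_child_flag: int    # e.g. 0x3 for the combined right-view image
    image_data_offset: int       # byte offset to the individual image data
    dependent_image1_entry: int  # entry number of the paired image (0 if none)

# MP entry 5: combined left viewpoint image, pointing at entry 6 as its pair
entry5 = MPEntry(5, dependent_parent_flag=0x3, dependent_child_flag=0x0,
                 image_data_offset=0x4000, dependent_image1_entry=6)
# MP entry 6: combined right viewpoint image
entry6 = MPEntry(6, dependent_parent_flag=0x0, dependent_child_flag=0x3,
                 image_data_offset=0x8000, dependent_image1_entry=0)

def paired_entry(entries, parent):
    """Resolve the child entry referenced by a parent's entry-number field."""
    return next(e for e in entries
                if e.entry_number == parent.dependent_image1_entry)
```

Under these assumptions, resolving the pair of entry 5 yields entry 6, mirroring the linkage described for the combined left and right viewpoint images.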
As described above, the file generation unit 160 according to the second embodiment extends the first embodiment: it acquires a pair of still images from each of the plurality of stereoscopic videos simultaneously displayed on the display unit 130, and stores the acquired sets of paired still images in the MPO file. The file generation unit 160 also stores, in the MPO file, information identifying the combination of the left viewpoint image and the right viewpoint image constituting each pair of still images (the dependent parent image flag and the dependent child image flag of the MP entry).
Then, from among the plurality of still images included in the MPO file read from the storage unit 170, the file reproduction unit 180 reads out a pair of still images in which the same value is set in the dependent parent image flag and the dependent child image flag of the MP entries, and causes the display unit 130 to display the read pair of images alternately. This makes it possible for the user to view a stereoscopic still image.
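The pairing rule described above, selecting the two images whose dependent parent and child flags carry the same value, can be sketched as follows; the image names and flag values are illustrative, not taken from the specification:

```python
# Hypothetical sketch of the pairing rule: a left/right pair is the pair of
# images whose dependent parent flag and dependent child flag share a value.

def find_stereo_pair(images, flag_value):
    """images: list of (name, parent_flag, child_flag) tuples."""
    left = next(n for n, p, c in images if p == flag_value)
    right = next(n for n, p, c in images if c == flag_value)
    return left, right

images = [
    ("L21", 0x1, 0x0), ("R21", 0x0, 0x1),   # pair from the first video
    ("L22", 0x2, 0x0), ("R22", 0x0, 0x2),   # pair from the second video
]
assert find_stereo_pair(images, 0x2) == ("L22", "R22")
```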
Furthermore, the file generation unit 160 stores, in the MPO file, a combined left viewpoint image obtained by combining the plurality of left viewpoint images stored in the MPO file in accordance with the layout displayed on the display unit 130, and a combined right viewpoint image obtained by combining the plurality of right viewpoint images stored in the MPO file in accordance with that layout.
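As one minimal illustration (not the patented implementation), combining two per-video viewpoint images into a single image according to a side-by-side display layout could look like the following; images are modeled as 2-D lists of pixel values:

```python
# Illustrative sketch: join two same-height images side by side, mirroring a
# two-panel display layout such as two stereoscopic videos shown together.

def combine_side_by_side(img_a, img_b):
    assert len(img_a) == len(img_b), "this layout assumes equal heights"
    return [row_a + row_b for row_a, row_b in zip(img_a, img_b)]

left1 = [[1, 1], [1, 1]]   # left view of the first video (dummy pixels)
left2 = [[2, 2], [2, 2]]   # left view of the second video (dummy pixels)
combined_left = combine_side_by_side(left1, left2)
assert combined_left == [[1, 1, 2, 2], [1, 1, 2, 2]]
```

The same operation applied to the right viewpoint images would yield the combined right viewpoint image; an actual implementation would follow whatever layout the display unit uses.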
Then, by reading out the combined left viewpoint image and the combined right viewpoint image from the MPO file and causing the display unit 130 to display them alternately, the file reproduction unit 180 can reproduce, as a stereoscopic still image, the content displayed on the display unit 130 at the time the capture request was received by the input reception unit 140.
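For illustration only, the alternating display of the two combined images can be sketched as a simple frame sequence; the image names are hypothetical placeholders:

```python
from itertools import cycle, islice

# Minimal sketch of alternating stereoscopic display: the reproduction unit
# emits the combined left and right images in alternation.

def alternate(left, right, n_frames):
    return list(islice(cycle([left, right]), n_frames))

frames = alternate("combined_L", "combined_R", 4)
assert frames == ["combined_L", "combined_R", "combined_L", "combined_R"]
```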
In the above example, the case where two stereoscopic videos are displayed simultaneously on the display unit 130 has been described; however, the number of stereoscopic videos is not limited to two and may be three or more. In addition, 2D video and 3D video may be displayed on the display unit 130 simultaneously. In that case, the file generation unit 160 may execute, for the 2D video, the processing performed when the determination in step S12 of FIG. 3 is NO, and, for the 3D video, the processing performed when the determination in step S12 of FIG. 3 is YES.
(Other Embodiments)
Although the present invention has been described based on the above embodiments, the present invention is, of course, not limited to those embodiments. The following cases are also included in the present invention.
Specifically, each of the above devices is a computer system including a microprocessor, a ROM, a RAM, a hard disk unit, a display unit, a keyboard, a mouse, and the like. A computer program is stored in the RAM or the hard disk unit. Each device achieves its functions by the microprocessor operating in accordance with the computer program. Here, the computer program is configured by combining a plurality of instruction codes indicating instructions to the computer so as to achieve a predetermined function.
Some or all of the components constituting each of the above devices may be configured as a single system LSI (Large Scale Integrated circuit). A system LSI is an ultra-multifunctional LSI manufactured by integrating a plurality of components on a single chip; specifically, it is a computer system including a microprocessor, a ROM, a RAM, and the like. A computer program is stored in the RAM. The system LSI achieves its functions by the microprocessor operating in accordance with the computer program.
Some or all of the components constituting each of the above devices may be configured as an IC card or a standalone module that is attachable to and detachable from each device. The IC card or the module is a computer system including a microprocessor, a ROM, a RAM, and the like. The IC card or the module may include the above ultra-multifunctional LSI. The IC card or the module achieves its functions by the microprocessor operating in accordance with the computer program. The IC card or the module may be tamper-resistant.
The present invention may also be the methods described above. It may also be a computer program that realizes these methods by a computer, or a digital signal composed of the computer program.
The present invention may also be the computer program or the digital signal recorded on a computer-readable recording medium, for example, a flexible disk, a hard disk, a CD-ROM, an MO, a DVD, a DVD-ROM, a DVD-RAM, a BD (Blu-ray Disc), or a semiconductor memory. It may also be the digital signal recorded on such a recording medium.
The present invention may also transmit the computer program or the digital signal via a telecommunication line, a wireless or wired communication line, a network typified by the Internet, data broadcasting, or the like.
The present invention may also be a computer system including a microprocessor and a memory, in which the memory stores the above computer program and the microprocessor operates in accordance with the computer program.
The invention may also be implemented by another independent computer system by recording the program or the digital signal on a recording medium and transferring it, or by transferring the program or the digital signal via a network or the like.
The above embodiments and the above modifications may be combined with each other.
Although embodiments of the present invention have been described above with reference to the drawings, the present invention is not limited to the illustrated embodiments. Various modifications and variations can be made to the illustrated embodiments within a scope identical or equivalent to that of the present invention.
The present invention is advantageously applicable to a stereoscopic video processing device and a stereoscopic video processing method.
Reference Signs List
100 stereoscopic video processing device
110 video acquisition unit
120 video decoding unit
130 display unit
140 input reception unit
150 2D/3D detection unit
160 file generation unit
170 storage unit
180 file reproduction unit
Claims (7)
- A stereoscopic video processing device comprising: a display unit that alternately displays a first viewpoint image and a second viewpoint image constituting a stereoscopic video; an input reception unit that receives an input of a capture request from a user; and a file generation unit that generates a Multi-Picture Format compliant file in which the first viewpoint image displayed on the display unit at the timing when the capture request is received by the input reception unit and the second viewpoint image corresponding to the first viewpoint image are stored as a pair of still images.
- The stereoscopic video processing device according to claim 1, wherein, when a plurality of stereoscopic videos are simultaneously displayed on the display unit, the file generation unit stores, in the file, the pair of still images acquired from each of the plurality of stereoscopic videos.
- The stereoscopic video processing device according to claim 2, wherein the file generation unit further stores, in the file, a combined first viewpoint image obtained by combining a plurality of first viewpoint images stored in the file in accordance with the layout displayed on the display unit, and a combined second viewpoint image obtained by combining a plurality of second viewpoint images stored in the file in accordance with the layout displayed on the display unit.
- The stereoscopic video processing device according to claim 2 or 3, wherein the file generation unit further stores, in the file, information identifying the combination of the first viewpoint image and the second viewpoint image constituting the pair of still images.
- The stereoscopic video processing device according to any one of claims 1 to 4, further comprising a file reproduction unit that reads out, from the file, the first viewpoint image and the second viewpoint image constituting the pair of still images and causes the display unit to display them alternately.
- The stereoscopic video processing device according to any one of claims 1 to 5, wherein the first viewpoint image is one of a left viewpoint image and a right viewpoint image having parallax with respect to each other, and the second viewpoint image is the other of the left viewpoint image and the right viewpoint image.
- A stereoscopic video processing method comprising: a display step of alternately displaying a first viewpoint image and a second viewpoint image constituting a stereoscopic video; an input reception step of receiving an input of a capture request from a user; and a file generation step of generating a Multi-Picture Format compliant file in which the first viewpoint image displayed in the display step at the timing when the capture request is received in the input reception step and the second viewpoint image corresponding to the first viewpoint image are stored as a pair of still images.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2012/001846 WO2013136373A1 (en) | 2012-03-16 | 2012-03-16 | Three-dimensional image processing device and three-dimensional image processing method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2012/001846 WO2013136373A1 (en) | 2012-03-16 | 2012-03-16 | Three-dimensional image processing device and three-dimensional image processing method |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013136373A1 true WO2013136373A1 (en) | 2013-09-19 |
Family
ID=49160348
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2012/001846 WO2013136373A1 (en) | 2012-03-16 | 2012-03-16 | Three-dimensional image processing device and three-dimensional image processing method |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2013136373A1 (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2003092303A1 (en) * | 2002-04-25 | 2003-11-06 | Sharp Kabushiki Kaisha | Multimedia information generation method and multimedia information reproduction device |
JP2005184377A (en) * | 2003-12-18 | 2005-07-07 | Sharp Corp | Image conversion apparatus and image recording apparatus using it |
WO2011062015A1 * | 2009-11-17 | 2011-05-26 | Sony Corporation | Image transmission method, image reception method, image transmission device, image reception device, and image transmission system |
Non-Patent Citations (1)
Title |
---|
CAMERA & IMAGING PRODUCTS ASSOCIATION (CIPA) STANDARD, CIPA DC-007-2009 MULTI-PICTURE FORMAT, 4 February 2009 (2009-02-04) *
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8878836B2 (en) | Method and apparatus for encoding datastream including additional information on multiview image and method and apparatus for decoding datastream by using the same | |
JP6960528B2 (en) | Methods, devices, and computer programs for generating and processing media content | |
JP5519647B2 (en) | Stereoscopic video data stream generation method and apparatus using camera parameters, | |
US8259162B2 (en) | Method and apparatus for generating stereoscopic image data stream for temporally partial three-dimensional (3D) data, and method and apparatus for displaying temporally partial 3D data of stereoscopic image | |
JP7399224B2 (en) | Methods, devices and computer programs for transmitting media content | |
US8538134B2 (en) | Method and apparatus for receiving and generating image data stream including parameters for displaying local three dimensional image | |
JP5022443B2 (en) | Method of decoding metadata used for playback of stereoscopic video content | |
KR101863767B1 (en) | Pseudo-3d forced perspective methods and devices | |
JP5429034B2 (en) | Stereo image data transmitting apparatus, stereo image data transmitting method, stereo image data receiving apparatus, and stereo image data receiving method | |
WO2012017643A1 (en) | Encoding method, display device, and decoding method | |
WO2008054100A1 (en) | Method and apparatus for decoding metadata used for playing stereoscopic contents | |
JP2012186781A (en) | Image processing device and image processing method | |
WO2012111757A1 (en) | Image processing device and image processing method | |
US10205927B2 (en) | Encoding device and encoding method, and decoding device and decoding method | |
US20130070052A1 (en) | Video procesing device, system, video processing method, and video processing program capable of changing depth of stereoscopic video images | |
WO2011083625A1 (en) | Image processing device, information recording medium, image processing medium, and program | |
JPWO2012128069A1 (en) | Image processing apparatus and image processing method | |
WO2013136373A1 (en) | Three-dimensional image processing device and three-dimensional image processing method | |
EP2685730A1 (en) | Playback device, playback method, and program | |
TW201249168A (en) | Operating method of display chip for three-dimensional display system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12871586 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12871586 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: JP |