CN101616322A - Stereo video coding-decoding method, Apparatus and system - Google Patents

Stereo video coding-decoding method, Apparatus and system Download PDF

Info

Publication number
CN101616322A
CN101616322A CN200810126528A CN200810126528A CN101616322A CN 101616322 A CN101616322 A CN 101616322A CN 200810126528 A CN200810126528 A CN 200810126528A CN 200810126528 A CN200810126528 A CN 200810126528A CN 101616322 A CN101616322 A CN 101616322A
Authority
CN
China
Prior art keywords
image information
scene
dimensional image
slice
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN200810126528A
Other languages
Chinese (zh)
Inventor
赵嵩
刘源
王静
方平
李凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Device Shenzhen Co Ltd
Original Assignee
Shenzhen Huawei Communication Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Huawei Communication Technologies Co Ltd filed Critical Shenzhen Huawei Communication Technologies Co Ltd
Priority to CN200810126528A priority Critical patent/CN101616322A/en
Priority to PCT/CN2009/072241 priority patent/WO2009155827A1/en
Publication of CN101616322A publication Critical patent/CN101616322A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20Image signal generators
    • H04N13/261Image signal generators with monoscopic-to-stereoscopic image conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/597Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)

Abstract

The embodiment of the invention discloses a kind of stereo video coding-decoding method, Apparatus and system, relate to the three-dimensional video-frequency technical field, having solved existing stereo video coding-decoding can not complete reservation three-dimensional video-frequency depth information, and the more problem of shared decode resources.The embodiment of the invention is at first obtained the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information; And, be coded in respectively in the different slice-group in the same Frame two-dimensional image information in the same frame and additional information.The embodiment of the invention mainly is used in three-dimensional film and TV, three-dimensional video-frequency meeting, virtual reality system, long-range Industry Control, many occasions such as robot navigation and tele-medicine.

Description

Stereo video coding-decoding method, Apparatus and system
Technical field
The present invention relates to the three-dimensional video-frequency technical field, particularly the stereoscopic video data are carried out the method for Code And Decode, and the device and the system that realize this method.
Background technology
Three-dimensional (3D) video technique is based on people's binocular parallax principle, obtain Same Scene but two slightly variant width of cloth images by video camera, be shown to people's left eye and right eye respectively, form binocular parallax, thereby make the people can obtain the depth information of scene and experience third dimension.The three-dimensional video-frequency technology can be reappeared the objective world scene truly, shows depth perception, stereovision and the authenticity of scene, is the important directions of current video technical development.
Compare with traditional two dimension (2D) video, three-dimensional video-frequency has two video flowings of right and left eyes, keeping image resolution ratio not reduce, do not consider under the situation of encoding compression, the three-dimensional video-frequency data quantity transmitted is the twice of two-dimensional video data amount, the increase of the video data volume has brought great challenge for storage and transmission, only improves the memory capacity and the network bandwidth and has been not enough to address this problem, and must design coding method stereoscopic video efficiently and compress processing.
Three-dimensional video-frequency is compared 2 traditional dimension videos and is had much bigger data volume, the compression algorithm of two-dimensional video data and international standard are very ripe at present, but, can not directly adopt the compression algorithm of 2-D data to handle stereo video data because that stereo video data and two-dimensional video data have is different.Handle the video data of two passages of three-dimensional video-frequency respectively if adopt the existing two-dimensional image compression algorithm, since do not consider two between the channel data time and the correlation in space, nonsynchronous situation may appear in the video data that feasible display end receives two passages.
A kind of method of stereo scopic video coding is to adopt interweaving method that two channel datas are synthesized the single channel data at present, data after utilizing the two dimensional compaction algorithm process synthetic then, two width of cloth images that are about to form a certain frame scan, wherein the data interlacing of two width of cloth images is arranged, and forms a new Frame.When this synthetic frame is shown, need to put in order frame decoding, be divided into the data of two passages then.If with this three-dimensional video-frequency and existing two-dimensional display device when compatible mutually, the two-dimensional video device just is wanted the data of one of them passage, but adopt this deinterleaving method can cause two-dimentional demonstration the data of two passages all need be decoded, increased the work of two-dimentional display device.
Existing another kind of method for encoding stereo video selects one of them channel data of stereo video data to flow as REF video, and adopt the standard code method to encode, be that predictive coding is carried out on the basis then to the video flowing of another channel data with REF video stream.When this encoding scheme was used conventional two-dimensional display device display video data, the REF video stream of only decoding can realize that two dimension shows; With 3 D stereo display device display video data the time, all encoded contents of then decoding.Though this coding method has reduced the live load of two-dimentional display device, what but this encoding scheme adopted for the video flowing of another channel data is predictive coding, and the mode of this predictive coding can not be with the complete exactly reservation of the depth information in the three-dimensional video-frequency.
Can not the depth information in the three-dimensional video-frequency is accurate and the problem of complete reservation in order to overcome existing second kind of method for encoding stereo video, prior art has proposed another hierarchical coding scheme based on existing compression protocol.The video data that is about to one of them passage adopts conventional method to be encoded to basic layer, and is enhancement layer with the video data encoding of another passage.When carrying out view restructuring, only predict in conjunction with the video data in the basic layer.This encoding scheme 2D/3D's is better compatible, shows for traditional 2D, only needs the decoding base layer data to get final product; Show that for 3D then layer and enhancement layer are all decoded substantially.Video data with two passages in this programme compresses respectively, has completely kept the depth information in the three-dimensional video-frequency.But owing to adopted enhancement layer in this programme, when the data of decoding enhancement layer, take more decode resources, as: decode time and spatial cache need stronger hardware decoding performance.Can't decoding enhancement layer if the hardware decoding performance of terminal is relatively poor, can not realize that promptly 3D shows.
Summary of the invention
Embodiments of the invention provide a kind of stereo video coding-decoding method, Apparatus and system, and compatible preferably two-dimensional video shows, and in the economize on hardware decode resources, depth information that can complete reservation three-dimensional video-frequency.
For achieving the above object, embodiments of the invention adopt following technical scheme:
A kind of method for encoding stereo video comprises:
Obtain the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information;
With two-dimensional image information in the same frame and additional information, be coded in respectively in two slice-group in the same Frame.
A kind of stereo scopic video coding device comprises:
Information acquisition unit is used to obtain the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information;
Coding unit is used for two-dimensional image information and additional information with same frame, is coded in respectively in two slice-group in the same Frame.
A kind of three-dimensional video-frequency coding/decoding method comprises:
Band in the slice-group that is encoded into by the scene two-dimensional image information in the decoded data frame;
Decoded slice-group is dressed up frame.
A kind of three-dimensional video-frequency decoding device comprises:
Decoding unit is used for the band in the slice-group that the decoded data frame is encoded into by the scene two-dimensional image information;
Become frame unit, be used for decoded slice-group is dressed up frame.
A kind of three-dimensional video-frequency system comprises:
The stereo scopic video coding device, be used to obtain the two-dimensional image information of scene, and can constitute the additional information of scene stereo-picture, and, be coded in respectively in two slice-group in the Frame two-dimensional image information in the same frame and additional information with described two-dimensional image information;
The three-dimensional video-frequency decoding device is used for the band in the slice-group that the decoded data frame is encoded into by the scene two-dimensional image information, and decoded slice-group is dressed up frame.
By the described embodiments of the invention of technique scheme, with the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information, be coded in one of them slice-group in the frame.Carry out 2D if desired and show, the wherein slice-group of the two-dimensional image information correspondence of scene of then only need decoding; Carry out 3D if desired and show, then need all slice-group in the decoded frame, realized the compatibility that 2D/3D shows.Because the two-dimensional image information and the additional information of scene all are encoded into band, and band all belongs to basic layer, and is for the decoding of enhancement layer, less to the band required decode resources of decoding; And present embodiment is not the scheme that adopts predictive coding, but with the two-dimensional image information and the additional information direct coding of scene, no matter additional information is the depth information of scene or the two-dimensional image information of another viewpoint of scene, depth information that all can complete reservation three-dimensional video-frequency.
Description of drawings
Fig. 1 is the flow chart of the embodiment of the invention 1 neutral body method for video coding;
Fig. 2 is the block diagram of the embodiment of the invention 1 neutral body video coding apparatus;
Fig. 3 is the flow chart of the embodiment of the invention 1 neutral body video encoding/decoding method;
Fig. 4 is the block diagram of the embodiment of the invention 1 neutral body video decoder;
Fig. 5 is the flow chart of the embodiment of the invention 2 neutral body method for video coding;
Fig. 6 is the block diagram of the embodiment of the invention 2 neutral body video coding apparatus;
Fig. 7 is the flow chart of the embodiment of the invention 3 neutral body method for video coding;
Fig. 8 is the block diagram of the embodiment of the invention 3 neutral body video coding apparatus;
Fig. 9 is the flow chart of the embodiment of the invention 4 neutral body video encoding/decoding methods;
Figure 10 is the block diagram of the embodiment of the invention 4 neutral body video decoders.
Embodiment
The embodiment of the invention is with the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information, is coded in respectively in one of them slice-group in the same frame.Realize the compatibility that 2D/3D shows, and when saving decode resources, the depth information of complete reservation three-dimensional video-frequency.Below in conjunction with accompanying drawing, the embodiment of stereo video coding-decoding method of the present invention and device is described in detail.
Embodiment 1:
As shown in Figure 1, present embodiment neutral body method for video coding specifically comprises the steps:
101, obtain the two-dimensional image information of scene, promptly plane graph obtains the additional information that can constitute the scene stereo-picture with described two-dimensional image information simultaneously.
This additional information is the depth information of scene generally speaking, perhaps is the two-dimensional image information of this another viewpoint of scene; Can directly obtain the two-dimensional image information of two viewpoints by stereo camera, with one of them two-dimensional image information as additional information; Also the two-dimensional image information of two viewpoints that can get access to by stereo camera calculates depth information, and with the two-dimensional image information of one of them viewpoint two-dimensional image information as scene, with depth information as additional information.
Described depth information can also directly obtain by depth camera, and comparatively complete sum is accurate for the depth information that gets access to by depth camera.
102, two-dimensional image information and the additional information with the same frame of above-mentioned scene is encoded into two slice-group respectively.If additional information is the two-dimensional image information of another viewpoint of scene, then can adopt the coded system of same high compression ratio, if additional information is a depth information, then needs two-dimensional image information is adopted the coded system of high compression ratio, and depth information is adopted the coded system of lossless compress.
The present embodiment method for encoding stereo video with two-dimensional image information in the same frame and additional information, is coded in respectively in two different slice-group in the same frame, can be complete the keeps depth information; And because slice-group all belongs to basic layer, make that decoding end does not need enhancement layer is decoded, saved decode resources.And two-dimensional image information is different with the encoding compression mode that depth information adopts, and can keep the depth information of three-dimensional video-frequency better.
Corresponding to above-mentioned method for encoding stereo video, this enforcement also provides a kind of stereo scopic video coding device, and as shown in Figure 2, this stereo scopic video coding device comprises:
Information acquisition unit 21 is used to obtain the two-dimensional image information of scene, i.e. plane graph; Simultaneously, this information acquisition unit is also obtained the additional information that can constitute the scene stereo-picture with described two-dimensional image information.
Coding unit 22 is used for same frame is encoded into a Frame, and two-dimensional image information in this Frame and corresponding respectively two slice-group that are encoded in this Frame of additional information.Generally speaking, coding unit 22 is to encode according to the sequence number of slice-group, and at first the encoding strip thereof group 1, and the encoding strip thereof group 2 then, and wherein slice-group 1 can comprise two-dimensional image information, and slice-group 2 can comprise additional information.
As shown in Figure 3, corresponding to above-mentioned method for encoding stereo video, this enforcement also provides a kind of three-dimensional video-frequency coding/decoding method, specifically comprises the steps:
301, the band in the slice-group that is encoded into by the scene two-dimensional image information in the decoded data frame, the slice-group that is encoded into by the scene two-dimensional image information is arranged to the first slice-group of Frame generally speaking, for example: two-dimensional image information is arranged in the slice-group 0 in the Frame, additional information is arranged in the slice-group 1, so in this step just decoding be numbered band in 0 the slice-group.
302, decoded slice-group is dressed up frame, to export the one-tenth frame data that can show; For the existing two-dimensional display device, only need two-dimensional image information can finish the demonstration of picture, so present embodiment has been realized the compatibility to the 2D demonstration.
Corresponding to the three-dimensional video-frequency coding/decoding method, the embodiment of the invention also provides a kind of three-dimensional video-frequency decoding device, and as shown in Figure 4, this device comprises:
Decoding unit 41 is used for the band in the slice-group that the decoded data frame is encoded into by the scene two-dimensional image information.
Become frame unit 42, be used for decoded slice-group is dressed up frame.
Present embodiment also provides a kind of three-dimensional video-frequency system, this three-dimensional video-frequency system comprises stereo scopic video coding device among Fig. 2 and the three-dimensional video-frequency decoding device among Fig. 4, the description of having omitted the communication module between above-mentioned stereo scopic video coding device and the three-dimensional video-frequency decoding device.
Wherein the stereo scopic video coding device is used to obtain the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information, that is: can this scene of stereo display by described two-dimensional image information and additional information, and the three-dimensional video-frequency code device is with two-dimensional image information in the same frame and additional information, be encoded into respectively in two different slice-group in the same Frame, so that complete reservation additional information, particularly when additional information is depth information, can reproduce three-dimensional video-frequency preferably.
After information after the stereo scopic video coding device will be encoded by communication module sends to the three-dimensional video-frequency decoding device, band in the slice-group that is encoded into by the scene two-dimensional image information in the three-dimensional video-frequency decoding device decoded data frame, and decoded slice-group dressed up frame, show so that carry out image.
Three-dimensional video-frequency decoding device in the present embodiment band in the slice-group that wherein two-dimensional image information is encoded into of only need decoding promptly can be used to carry out image and shows, has saved decode resources.
Embodiment 2:
As shown in Figure 5, present embodiment neutral body method for video coding comprises the steps:
501, obtain the two-dimensional image information of scene, i.e. the plane graph of this scene.
502, obtain the additional information that can constitute the scene stereo-picture with described two-dimensional image information.Additional information in the present embodiment is the depth information of described scene.
Obtain depth information and generally comprise dual mode, be respectively:
Mode one, directly obtain the depth information of described scene by depth camera, the depth information that this mode is obtained is more complete and accurate.
Mode two, according to the two-dimensional image information of described scene and the two-dimensional image information of this another viewpoint of scene, calculate the depth information of described scene, generally need pass through images match, after finding the pixel that matches each other, can calculate the degree of depth by the existing geometric principle, may there be error in this computational methods, but still can satisfy the demand to depth information.
Employing mode two is obtained depth information in the present embodiment.
503,, predict another viewpoint two-dimensional image information of this scene according to depth information and two-dimensional image information.
504, another viewpoint two-dimensional image information of this scene that utilizes prediction to obtain, and the original two-dimensional image information of another viewpoint of this scene carries out the parallax compensation residual computations, obtains residual image information.When showing, if only constitute three-dimensional video-frequency by depth information and two-dimensional image information, can cause the information loss of another viewpoint two-dimensional image information of described scene, if add this residual image information then can recover another viewpoint two-dimensional image information fully, guarantee the definition of image, and can eliminate the time redundancy between the image of two viewpoints.
505, three slice-group of arrangement in same Frame, and the two-dimensional image information in this Frame, depth information correspondence respectively are coded in the different slice-group.Also need simultaneously the residual image information in this frame is coded in the alternative in vitro test group in this Frame.That is, 3 slice-group have been comprised altogether in a frame in the present embodiment.And the two-dimensional image information correspondence is weaved into first slice-group in the frame, and promptly the slice-group at this band place is numbered 0 (other slice-group numberings are respectively 1 and 2).
In the present embodiment, two-dimensional image information is encoded according to the coded system of high compression ratio, and depth information is encoded according to the coded system of lossless compress.Guarantee that depth information obtains better protection, the precision when guaranteeing view restructuring.
Corresponding to above-mentioned method for encoding stereo video, present embodiment also provides a kind of stereo scopic video coding device, and as shown in Figure 6, this device comprises information acquisition unit 61, predicting unit 62, computing unit 63, coding unit 64.Wherein:
Information acquisition unit 61 is used to obtain the two-dimensional image information of scene, i.e. plane graph; Simultaneously, this information acquisition unit is also obtained the additional information that can constitute the scene stereo-picture with described two-dimensional image information.Additional information is the depth information of described scene in the present embodiment.
Information acquisition unit 61 will be obtained depth information, and the equipment that can adopt has following two kinds:
Equipment one, depth camera, this depth camera are directly obtained the two-dimensional image information and the depth information of scene.
Equipment two, binary channels stereo camera, the two-dimensional image information of described scene is obtained by one of them passage of binary channels stereo camera; And calculate the depth information of described scene according to two channel image data in the described binary channels stereo camera.
Selected the binary channels stereo camera as information acquisition unit 61. in the present embodiment
Predicting unit 62 is used for predicting another viewpoint two-dimensional image information of this scene according to depth information and two-dimensional image information.
Computing unit 63, another viewpoint two-dimensional image information of this scene that utilizes prediction to obtain, and the original two-dimensional image information of another viewpoint of this scene carry out the parallax compensation residual computations, obtain residual image information.
Coding unit 64 is used for same frame is encoded into a Frame, and the two-dimensional image information in this Frame, depth information respectively correspondence be encoded into preceding two slice-group.This coding unit 64 also needs the residual image information in this frame is coded in the alternative in vitro test group in this frame simultaneously.That is, 3 slice-group have been comprised in a frame in the present embodiment.And the slice-group of the band that the two-dimensional image information correspondence is encoded into is the first slice-group in this frame, and promptly this slice-group is numbered 0 (other slice-group numberings are respectively 1 and 2).
Coding unit in the present embodiment is encoded two-dimensional image information according to the coded system of high compression ratio; Described depth information is encoded according to the coded system of lossless compress.
Embodiment 3:
Additional information in the present embodiment is different with embodiment 2, and the present embodiment method for encoding stereo video specifically comprises the steps: as shown in Figure 7
701, obtain the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information; Described additional information is the two-dimensional image information of described another viewpoint of scene.Obtain the two-dimensional image information of scene in this enforcement by the binary channels stereo camera, and the two-dimensional image information of this another viewpoint of scene.
702, with the two-dimensional image information of scene in the same frame, and the two-dimensional image information of this another viewpoint of scene, be encoded into different slice-group respectively.And with the slice-group that the two-dimensional image information correspondence is encoded into is first slice-group in this Frame.In the present embodiment, the band in all slice-group constitutes by two-dimensional image information, can all encode according to the coded system of high compression ratio.
Corresponding to above-mentioned method for encoding stereo video, present embodiment also provides a kind of stereo scopic video coding device, and as shown in Figure 8, this device comprises:
Information acquisition unit 81 is used to obtain the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information; Additional information in the present embodiment is the two-dimensional image information of described another viewpoint of scene.So information acquisition unit 61 can adopt the binary channels stereo camera, directly obtain the two-dimensional image information of scene and the two-dimensional image information of this another viewpoint of scene by the binary channels stereo camera.
Coding unit 82 is used for the two-dimensional image information with same frame, and the two-dimensional image information of this another viewpoint of scene, is encoded into the band in two different slice-group in the same frame respectively.Described coding unit is all encoded the two-dimensional image information of two-dimensional image information and another viewpoint according to the coded system of high compression ratio.
Embodiment 4:
Present embodiment is the three-dimensional video-frequency coding/decoding method of corresponding method for encoding stereo video, and as shown in Figure 9, this three-dimensional video-frequency coding/decoding method comprises the steps:
901, owing to be base unit with the band in the cataloged procedure, so band can independently be decoded in the decode procedure.Band in the slice-group that is encoded into by the scene two-dimensional image information in the decoded frame in this step is the band in the first slice-group of this frame generally speaking.
902, can judge the still two dimension demonstration of needs three-dimensional display according to used display device, if carry out three-dimensional display, then execution in step 903; Show that then execution in step 905 if carry out two dimension.
If 903 device therefors need carry out three-dimensional display, the data that then need all to be carried out three-dimensional display are all decoded, so need the band in the remaining slice-group in the decoded data frame successively.
Concrete operations are: the band in the next slice-group in the decoded data frame.
904, whether the band in the judgment data frame is all decoded and is finished, if finish then execution in step 905; Otherwise return execution in step 903.
905, the slice-group that has decoded in the Frame is dressed up frame data, so just finished the decoding of frame data.
According to the difference of encoded content and display requirement, the content that remaining slice-group comprised in the present embodiment also changes to some extent.For example: if encoded content is the two dimensional image of two different points of view of scene, the band that band is encoded into for the two-dimensional image information by another visual angle of scene in the so remaining slice-group.If encoded content is two-dimensional image information and depth information, the band of the band in the so remaining slice-group for being encoded into by scene depth information.If encoded content is for being two-dimensional image information, depth information and scene residual image information, the band under the lower situation of display requirement in the remaining slice-group can be the band that is encoded into by scene depth information; Remaining band is under the display requirement condition with higher: band that is encoded into by scene depth information and the band that is encoded into by scene residual image information.
When showing two dimensional image, a band that decoding wherein is encoded into by the scene two-dimensional image information is as the data of a two field picture in the present embodiment, when showing 3-D view, decode the band in the remaining slice-group again,, realized the compatibility that 2D/3D shows for the usefulness of 3D demonstration.
If present embodiment three-dimensional video-frequency coding/decoding method directly is used on the 3D display device, do not need to consider the compatibling problem of 2D/3D demonstration, then can judge the display type of current display device, directly the band in the remaining slice-group is decoded out, and assembling framing, if directly be used on the 2D display device, also can judge the display type of current display device, but behind the band in decoding the slice-group that is encoded into by the scene two-dimensional image information, just be assembled into a frame image data, and output shows.
Corresponding to above-mentioned three-dimensional video-frequency coding/decoding method, present embodiment also provides a kind of three-dimensional video-frequency decoding device, and as shown in figure 10, this device comprises:
Decoding unit 11 is used for the band in the slice-group that the decoded data frame is encoded into by the scene two-dimensional image information, and the slice-group at these band places is first slice-group of frame generally speaking.
Judging unit 12 judges that according to display device needs carry out three-dimensional display and still carry out the two dimension demonstration;
If carry out three-dimensional display, the data that then need all to be carried out three-dimensional display are all decoded, so the needs band in the remaining slice-group in the decoded frame successively.Concrete operations are: by the next slice-group in the decoding unit decodes bar Frame, have then in the judgment unit judges Frame slice-group whether all decoding finish, if do not finish, then, in finishing frame, remain the decoding of slice-group once more by the band in the next slice-group in the decoding unit decodes Frame.
Show that if carry out two dimension decoding unit does not need the band in the residue slice-group in the decoded frame.
Become frame unit 13, when judging unit 12 was judged the band that does not need to remain in the decoded data frame in the slice-group, when perhaps finishing the decoding that remains band in the slice-group in the Frame, this one-tenth frame unit 13 was dressed up frame with decoded slice-group.
If present embodiment three-dimensional video-frequency coding/decoding method directly is used on the 3D display device, perhaps directly be used on the 2D display device, do not need to judge the display type of current display device, save judging unit 12.If directly be used on the 2D display device, just behind the band that decodes by decoding unit 11 in the slice-group that is encoded into by the scene two-dimensional image information, be assembled into a frame image data by one-tenth frame unit 13, and show; If directly be used on the 3D display device, be assembled into a frame image data again after then the decoding of the band in the remaining slice-group being finished, and show.
Stereo scopic video coding device shown in Figure 6 in the foregoing description 2, and three-dimensional video-frequency decoding device shown in Figure 10 can be formed the three-dimensional video-frequency system among the embodiment 4.
Stereo scopic video coding device shown in Figure 8 in the foregoing description 3, and three-dimensional video-frequency decoding device shown in Figure 10 can be formed the three-dimensional video-frequency system among the embodiment 4.
The embodiment of the invention mainly is used in the three-dimensional video-frequency technology, and particularly stereoscopic video is carried out in the technology of Code And Decode, as: three-dimensional film and TV, the three-dimensional video-frequency meeting, virtual reality system, long-range Industry Control, many occasions such as robot navigation and tele-medicine.
Through the above description of the embodiments, the those skilled in the art can be well understood to the present invention and can realize by the mode that software adds essential general hardware platform, can certainly pass through hardware, but the former is better execution mode under a lot of situation.Based on such understanding, the part that technical scheme of the present invention contributes to prior art in essence in other words can embody with the form of software product, this computer software product is stored in the storage medium that can read, floppy disk as computer, hard disk or CD etc. comprise that some instructions are used so that an equipment is carried out the described method of each embodiment of the present invention.
The above; only be the specific embodiment of the present invention, but protection scope of the present invention is not limited thereto, anyly is familiar with those skilled in the art in the technical scope that the present invention discloses; the variation that can expect easily or replacement all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection range of claim.

Claims (19)

1, a kind of method for encoding stereo video is characterized in that comprising:
Obtain the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information;
With two-dimensional image information in the same frame and additional information, be coded in respectively in two slice-group of same Frame.
2, method for encoding stereo video according to claim 1 is characterized in that, the slice-group that described two-dimensional image information correspondence is encoded into is positioned at the head of corresponding data frame.
3, method for encoding stereo video according to claim 1 and 2 is characterized in that, described additional information is the two-dimensional image information of described another viewpoint of scene, perhaps is the depth information of described scene.
4, method for encoding stereo video according to claim 3 is characterized in that, the mode of described coding is: the two-dimensional image information to described two-dimensional image information and another viewpoint is encoded according to the coded system of high compression ratio; Described depth information is encoded according to the coded system of lossless compress.
5, method for encoding stereo video according to claim 3 is characterized in that, the acquisition process of described depth information is:
Directly obtain the depth information of described scene by depth camera; Perhaps
According to the two-dimensional image information of described scene and the two-dimensional image information of this another viewpoint of scene, calculate the depth information of described scene.
6, method for encoding stereo video according to claim 5 is characterized in that, if according to the two-dimensional image information of described scene and the two-dimensional image information of this another viewpoint of scene, calculates the depth information of described scene, and this method also comprises:
Predict another viewpoint two-dimensional image information of this scene according to depth information and two-dimensional image information;
Another viewpoint two-dimensional image information of this scene that utilizes prediction to obtain, and the original two-dimensional image information of another viewpoint of this scene carry out the parallax compensation residual computations, obtain residual image information;
Also comprise: described residual image information is coded in its place Frame in the slice-group.
7, a kind of stereo scopic video coding device is characterized in that comprising:
Information acquisition unit is used to obtain the two-dimensional image information of scene, and the additional information that can constitute the scene stereo-picture with described two-dimensional image information;
Coding unit is used for two-dimensional image information and additional information with same frame, is coded in respectively in two slice-group of same Frame.
8, stereo scopic video coding device according to claim 7 is characterized in that, described additional information is the depth information of described scene;
Described information acquisition unit is a depth camera, is used for directly obtaining the two-dimensional image information and the depth information of scene; Perhaps
Described information acquisition unit is the binary channels stereo camera, is used to obtain the two-dimensional image information of described scene and the two-dimensional image information of this another viewpoint of scene; And calculate the depth information of described scene according to two channel image data in the described binary channels stereo camera.
9, stereo scopic video coding device according to claim 8 is characterized in that, described coding unit is used for two-dimensional image information is encoded according to the coded system of high compression ratio, and described depth information is encoded according to the coded system of lossless compress.
10, stereo scopic video coding device according to claim 8 is characterized in that, if described information acquisition unit is the binary channels stereo camera, this device also comprises:
Predicting unit is used for predicting another viewpoint two-dimensional image information of this scene according to depth information and two-dimensional image information;
Computing unit is used to utilize and predicts another viewpoint two-dimensional image information of this scene that obtains, and the original two-dimensional image information of another viewpoint of this scene, carries out the parallax compensation residual computations, obtains residual image information;
Described coding unit also is used for described residual image information is coded in the slice-group of its place Frame correspondence.
11, stereo scopic video coding device according to claim 7 is characterized in that, described additional information is the two-dimensional image information of described another viewpoint of scene;
Described information acquisition unit is the binary channels stereo camera, is used for directly obtaining the two-dimensional image information of scene and the two-dimensional image information of this another viewpoint of scene.
12, stereo scopic video coding device according to claim 11 is characterized in that, described coding unit is used for the two-dimensional image information with two-dimensional image information and another viewpoint, encodes according to the coded system of high compression ratio.
13, a kind of three-dimensional video-frequency coding/decoding method is characterized in that comprising:
Band in the slice-group that is encoded into by the scene two-dimensional image information in the decoded data frame;
Decoded slice-group is dressed up frame.
14, three-dimensional video-frequency coding/decoding method according to claim 13 is characterized in that, the described slice-group that is encoded into by the scene two-dimensional image information is positioned at the head of Frame.
15, three-dimensional video-frequency coding/decoding method according to claim 13 is characterized in that, before decoded slice-group was dressed up frame, this method also comprised:
When carrying out three-dimensional display, the band in the residue slice-group in the decoded data frame successively then; Or
When carrying out the two dimension demonstration, then do not need the band in the residue slice-group in the decoded data frame.
16, three-dimensional video-frequency coding/decoding method according to claim 15, it is characterized in that, described remaining slice-group comprises: the slice-group that is made of scene depth information, the perhaps slice-group that constitutes by the two-dimensional image information at another visual angle of scene, perhaps slice-group that constitutes by scene depth information and the slice-group that constitutes by scene residual image information.
17, a kind of three-dimensional video-frequency decoding device is characterized in that comprising:
Decoding unit is used for the band in the slice-group that the decoded data frame is encoded into by the scene two-dimensional image information;
Become frame unit, be used for decoded slice-group is dressed up frame.
18, three-dimensional video-frequency decoding device according to claim 17 is characterized in that,
Described decoding unit is used for when needs carry out three-dimensional display the band in the remaining slice-group of decoded data frame successively; Perhaps
Described decoding unit is used for carrying out two dimension band in the remaining slice-group of decoded data frame not when showing at needs.
19, a kind of three-dimensional video-frequency system is characterized in that comprising:
The stereo scopic video coding device, be used to obtain the two-dimensional image information of scene, and can constitute the additional information of scene stereo-picture, and, be coded in respectively in two slice-group in the same Frame two-dimensional image information in the same frame and additional information with described two-dimensional image information;
The three-dimensional video-frequency decoding device is used for the band in the slice-group that the decoded data frame is encoded into by the scene two-dimensional image information, and decoded slice-group is dressed up frame.
CN200810126528A 2008-06-24 2008-06-24 Stereo video coding-decoding method, Apparatus and system Pending CN101616322A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN200810126528A CN101616322A (en) 2008-06-24 2008-06-24 Stereo video coding-decoding method, Apparatus and system
PCT/CN2009/072241 WO2009155827A1 (en) 2008-06-24 2009-06-12 Method, apparatus and system for stereo video encoding and decoding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200810126528A CN101616322A (en) 2008-06-24 2008-06-24 Stereo video coding-decoding method, Apparatus and system

Publications (1)

Publication Number Publication Date
CN101616322A true CN101616322A (en) 2009-12-30

Family

ID=41444016

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200810126528A Pending CN101616322A (en) 2008-06-24 2008-06-24 Stereo video coding-decoding method, Apparatus and system

Country Status (2)

Country Link
CN (1) CN101616322A (en)
WO (1) WO2009155827A1 (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102387359A (en) * 2010-08-31 2012-03-21 中国电信股份有限公司 Three-dimensional video transmission method, three-dimensional video transmission system and coding and decoding device
CN102630026A (en) * 2011-02-03 2012-08-08 美国博通公司 Method and system for processing video
CN102724520A (en) * 2011-03-29 2012-10-10 青岛海信电器股份有限公司 Method and system for processing videos
CN102934450A (en) * 2010-05-13 2013-02-13 索尼公司 Image processing device and image processing method
WO2014106496A1 (en) * 2013-01-07 2014-07-10 Mediatek Inc. Method and apparatus of depth to disparity vector conversion for three-dimensional video coding
CN106982367A (en) * 2017-03-31 2017-07-25 联想(北京)有限公司 Video transmission method and its device
CN108696471A (en) * 2017-02-21 2018-10-23 科通环宇(北京)科技有限公司 Code stream packaging method, code stream, coding/decoding method and device based on AVS2
WO2018233693A1 (en) * 2017-06-23 2018-12-27 Mediatek Inc. Methods and apparatus for deriving composite tracks
CN110661975A (en) * 2019-10-10 2020-01-07 Oppo广东移动通信有限公司 Image encoding and decoding method and device, electronic equipment and storage medium
CN110784722A (en) * 2019-11-06 2020-02-11 Oppo广东移动通信有限公司 Encoding and decoding method, encoding and decoding device, encoding and decoding system and storage medium
CN110809152A (en) * 2019-11-06 2020-02-18 Oppo广东移动通信有限公司 Information processing method, encoding device, decoding device, system, and storage medium
CN110855997A (en) * 2019-11-06 2020-02-28 Oppo广东移动通信有限公司 Image processing method and device and storage medium
CN111225218A (en) * 2019-11-06 2020-06-02 Oppo广东移动通信有限公司 Information processing method, encoding device, decoding device, system, and storage medium
CN112788325A (en) * 2019-11-06 2021-05-11 Oppo广东移动通信有限公司 Image processing method, encoding device, decoding device and storage medium
CN113383540A (en) * 2019-01-23 2021-09-10 奥崔迪合作公司 Interoperable 3D image content processing
CN114175626A (en) * 2019-11-06 2022-03-11 Oppo广东移动通信有限公司 Information processing method, encoding device, decoding device, system, and storage medium
CN114402590A (en) * 2019-11-06 2022-04-26 Oppo广东移动通信有限公司 Information processing method and system, encoding device, decoding device, and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8679136B2 (en) * 2008-06-17 2014-03-25 Apollo Endosurgery, Inc. Needle capture device

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07123447A (en) * 1993-10-22 1995-05-12 Sony Corp Method and device for recording image signal, method and device for reproducing image signal, method and device for encoding image signal, method and device for decoding image signal and image signal recording medium
JPH07284127A (en) * 1994-04-13 1995-10-27 Mitsubishi Heavy Ind Ltd Encoder/decoder for stereo image
JPH1198528A (en) * 1997-09-19 1999-04-09 Fujitsu Ltd Method for compressing stereoscopic animation and device therefor
JP3519594B2 (en) * 1998-03-03 2004-04-19 Kddi株式会社 Encoding device for stereo video
CN1204757C (en) * 2003-04-22 2005-06-01 上海大学 Stereo video stream coder/decoder and stereo video coding/decoding system
US20050041736A1 (en) * 2003-05-07 2005-02-24 Bernie Butler-Smith Stereoscopic television signal processing method, transmission system and viewer enhancements
US7650036B2 (en) * 2003-10-16 2010-01-19 Sharp Laboratories Of America, Inc. System and method for three-dimensional video coding
CN1545333A (en) * 2003-11-21 2004-11-10 �Ϻ���ͨ��ѧ Method of three dimensional video image signal compression

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102934450A (en) * 2010-05-13 2013-02-13 索尼公司 Image processing device and image processing method
CN102387359A (en) * 2010-08-31 2012-03-21 中国电信股份有限公司 Three-dimensional video transmission method, three-dimensional video transmission system and coding and decoding device
CN102630026A (en) * 2011-02-03 2012-08-08 美国博通公司 Method and system for processing video
CN102724520A (en) * 2011-03-29 2012-10-10 青岛海信电器股份有限公司 Method and system for processing videos
US10341638B2 (en) 2013-01-07 2019-07-02 Mediatek Inc. Method and apparatus of depth to disparity vector conversion for three-dimensional video coding
WO2014106496A1 (en) * 2013-01-07 2014-07-10 Mediatek Inc. Method and apparatus of depth to disparity vector conversion for three-dimensional video coding
CN104919799A (en) * 2013-01-07 2015-09-16 联发科技股份有限公司 Method and apparatus of depth to disparity vector conversion for three-dimensional video coding
CN108696471A (en) * 2017-02-21 2018-10-23 科通环宇(北京)科技有限公司 Code stream packaging method, code stream, coding/decoding method and device based on AVS2
CN106982367A (en) * 2017-03-31 2017-07-25 联想(北京)有限公司 Video transmission method and its device
US10873733B2 (en) 2017-06-23 2020-12-22 Mediatek Inc. Methods and apparatus for deriving composite tracks
WO2018233693A1 (en) * 2017-06-23 2018-12-27 Mediatek Inc. Methods and apparatus for deriving composite tracks
CN113383540B (en) * 2019-01-23 2024-04-02 奥崔迪合作公司 Interoperable 3D image content processing
CN113383540A (en) * 2019-01-23 2021-09-10 奥崔迪合作公司 Interoperable 3D image content processing
CN110661975B (en) * 2019-10-10 2021-10-26 Oppo广东移动通信有限公司 Image encoding and decoding method and device, electronic equipment and storage medium
CN110661975A (en) * 2019-10-10 2020-01-07 Oppo广东移动通信有限公司 Image encoding and decoding method and device, electronic equipment and storage medium
CN112788325A (en) * 2019-11-06 2021-05-11 Oppo广东移动通信有限公司 Image processing method, encoding device, decoding device and storage medium
CN111225218A (en) * 2019-11-06 2020-06-02 Oppo广东移动通信有限公司 Information processing method, encoding device, decoding device, system, and storage medium
CN110855997A (en) * 2019-11-06 2020-02-28 Oppo广东移动通信有限公司 Image processing method and device and storage medium
CN110809152A (en) * 2019-11-06 2020-02-18 Oppo广东移动通信有限公司 Information processing method, encoding device, decoding device, system, and storage medium
CN114175626A (en) * 2019-11-06 2022-03-11 Oppo广东移动通信有限公司 Information processing method, encoding device, decoding device, system, and storage medium
CN114402590A (en) * 2019-11-06 2022-04-26 Oppo广东移动通信有限公司 Information processing method and system, encoding device, decoding device, and storage medium
CN114175626B (en) * 2019-11-06 2024-04-02 Oppo广东移动通信有限公司 Information processing method, encoding device, decoding device, system, and storage medium
CN110784722A (en) * 2019-11-06 2020-02-11 Oppo广东移动通信有限公司 Encoding and decoding method, encoding and decoding device, encoding and decoding system and storage medium

Also Published As

Publication number Publication date
WO2009155827A1 (en) 2009-12-30

Similar Documents

Publication Publication Date Title
CN101616322A (en) Stereo video coding-decoding method, Apparatus and system
Merkle et al. Multi-view video plus depth representation and coding
Martinian et al. Extensions of H. 264/AVC for multiview video compression
CN101415114B (en) Method and apparatus for encoding and decoding video, and video encoder and decoder
CN102685532B (en) Coding method for free view point four-dimensional space video coding system
US10264281B2 (en) Method and apparatus of inter-view candidate derivation in 3D video coding
KR101753171B1 (en) Method of simplified view synthesis prediction in 3d video coding
CN101986716B (en) Quick depth video coding method
US20080205791A1 (en) Methods and systems for use in 3d video generation, storage and compression
CN101243692B (en) Method and apparatus for encoding multiview video
CN102055982A (en) Coding and decoding methods and devices for three-dimensional video
CN102970529B (en) A kind of object-based multi-view point video fractal image compression & decompression method
US20150172714A1 (en) METHOD AND APPARATUS of INTER-VIEW SUB-PARTITION PREDICTION in 3D VIDEO CODING
KR100738867B1 (en) Method for Coding and Inter-view Balanced Disparity Estimation in Multiview Animation Coding/Decoding System
Pan et al. Motion and disparity vectors early determination for texture video in 3D-HEVC
KR20110133532A (en) Apparatus and method for encoding depth image
CN105474640B (en) The method and apparatus that the camera parameters of 3 d video encoding transmit
US11509879B2 (en) Method for transmitting video, apparatus for transmitting video, method for receiving video, and apparatus for receiving video
Merkle et al. Efficient compression of multi-view depth data based on MVC
Yan et al. CTU layer rate control algorithm in scene change video for free-viewpoint video
CN103108183B (en) Skip mode and Direct mode motion vector predicting method in three-dimension video
US10477230B2 (en) Method and apparatus of disparity vector derivation for three-dimensional and multi-view video coding
Liu et al. Point cloud video streaming in 5G systems and beyond: challenges and solutions
CN102263952B (en) Quick fractal compression and decompression method for binocular stereo video based on object
CN102263953B (en) Quick fractal compression and decompression method for multicasting stereo video based on object

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Open date: 20091230