CN1204757C - Stereo video stream coder/decoder and stereo video coding/decoding system - Google Patents

Stereo video stream coder/decoder and stereo video coding/decoding system Download PDF

Info

Publication number
CN1204757C
CN1204757C CN 03116541 CN03116541A CN1204757C CN 1204757 C CN1204757 C CN 1204757C CN 03116541 CN03116541 CN 03116541 CN 03116541 A CN03116541 A CN 03116541A CN 1204757 C CN1204757 C CN 1204757C
Authority
CN
China
Prior art keywords
video
frame
video flowing
auxilliary
estimation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 03116541
Other languages
Chinese (zh)
Other versions
CN1450816A (en
Inventor
张兆杨
安平
骆艳
戏昌满
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai University
University of Shanghai for Science and Technology
Original Assignee
University of Shanghai for Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Shanghai for Science and Technology filed Critical University of Shanghai for Science and Technology
Priority to CN 03116541 priority Critical patent/CN1204757C/en
Publication of CN1450816A publication Critical patent/CN1450816A/en
Application granted granted Critical
Publication of CN1204757C publication Critical patent/CN1204757C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention relates to data compression technique for stereo videos. When the present invention carries out encoding for binocular video streams collected by a stereo camera system, one path of videos carry out encoding according to the standard of compatible MPEG series, and another path of videos respectively uses parallax compensation prediction, and combined parallax and motion compensation prediction, and carry out coding transmission at decoding ends through frame estimation and interpolation, wherein the parallax estimation uses multilevel segmentation block estimation based on Markov model; in order to utilize reference frame pictures recovered the decoding end, corresponding parallax and motion vector, the method of the frame estimation and the interpolation uses a method based on a frame estimation probability model to carry out estimation and interpolation. The decoding end has two stage decoding, wherein one state decodes main video streams to obtain the single video displayed on common display equipment, and the other stage decodes all double video streams; recovered stereo video signals are synthesized into displayed stereo pictures by an automatic stereo display.

Description

A kind of stereo video streaming coder/decoder and stereo video coding-decoding system thereof
Technical field
The present invention relates to the moving image treatment technology, be specifically related to a kind of coding/decoding method and device of stereo video data.
Background technology
Human when watching the world around, can not only see the width and the height of object, and can know their degree of depth, can judgment object between or the distance between beholder and the object.The main cause that this 3D vision characteristic produces is: people usually always binocular watch object simultaneously, and because the spacing (about 65mm) of two eyes optical axis, left eye and right eye are when seeing the object of certain distance, received sensed image is different, thereby brain is by motion, the adjustment of eyeball, combine the information of this two images, produce third dimension.When list was watched object with left eye and right eye, the visual referred sensation that is produced just was parallax.
The 3D vision characteristic that comes from the binocular structure to we provide one from about obtain two images real world relative depth sense directly and simple method, and this relative depth information is such as telecommunication (tele-medicine, teleconference), tele-robotic (Remote, autonomous aviation, supervision), be vital in the application of amusement (interactive HDTV, three-dimensional film) and virtual reality and so on.But this to introduce the obvious cost of relative depth information be to make its transmission and data quantity stored double above than mono-vision system in order to increase authenticity.In order to satisfy the increase of data volume, settling mode increases channel width nothing more than, improves channel utilization and reduce these approach of information source code check with compress technique efficiently with agreement efficiently.But, therefore must adopt effective image compression technology owing to increase the diseconomy of the memory span and the network bandwidth.
The method of stereo scopic video coding all is to utilize the correlation between the binocular video stream to come the whole code efficiency that improves the two-path video signal in itself under the prior art.Two class methods are arranged substantially, and the first kind is the three-dimensional video-frequency stream encoding method based on the MPEG video encoding standard, and its basic principle is that one road video flowing is wherein encoded separately, and another road video flowing then adopts disparity estimation and compensation technique to encode.These class methods mostly adopt the hybrid coding mode, for example with the mixing definition of operating such coding (definition of one of them stream is relative relatively poor), based on the Bit Allocation in Discrete method of psychological characteristics, based on the D encoding of multiresolution and adopt frame to estimate that interpolation rebuilds right B frame and clap, right wing video flowing B frame does not transmit, and recovers but make interpolation in decoding end) etc. mode.The problem that these class methods exist is: the efficient of disparity estimation compensation haves much room for improvement; Ignored the effective utilization to right wing stream movable information when utilizing binocular parallax information, binary encoding efficient also has the bigger rising space; Though adopt frame to estimate that the D encoding compression ratio of interpositioning is high, existing frame estimates that interpositioning is fairly simple, the image reconstruction quality is undesirable; Also lack ripe perfect D encoding system generally.
Second class is object-based D encoding method, and its basic principle is the object in the scene to be cut apart to extract also encode in conjunction with motion and binocular depth information.But when having a plurality of objects to occur in the scene, the coding effect of these class methods is also bad, and owing to its complexity of calculation, real-time is also relatively poor, and is still far away from the requirement of real time system application simultaneously.
Summary of the invention
The purpose of this invention is to provide a kind of stereo video streaming coder/decoder, it have compression rates height, decode rate fast and can with the advantage of single video coded system compatibility.
Above-mentioned purpose of the present invention is achieved through the following technical solutions:
A kind of three-dimensional video-frequency stream encoder comprises:
Main video flowing coding unit, the road video flowing that is used for stereoscopic video stream is encoded to generate main video code flow according to the MPEG agreement;
Auxilliary video flowing coding unit, it comprises:
Parallax/motion-compensated estimation is single not to be had, another road video flowing interior corresponding intracoded frame and the MB of prediction frame of stereoscopic video stream are carried out disparity-compensated estimation respectively to be used for utilizing interior intracoded frame of main video flowing and MB of prediction frame, and utilize intracoded frame previous in the auxilliary video flowing and/or MB of prediction frame that current MB of prediction frame in the auxilliary video flowing is carried out motion-compensated estimation, the initial value of disparity-compensated estimation obtains according to following manner: utilizes by the MB of prediction frame in the auxilliary video flowing being carried out motion vector that motion-compensated estimation obtains the optical parallax field of previous intracoded frame or MB of prediction frame carried out motion-compensated estimation, and with the initial value of new optical parallax field as disparity-compensated estimation:
The compensation prediction coding unit is used for the disparity-compensated estimation information of the intracoded frame in the auxilliary video flowing and the disparity-compensated estimation information and the motion-compensated estimation information of MB of prediction frame are encoded to generate auxilliary video code flow;
Multiplexer is used for main video code flow and auxilliary video code flow are generated the three-dimensional video-frequency code stream with time division multiplexing mode.
Reasonable is that in above-mentioned three-dimensional video-frequency stream encoder, described parallax/motion estimation unit adopts carries out disparity estimation based on layering markov probabilistic model and multistage matching way.Be more preferably, in described layering markov probabilistic model and overlapping piece matching way, stratum level is set at two-stage, and the block size is divided into 8 * 8 and 16 * 16 two kinds.
Reasonable is in above-mentioned three-dimensional video-frequency stream encoder, to adjust the transmission channel bandwidth that auxilliary video code flow takies by the residual error image DCT quantization parameter that changes in the disparity-compensated estimation information.
A kind of three-dimensional video-frequency stream decoder comprises:
Demultiplexer is used for the three-dimensional video-frequency code stream is decomposed into main video code flow and auxilliary video code flow;
Main video code flow decoding unit is used for main video code flow is decoded to generate main video flowing according to the MPEG agreement;
Auxilliary video code flow decoding unit, it comprises:
Parallax/motion compensated prediction unit is used for rebuilding according to the disparity-compensated estimation information that comprises in the intracoded frame of main video flowing and MB of prediction frame and the auxilliary video code flow and motion-compensated estimation information the intracoded frame and the I picture predictive frame of auxilliary video flowing;
Frame estimates and interpolation unit, be used for according to corresponding main video flowing bi-directional predicted/disparity-compensated estimation information that intracoded frame in the interpolation frame, auxilliary video flowing and MB of prediction frame and auxilliary video code flow comprise and motion-compensated estimation information rebuilds the bi-directional predicted/interpolation frame in the auxilliary video flowing;
Assist the video flowing reconstruction unit, be used for the intracoded frame of parallax/motion compensated prediction unit reconstruction and the bi-directional predicted/two-way interpolation frame of I picture predictive frame and frame estimation and interpolation unit reconstruction are sorted to generate auxilliary video flowing according to time order and function.
Reasonable is that in above-mentioned three-dimensional video-frequency stream decoder, frame estimates to estimate based on the three-dimensional frame of Bayes's minimum cost equation with the interpolation unit employing and interpolating method is rebuild bi-directional predicted/interpolation frame.
In three-dimensional video-frequency stream encoder of the present invention, owing to only one of them video flowing is carried out high-quality coding according to mpeg standard, and in another video flowing, have only a few frames to encode, all the other frames are fully " skipping " and carry out frame in decoding end and estimate that interpolation recovers then, therefore improve code efficiency greatly, saved transmission bandwidth.
The purpose of this invention is to provide a kind of three-dimensional video-frequency treatment system, it have compression rates height, decode rate fast and can with the advantage of single video coded system compatibility.
Above-mentioned purpose of the present invention is achieved through the following technical solutions:
A kind of processing system for video, comprise picked-up left road and right wing video flowing video camera, make the synchronous in time time base corrector of the video flowing of two video cameras outputs, will be through the stream of the two-path video after the time base corrector time synchronizing multiplexed frame sequential multiplexer, the computer system that comprises stereo coder as claimed in claim 1 and three-dimensional video-frequency stream decoder as claimed in claim 5 and regular display and three-dimensional display with the formation stereo video streaming
Wherein, in the time only need transmitting the single channel video flowing, video image is encoded and signal is delivered to transmission channel by the main video flowing coding unit of three-dimensional video-frequency stream encoder, when needs transmitting two paths video flowing, respectively left and right sides road video image is encoded and signal is delivered to transmission channel by main video flowing coding unit and auxilliary video flowing coding unit, when the video code flow that receives only comprises one road video flowing, encoding code stream is decoded and decoded signal is delivered to regular display by the main video code flow decoding unit of three-dimensional video-frequency stream decoder, when the code stream that receives comprises left and right sides two-path video stream signal, respectively left and right sides two-way encoding code stream is decoded and decoded signal is delivered to three-dimensional display by main video code flow decoding unit and auxilliary video code flow decoding unit.
Video system of the present invention except having the advantage that code efficiency is high and the transmission bandwidth requirement is low, also can with the serial coding standard compatibility of existing single video MPEG.This is keeping having reduced the upgrade cost of system under the compatible prerequisite of coding, and the flexible control to three-dimensional display quality is provided.
Description of drawings
By below in conjunction with the description of accompanying drawing to preferred embodiment of the present invention, can further understand purpose of the present invention, feature and advantage, wherein:
Fig. 1 is the schematic diagram according to stereo video streaming coder/decoder of the present invention.
Fig. 2 is for adopting the processing system for video schematic diagram according to stereo video streaming coder/decoder of the present invention.
Embodiment
Below in conjunction with accompanying drawing preferred embodiment of the present invention is described.
Fig. 1 is the schematic diagram according to stereo video streaming coder/decoder of the present invention.As shown in Figure 1, three-dimensional video-frequency stream encoder 1 is responsible for the left and right sides video flowing of input is encoded, below for convenience of description for the purpose of supposition left video stream be main video flowing and right video flowing is auxilliary video flowing, it is limitation of the invention that but this supposition should not be construed as, and in fact also can be opposite supposition.The video code flow channel 2 that three-dimensional video-frequency stream encoder 1 coding generates transfers to three-dimensional video-frequency stream decoder 3.
Referring to Fig. 1, three-dimensional video-frequency stream encoder 1 comprises mpeg encoder 4, multiplexer 7 as main video flowing coding unit and the auxilliary video flowing coding unit that is made of parallax/motion-compensated estimation unit 5 and compensation prediction coding unit 6.
It on the MPEG digital video coding technical spirit a kind of method for compressing image that utilizes the statistical redundancy degree of video sequence on time and direction in space to realize, it depends on the correlation of (interpel) between the pixel, comprises such hypothesis: promptly have simple correlation translational motion between each successive frame.Therefore the pixel value on special frame can adopt the intraframe coding technology to be predicted according near pixel at same frame, perhaps can adopt the interframe technology to be predicted according to the pixel near the frame.
When a video sequence shot change, the temporal correlation near each frame between the pixel is just very little, even disappears, and should adopt this moment the intraframe coding technology to come the development space correlation to compress to realize active data.In the MPEG compression algorithm, adopt discrete cosine transform (DCT) coding techniques, picture block with 8 * 8 pixels is that unit effectively develops the spatial coherence between near each picture rope of same picture, below can be called intracoded frame according to the picture frame of intraframe coding technique compresses, and brief note is I MOr I A, wherein subscript M and A represent main video flowing and auxilliary video flowing respectively.
If have bigger correlation near the frame between each pixel, that is to say, when the content of two successive frames is very similar or identical, just can adopt interframe DPCM coding techniques based on time prediction (motion compensated prediction of interframe), below can be called MB of prediction frame, and brief note is P according to the picture frame of inter-frame coding compression MOr P A, wherein subscript M and A represent main video flowing and auxilliary video flowing respectively.
Also introduce a kind of picture frame that is called bi-directional predicted frames in mpeg standard, it can adopt past frame and future frame, and reduction obtains as the reference frame, but itself can not be as the reference frame, below this class picture frame is called bi-directional predicted frames, and brief note is B MOr B A, wherein subscript M and A represent main video flowing and auxilliary video flowing respectively.
In the present invention, 4 pairs of left roads of mpeg encoder video flowing is encoded generating main video code flow according to mpeg standard, and this main video code flow is by according to certain tactic coding back I M, P MAnd B MFrame sequence constitutes.
As shown in the figure, two-path video stream in the left and right sides all is transfused to the parallax/motion estimation unit 5 in the auxilliary video flowing coding unit, and carries out parallax and estimation in this unit.Particularly, with main video flowing and auxilliary video flowing inter-sync or corresponding intracoded frame I MWith I AAnd MB of prediction frame P MWith P ACompare to obtain picture frame I in the auxilliary video flowing AOr P ADisparity estimation; With the auxilliary previous intracoded frame I of video flowing AOr MB of prediction frame P AWith current MB of prediction frame P ACompare to obtain estimation current MB of prediction frame.Why be every width of cloth P AFrame provide motion estimation information and disparity estimation information be because, in the ordinary course of things, motion and parallax are carried out mixed compensation can obtain best predicting the outcome, therefore in the present invention, in order to make decoding end recover the picture frame of better quality, parallax/motion estimation unit 5 is a width of cloth P AFrame provides motion estimation information (by with previous same video flowing internal reference frame I AFrame or P AFrame and current P AFrame relatively obtains) and disparity estimation information (according to the P of correspondence AAnd P MFrame obtains), can effectively solve the problem of blocking the code efficiency reduction that causes with parallax barrier because of time domain like this.
The method of disparity estimation has multiple, and in the present invention, parallax/motion estimation unit 5 adopts carries out disparity estimation based on layering markov probabilistic model and multistage matching way.The advantage of this method is to obtain a level and smooth and relatively accurate optical parallax field, and this will reduce the entropy of parallax compensation residual error image greatly, thereby further improves compression ratio.For with the piece size compatibility of mpeg standard, when adopting above-mentioned layering markov probabilistic model and overlapping piece matching way, stratum level is set at two-stage, the block size is divided into 8 * 8 and 16 * 16 two kinds.
Motion-compensated estimation is a kind of time-based DPCM coded prediction technology, and it has obtained extensive use in MPEG1 and MPEG2 video encoding standard.The motion compensation notion is with the basis that is estimated as to the video interframe movement, that is to say, if all objects all spatially have a displacement in the video lens, then use limited kinematic parameter (for example for the translational motion of pixel, the available motion vector is described) to come interframe movement is described.Because the spatial coherence between some motion vectors is higher usually, sometimes can think that a motion vector represented the motion of an adjacent pixel blocks, therefore a frame picture can be divided into several pixel blocks (block of pixels is 16 * 16 pixels in MPEG1 and MPEG2 standard), and only a motion vector representing each block of pixels be estimated, encoded and transmits.Owing to only prediction error picture (difference between raw frames and the motion compensated prediction picture) is encoded, therefore reduced the temporal redundancy of interframe.
Actual observation shows that for stereo video image continuous in time, their optical parallax field has the height temporal redundancy equally, and therefore in the present invention, reasonable is the initial value that obtains disparity-compensated estimation according to following manner: at first to P AFrame carries out motion compensated prediction to obtain motion vector, then to reference frame I previous in the same video flowing A(or P A) optical parallax field carry out motion compensated prediction, obtain the initial value that new optical parallax field promptly can be used as disparity estimation thus.This mode can reduce the auxilliary required time of video flowing coding greatly, has improved coding rate.
Compensation prediction coding unit 6 links to each other with parallax/motion estimation unit 5, the I that it obtains parallax/motion estimation unit 5 AFrame disparity estimation compensated information and P AFrame disparity estimation compensated information or motion estimation and compensation information encode to generate auxilliary video code flow.I behind the coding AFrame disparity estimation compensated information bit stream is divided into three parts: difference vector stream, parallax compensation residual error image and quad-tree structure, wherein, difference vector stream adopts differential pulse coding method (DPCM) coding, and the residual error image adopts discrete cosine transform (DCT) and mark quantization methods to encode.
Multiplexer 7 links to each other with compensation prediction coding unit 6 with mpeg encoder 4, and it generates the three-dimensional video-frequency code stream with main video code flow and auxilliary video code flow with time division multiplexing mode.In the present invention, in order to improve code efficiency, all bi-directional predicted/interpolation frame (B in the auxilliary video flowing AFrame) do not make any encoding process, do not send into multiplexer 7 with transmission on channel 2 yet as an auxilliary video code flow part.
In above-mentioned three-dimensional video-frequency stream encoder, can be by the DCT quantization parameter of residual error image behind the above-mentioned parallax compensation of change, the additional bandwidth that changes transmission channel neatly is to satisfy the stereo display under the various bandwidth demands.
Refer again to Fig. 1, three-dimensional video-frequency stream decoder 3 comprises estimates the auxilliary decoding video stream unit that constitutes with interpolation unit 11 and auxilliary video flowing reconstruction unit 12 as the mpeg decoder 9 of main decoding video stream unit, demultiplexer 7 and by parallax/motion compensated prediction unit 10, frame.
As shown in Figure 1, demultiplexer 8 is decomposed into main video code flow and auxilliary video code flow with the three-dimensional video-frequency code stream of transmission on the channel 2 and main video flowing offered mpeg decoder 9 and will assists that video flowing offers parallax/motion compensated prediction unit 10 and frame is estimated and interpolation unit 11.
9 pairs of main video code flows of mpeg decoder are decoded generating main video flowing according to the MPEG agreement, and it is by according to certain tactic recovery back I M, P MAnd B MFrame sequence constitutes.
Also estimate to link to each other with auxilliary video flowing reconstruction unit 12 with interpolation unit 11 with mpeg decoder 9, frame in parallax/motion compensated prediction unit 10, it is according to intracoded frame I in the main video flowing of mpeg decoder 9 outputs MWith MB of prediction frame P MAnd corresponding intracoded frame I in disparity estimation compensated information that comprises in the auxilliary video code flow of demultiplexer 8 outputs and the auxilliary video flowing of motion estimation and compensation information reconstruction AWith MB of prediction frame P A, the I of its reconstruction AFrame and P AFrame is output to frame estimation and interpolation unit 11 and auxilliary video flowing reconstruction unit 12.
Frame is estimated also to link to each other with auxilliary video flowing reconstruction unit 12 with mpeg decoder 9 with interpolation unit 11, and it is according to corresponding bi-directional predicted frames B in the main video flowing of mpeg decoder 9 outputs M, corresponding intracoded frame I in the auxilliary video flowing AWith MB of prediction frame P A(this B for example AThe I that front and back are contiguous AFrame and P AFrame) and the disparity estimation compensated information that comprises in the auxilliary video code flow and motion estimation and compensation information rebuild the bi-directional predicted/interpolation frame of auxilliary video flowing, the B of its reconstruction AFrame is output to auxilliary video flowing reconstruction unit 12.
In auxilliary video flowing reconstruction unit 12, the intracoded frame I that parallax/motion compensated prediction unit 10 rebuilds AWith MB of prediction frame P AAnd bi-directional predicted/interpolation frame B that frame is estimated and interpolation unit is rebuild ASuccessively sort to generate auxilliary video flowing according to acquisition time.
Because the overwhelming majority is B in the auxilliary video flowing AFrame, so in three-dimensional encoding and decoding structure, B AFrame is rebuild speed and image quality is crucial.For this reason, adopt a kind of frame method of estimation in the present invention, its three-dimensional frame based on Bayes's minimum cost equation is estimated and interpolating method (SFEI_BLCF).This method is utilized in motion, parallax and the pictorial information (representing with arrow shown in the dotted line in Fig. 2) of decoding end acquisition and the characteristics of stereoscopic video sequence self, can synthesize B fast AFrame, and image reconstruction has acceptable quality on the stereoscopic vision meaning.Concrete reconstruction procedures is as follows:
(1) because B AFrame inserts in I in being AWith P ABetween the frame, so to I AWith P AMotion vector between the frame is pressed B AFrame is to I AThe distance of frame is stretched to determine I APixel in the frame is at B APosition in the frame.
(2) for same pixel, if it is at corresponding B M, I AAnd P AThe difference of the pixel value in the frame then is considered as the viewing area with it less than set point, to the weighted average of these pixel values as B AThe value of respective pixel point in the frame, and note B AThis pixel points to I in the frame AAnd P AThe motion vector of frame and sensing B MThe difference vector of frame.
(3) for same pixel, if it is at corresponding B M, I AAnd P AThe difference of the pixel value in the frame is more than or equal to set point, then this pixel is considered as blocking a little, in the viewing area of its neighborhood, select in the motion vector relevant one as match motion vector, and be mapped to corresponding picture frame to obtain the final pixel value of this point according to this motion vector with each pixel.
Therefore, in stereo video streaming encoder/decoder of the present invention, the B of main video flowing MThe I of frame, auxilliary video flowing AAnd P AFrame all needs to carry out coding transmission as the reference frame of interframe compensation prediction.But, can directly utilize motion that decoding end obtains and difference vector value to auxilliary video flowing B in when decoding AFrame recovers and rebuilds and need not to carry out match search, so the present invention has the high and fireballing characteristics of decoding of compression rates.
Fig. 2 shows processing system for video schematic diagram of the present invention.As shown in Figure 2, this processing system for video comprises two the video camera 21a and 21b, the time base corrector 22 that links to each other with video camera, the frame sequential multiplexer 23 that links to each other with time base corrector 22, computer system 24 and regular display 25 and three-dimensional display 26 that absorb left road and right wing video flowing respectively, and wherein computer system 24 comprises above-mentioned stereo coder and three-dimensional video-frequency stream decoder.
In above-mentioned processing system for video, when encoding, the left and right sides video flowing of two video camera 21a and 21b output exports frame sequential multiplexer 23 to after time base corrector 22 carries out time synchronizing, send into computer system 24 through behind the multiplexed formation stereo video streaming.When having only one road video flowing input computer system 24 or only need transmit the single channel video flowing, video image is encoded and the mpeg standard signal bit stream is delivered to transmission channel by the main video flowing coding unit of three-dimensional video-frequency stream encoder, when needs transmitting two paths video flowing, respectively left and right sides road video image is encoded and the signal that will comprise main video code flow and auxilliary video code flow is delivered to transmission channel by the main video flowing coding unit of three-dimensional video-frequency stream encoder and auxilliary video flowing coding unit.
Decoding is finished by the three-dimensional video-frequency stream decoder of computer system 24, when the video flowing that receives only comprises one road video flowing, encoding code stream is decoded and decoded signal is delivered to regular display by the main video code flow decoding unit of three-dimensional video-frequency stream decoder, when the code stream that receives comprises left and right sides two-path video stream signal, respectively left and right sides two-way encoding code stream is decoded and decoded signal is delivered to automatic stereoscopic display device by the main video code flow decoding unit of three-dimensional video-frequency stream decoder and auxilliary video code flow decoding unit.
Below with a concrete example of using effect of the present invention is described.Suppose that picture frame is CIF form (352 * 288), main video flowing is encoded according to the grammer standard of mpeg encoded, this road image quality higher relatively (average peak signal to noise ratio PSNR is about 35dB), encoding rate is 0.14MbS~2.55MbS.Only there is a few frames to carry out predictive coding and transmission in the auxilliary video flowing, all the other frames are complete " skipping " then, the frame of " being skipped " at coding side carries out real-time recovery in decoding end by frame estimation and interpolation, and the average encoding rate of this road video flowing is 14.8Kbs~108Kbs.By relatively as seen, transmit that to assist the needed additional bandwidth of video flowing extremely low, make that total bit stream of three-dimensional digit TV only is about 1.15~1.3 times of common haplopia digital television transfer bit stream.Though the image quality of auxilliary video flowing is than main video flowing low slightly (average peak signal to noise ratio PSNR is about 30dB), but this have mixed-resolution about image decoding end can utilize fully the human vision system characteristic (HumanVisualsystem, HVS) and corresponding three-dimensional display synthesize stereo image with high visual definition and enough depth perceptions.

Claims (7)

1. a three-dimensional video-frequency stream encoder is characterized in that, comprising:
Main video flowing coding unit, the road video flowing that is used for stereoscopic video stream is encoded to generate main video code flow according to the MPEG agreement;
Auxilliary video flowing coding unit, it comprises:
Parallax/motion-compensated estimation unit, another road video flowing interior corresponding intracoded frame and the MB of prediction frame of stereoscopic video stream are carried out disparity-compensated estimation respectively to be used for utilizing interior intracoded frame of main video flowing and MB of prediction frame, and utilize intracoded frame previous in the auxilliary video flowing and/or MB of prediction frame that current MB of prediction frame in the auxilliary video flowing is carried out motion-compensated estimation, wherein, the initial value of disparity-compensated estimation obtains according to following manner: utilizes by the MB of prediction frame in the auxilliary video flowing being carried out motion vector that motion-compensated estimation obtains the optical parallax field of previous intracoded frame or MB of prediction frame carried out motion-compensated estimation, and with the initial value of new optical parallax field as disparity-compensated estimation;
The compensation prediction coding unit is used for the disparity-compensated estimation information of the intracoded frame in the auxilliary video flowing and the disparity-compensated estimation information and the motion-compensated estimation information of MB of prediction frame are encoded to generate auxilliary video code flow;
Multiplexer is used for main video code flow and auxilliary video code flow are generated the three-dimensional video-frequency code stream with time division multiplexing mode.
2. three-dimensional video-frequency stream encoder as claimed in claim 1 is characterized in that, described parallax/motion estimation unit adopts carries out disparity estimation based on layering markov probabilistic model and multistage matching way.
3. three-dimensional video-frequency stream encoder as claimed in claim 2 is characterized in that, in described layering markov probabilistic model and overlapping piece matching way, stratum level is set at two-stage, and the block size is divided into 8 * 8 and 16 * 16 two kinds.
4. as any described three-dimensional video-frequency stream encoder among the claim 1-3, it is characterized in that it adjusts the transmission channel bandwidth that auxilliary video code flow takies by the residual error image DCT quantization parameter that changes in the disparity-compensated estimation information.
5. a three-dimensional video-frequency stream decoder is characterized in that, comprising:
Demultiplexer is used for the three-dimensional video-frequency code stream is decomposed into main video code flow and auxilliary video code flow;
Main video code flow decoding unit is used for main video code flow is decoded to generate main video flowing according to the MPEG agreement;
Auxilliary video code flow decoding unit, it comprises:
Parallax/motion compensated prediction unit is used for rebuilding according to the disparity-compensated estimation information that comprises in the intracoded frame of main video flowing and MB of prediction frame and the auxilliary video code flow and motion-compensated estimation information the intracoded frame and the I picture predictive frame of auxilliary video flowing;
Frame estimates and interpolation unit, be used for according to corresponding main video flowing bi-directional predicted/disparity-compensated estimation information that intracoded frame in the interpolation frame, auxilliary video flowing and MB of prediction frame and auxilliary video code flow comprise and motion-compensated estimation information rebuilds the bi-directional predicted/interpolation frame in the auxilliary video flowing;
Assist the video flowing reconstruction unit, be used for the intracoded frame of parallax/motion compensated prediction unit reconstruction and the bi-directional predicted/two-way interpolation frame of I picture predictive frame and frame estimation and the single nothing reconstruction of interpolation are sorted to generate auxilliary video flowing according to time order and function.
6. three-dimensional video-frequency stream decoder as claimed in claim 5 is characterized in that, frame estimates to estimate based on the three-dimensional frame of Bayes's minimum cost equation with the interpolation unit employing and interpolating method is rebuild bi-directional predicted/interpolation frame.
7. processing system for video, it is characterized in that, the video camera that comprises left road of picked-up and right wing video flowing, make the synchronous in time time base corrector of video flowing of two video camera outputs, will be multiplexed to form the frame sequential multiplexer of stereo video streaming through the stream of the two-path video after the time base corrector time synchronizing, the computer system and regular display and the three-dimensional display that comprise stereo coder as claimed in claim 1 and three-dimensional video-frequency stream decoder as claimed in claim 5
Wherein, in the time only need transmitting the single channel video flowing, video image is encoded and signal is delivered to transmission channel by the main video flowing coding unit of three-dimensional video-frequency stream encoder, when needs transmitting two paths video flowing, by main video flowing coding unit and auxilliary video flowing coding single do not have respectively left and right sides road video image is encoded and signal is delivered to transmission channel, when the video code flow that receives only comprises one road video flowing, encoding code stream is decoded and decoded signal is delivered to regular display by the main video code flow decoding unit of three-dimensional video-frequency stream decoder, when the code stream that receives comprises left and right sides two-path video stream signal, respectively left and right sides two-way encoding code stream is decoded and decoded signal is delivered to three-dimensional display by main video code flow decoding unit and auxilliary video code flow decoding unit.
CN 03116541 2003-04-22 2003-04-22 Stereo video stream coder/decoder and stereo video coding/decoding system Expired - Fee Related CN1204757C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 03116541 CN1204757C (en) 2003-04-22 2003-04-22 Stereo video stream coder/decoder and stereo video coding/decoding system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 03116541 CN1204757C (en) 2003-04-22 2003-04-22 Stereo video stream coder/decoder and stereo video coding/decoding system

Publications (2)

Publication Number Publication Date
CN1450816A CN1450816A (en) 2003-10-22
CN1204757C true CN1204757C (en) 2005-06-01

Family

ID=28684196

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 03116541 Expired - Fee Related CN1204757C (en) 2003-04-22 2003-04-22 Stereo video stream coder/decoder and stereo video coding/decoding system

Country Status (1)

Country Link
CN (1) CN1204757C (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101361371B (en) * 2006-01-05 2010-11-03 日本电信电话株式会社 Video encoding method, decoding method, device thereof, program thereof, and storage medium containing the program

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100524077B1 (en) * 2003-11-13 2005-10-26 삼성전자주식회사 Apparatus and method of temporal smoothing for intermediate image generation
EP1727090A1 (en) 2004-02-27 2006-11-29 Tdvision Corporation S.A. DE C.V. Method and system for digital decoding 3d stereoscopic video images
US8369406B2 (en) 2005-07-18 2013-02-05 Electronics And Telecommunications Research Institute Apparatus of predictive coding/decoding using view-temporal reference picture buffers and method using the same
KR100678911B1 (en) * 2005-07-21 2007-02-05 삼성전자주식회사 Method and apparatus for video signal encoding and decoding with extending directional intra prediction
KR101245251B1 (en) * 2006-03-09 2013-03-19 삼성전자주식회사 Method and apparatus for encoding and decoding multi-view video to provide uniform video quality
WO2007110000A1 (en) * 2006-03-29 2007-10-04 Huawei Technologies Co., Ltd. A method and device of obtaining disparity vector and its multi-view encoding-decoding
US8970680B2 (en) * 2006-08-01 2015-03-03 Qualcomm Incorporated Real-time capturing and generating stereo images and videos with a monoscopic low power mobile device
KR101023262B1 (en) * 2006-09-20 2011-03-21 니폰덴신뎅와 가부시키가이샤 Image encoding method, decoding method, device thereof, program thereof, and storage medium containing the program
US8274551B2 (en) 2007-06-11 2012-09-25 Samsung Electronics Co., Ltd. Method and apparatus for generating header information of stereoscopic image data
CN101415114B (en) * 2007-10-17 2010-08-25 华为终端有限公司 Method and apparatus for encoding and decoding video, and video encoder and decoder
CN101420609B (en) * 2007-10-24 2010-08-25 华为终端有限公司 Video encoding, decoding method and video encoder, decoder
CN101609652B (en) * 2008-06-17 2012-12-19 联咏科技股份有限公司 Transmission interface and method for reducing power consumption and electromagnetic interference effect
CN101616322A (en) * 2008-06-24 2009-12-30 深圳华为通信技术有限公司 Stereo video coding-decoding method, Apparatus and system
EP2334088A1 (en) * 2009-12-14 2011-06-15 Koninklijke Philips Electronics N.V. Generating a 3D video signal
EP2355510A1 (en) * 2009-12-21 2011-08-10 Alcatel Lucent Method and arrangement for video coding
CN102195894B (en) * 2010-03-12 2015-11-25 腾讯科技(深圳)有限公司 The system and method for three-dimensional video-frequency communication is realized in instant messaging
CN103190152B (en) * 2010-10-26 2016-04-27 韩国放送公社 For the hierarchical broadcast system and method for three-dimensional broadcast
US8786674B2 (en) 2010-11-26 2014-07-22 Mediatek Singapore Pte. Ltd. Method for performing video display control within a video display system, and associated video processing circuit and video display system
CN104717484B (en) * 2010-11-26 2017-06-09 联发科技(新加坡)私人有限公司 Carry out method, video processing circuits and the video display system of video display control
CN102055984B (en) * 2011-01-27 2012-10-03 山东大学 Three-dimensional video decoding structure for smoothly switching 2D and 3D play modes and operating method
CN102625097B (en) * 2011-01-31 2014-11-05 北京大学 Method for intra-frame prediction of three-dimensional video and coding and decoding methods
CN102118602B (en) * 2011-03-15 2013-08-21 深圳市捷视飞通科技有限公司 Method and system for displaying auxiliary streaming video in multiple pictures
CN102196291A (en) * 2011-05-20 2011-09-21 四川长虹电器股份有限公司 Method for coding binocular stereo video
CN102244801A (en) * 2011-07-13 2011-11-16 中国民航大学 Digital stereoscopic television system and coding and decoding methods
CN102271270A (en) * 2011-08-15 2011-12-07 清华大学 Method and device for splicing binocular stereo video
CN102438147B (en) * 2011-12-23 2013-08-07 上海交通大学 Intra-frame synchronous stereo video multi-reference frame mode inter-view predictive coding and decoding method
CN103379351B (en) * 2012-04-28 2016-03-02 中国移动通信集团山东有限公司 A kind of method for processing video frequency and device
US10567765B2 (en) * 2014-01-15 2020-02-18 Avigilon Corporation Streaming multiple encodings with virtual stream identifiers
US20170026659A1 (en) * 2015-10-13 2017-01-26 Mediatek Inc. Partial Decoding For Arbitrary View Angle And Line Buffer Reduction For Virtual Reality Video
US20180098090A1 (en) * 2016-10-04 2018-04-05 Mediatek Inc. Method and Apparatus for Rearranging VR Video Format and Constrained Encoding Parameters
CN107071385B (en) * 2017-04-18 2019-01-25 杭州派尼澳电子科技有限公司 A kind of method for encoding stereo video introducing parallax compensation based on H265
CN107295214B (en) * 2017-08-09 2019-12-03 湖南兴天电子科技有限公司 Interpolated frame localization method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101361371B (en) * 2006-01-05 2010-11-03 日本电信电话株式会社 Video encoding method, decoding method, device thereof, program thereof, and storage medium containing the program

Also Published As

Publication number Publication date
CN1450816A (en) 2003-10-22

Similar Documents

Publication Publication Date Title
CN1204757C (en) Stereo video stream coder/decoder and stereo video coding/decoding system
US8953898B2 (en) Image processing apparatus and method
US5619256A (en) Digital 3D/stereoscopic video compression technique utilizing disparity and motion compensated predictions
US5612735A (en) Digital 3D/stereoscopic video compression technique utilizing two disparity estimates
US8644386B2 (en) Method of estimating disparity vector, and method and apparatus for encoding and decoding multi-view moving picture using the disparity vector estimation method
CN100473157C (en) System and method for internet broadcasting of mpeg-4-based stereoscopic video
US6055012A (en) Digital multi-view video compression with complexity and compatibility constraints
KR102343700B1 (en) Video transmission based on independently encoded background updates
US20140313291A1 (en) Video coding method, video decoding method, video coder, and video decoder
US20070041443A1 (en) Method and apparatus for encoding multiview video
US20090190662A1 (en) Method and apparatus for encoding and decoding multiview video
EP1927250A1 (en) Method of estimating disparity vector, and method and apparatus for encoding and decoding multi-view moving picture using the disparity vector estimation method
US20110134227A1 (en) Methods and apparatuses for encoding, decoding, and displaying a stereoscopic 3d image
Lim et al. A multiview sequence CODEC with view scalability
US20100002764A1 (en) Method For Encoding An Extended-Channel Video Data Subset Of A Stereoscopic Video Data Set, And A Stereo Video Encoding Apparatus For Implementing The Same
CN101867816A (en) Stereoscopic video asymmetric compression coding method based on human-eye visual characteristic
US20150312547A1 (en) Apparatus and method for generating and rebuilding a video stream
Hewage et al. Comparison of stereo video coding support in MPEG-4 MAC, H. 264/AVC and H. 264/SVC
CN109451293B (en) Self-adaptive stereoscopic video transmission system and method
Hewage et al. Stereoscopic tv over ip
Yip et al. Joint source and channel coding for H. 264 compliant stereoscopic video transmission
KR100566100B1 (en) Apparatus for adaptive multiplexing/demultiplexing for 3D multiview video processing and its method
Willner et al. Mobile 3D video using MVC and N800 internet tablet
KR101233161B1 (en) Method for transmission and reception of 3-dimensional moving picture in DMB mobile terminal
Mallik et al. HEVC based Stereo Video codec

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20050601

Termination date: 20150422

EXPY Termination of patent right or utility model