CN102137259B

CN102137259B - Method and device for coding two paths of three-dimensional videos

Info

Publication number: CN102137259B
Application number: CN 201110091938
Authority: CN
Inventors: 季向阳; 马茜; 戴琼海
Original assignee: Tsinghua University
Current assignee: Tsinghua University
Priority date: 2011-04-13
Filing date: 2011-04-13
Publication date: 2013-03-27
Anticipated expiration: 2031-04-13
Also published as: CN102137259A

Abstract

The invention provides a method and device for coding two paths of three-dimensional videos, wherein the method comprises the following steps of: splicing two paths of three-dimensional videos to obtain a binary syllabification sequence; in the term of the two paths of three-dimensional videos, carrying out rate distortion optimization coding on the binary syllabification sequence; and reconfiguring the coded binary syllabification sequence to obtain a coding result of each path of video. Through changing a distortion measurement selected by a macro block mode, the rate distortion performance of coding the two paths of three-dimensional videos is improved, and an optimal image coding effect is obtained.

Description

The coding method of two paths of three-dimensional videos and device

Technical field

The present invention relates to the three-dimensional video-frequency processing technology field, particularly a kind of coding method of two paths of three-dimensional videos and device.

Background technology

In recent years, along with popularizing that 3 D stereo is used, the research of stereoscopic image and video begins to become focus, and the application relevant with image and video also constantly widened, such as digital television broadcasting, video request program, long-distance education and medical treatment, wireless multimedia communication etc.Because the data volume of original image and video is very large, and bandwidth can not unconfinedly increase, and finishes the image transmitting of big data quantity in order to utilize limited bandwidth, needs to use the effective video compress technique.

A kind of double vision point splicing of three-dimensional video-frequency has been proposed at present.Compare two-path video and transmit respectively, the encoding scheme of three-dimensional video-frequency splicing only need be transmitted half or data still less, therefore can reduce transmission bandwidth and reduce decoder complexity.In addition, spliced image can utilize traditional single channel encoder directly to compress, and is compatible strong.

The shortcoming of the coding method of existing Two bors d's oeuveres three-dimensional video-frequency is that the framework that adopts traditional single view to encode is processed, and does not consider the characteristics of the Two bors d's oeuveres attribute of signal source, therefore can't obtain optimum encoding efficiency.

Summary of the invention

Purpose of the present invention is intended to solve at least one of above-mentioned technological deficiency.

For achieving the above object, one aspect of the present invention proposes a kind of coding method of two paths of three-dimensional videos, may further comprise the steps: A: two paths of three-dimensional videos is spliced to obtain the Two bors d's oeuveres sequence; B: take described two paths of three-dimensional videos as reference, described Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding; And C: the coding result that the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain each road video.

In one embodiment of the invention, described steps A further comprises: respectively described two-way video is carried out every row ground or the underground sampling of interlacing; And the two-path video behind the described down-sampling spliced to obtain the Two bors d's oeuveres sequence.

In one embodiment of the invention, described step B further comprises: B1: the macro block in the described Two bors d's oeuveres sequence is carried out up-sampling; B2: obtain the correspondence image zone of macro block in described two paths of three-dimensional videos behind the described up-sampling; B3: the coding distortion between the macro block behind the calculating up-sampling and the correspondence image zone of described two paths of three-dimensional videos; And B4: according to described coding distortion, determine the coding mode of described macro block.

In one embodiment of the invention, described macro block is being carried out in the process of up-sampling, keeping the width of described macro block or highly constant.

In one embodiment of the invention, according to described coding distortion and formula min{J (λ)=D (x)+λ R (x) }, determine the coding mode of described macro block, wherein, D (x) is described coding distortion, and R (x) is code check, and λ is Lagrange factor.

The present invention also proposes a kind of code device of two paths of three-dimensional videos on the other hand, comprising: concatenation module is used for two paths of three-dimensional videos is spliced to obtain the Two bors d's oeuveres sequence; Coding module is used for take described two paths of three-dimensional videos as reference, and described Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding; And reconstructed module, for the coding result that the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain each road video.

In one embodiment of the invention, described concatenation module further comprises: downsampling unit is used for respectively described two-way video being carried out every row ground or the underground sampling of interlacing; And concatenation unit, be used for the two-path video behind the described down-sampling is spliced to obtain the Two bors d's oeuveres sequence.

In one embodiment of the invention, described coding module further comprises: the up-sampling unit is used for the macro block of described Two bors d's oeuveres sequence is carried out up-sampling; Search unit is used for obtaining macro block behind the described up-sampling in the correspondence image zone of described two paths of three-dimensional videos; And computing unit, be used for to calculate the coding distortion between the correspondence image zone of macro block behind the up-sampling and described two paths of three-dimensional videos, and determine the coding mode of described macro block according to described coding distortion.

The present invention has improved the distortion performance of two paths of three-dimensional videos coding by changing the distortion metrics of Macroblock Mode Selection, obtains more excellent Image Coding effect.

The aspect that the present invention adds and advantage in the following description part provide, and part will become obviously from the following description, or recognize by practice of the present invention.

Description of drawings

Above-mentioned and/or the additional aspect of the present invention and advantage are from obviously and easily understanding becoming the description of embodiment below in conjunction with accompanying drawing, wherein:

Fig. 1 is the flow chart of coding method of the two paths of three-dimensional videos of the embodiment of the invention;

Fig. 2 is the flow chart of the rate-distortion optimization algorithm of the embodiment of the invention;

Fig. 3 is the schematic diagram of code device of the two paths of three-dimensional videos of the embodiment of the invention;

Fig. 4 is the structural representation of the concatenation module of one embodiment of the invention; And

Fig. 5 is the structural representation of the coding module of one embodiment of the invention.

Embodiment

The below describes embodiments of the invention in detail, and the example of described embodiment is shown in the drawings, and wherein identical or similar label represents identical or similar element or the element with identical or similar functions from start to finish.Be exemplary below by the embodiment that is described with reference to the drawings, only be used for explaining the present invention, and can not be interpreted as limitation of the present invention.

Be illustrated in figure 1 as the flow chart of coding method of the two paths of three-dimensional videos of the embodiment of the invention, the method may further comprise the steps:

Step S101 splices to obtain the Two bors d's oeuveres sequence to two paths of three-dimensional videos.

Particularly, at first two paths of three-dimensional videos is carried out respectively interlacing ground or every the underground sampling of row, for example, carries out respectively the down-sampling of odd column or even number line sampling; Then the two-path video sequence behind the down-sampling is spliced, form the Two bors d's oeuveres sequence.

Step S102 take two paths of three-dimensional videos as reference, carries out the rate-distortion optimization coding to the Two bors d's oeuveres sequence.

Under traditional hybrid encoding frame, if will under the condition of minimum distortion, keep code check R to be no more than maximal rate R _Max, then need to select best coding parameter to reach best picture quality.

Rate-distortion optimization algorithm in the embodiment of the invention considers that the up-sampling reconstruct (introducing after a while) of the down-sampling of pretreatment stage (being step S101) and post-processed on the impact of video coding performance, improves traditional encryption algorithm in step S103.

Be illustrated in figure 2 as the flow chart of the rate-distortion optimization algorithm of the embodiment of the invention, may further comprise the steps particularly:

Step S201 carries out up-sampling to the macro block in the Two bors d's oeuveres sequence.

Particularly, in the up-sampling process, keep the height of macro block constant, double width or keep width constant, highly double, for example the macro block up-sampling with 8*8 is 16*8 or 8*16.

Step S202 obtains the correspondence image zone of macro block in two paths of three-dimensional videos behind the up-sampling.

The coordinate (x, y) of reference macroblock top left corner pixel is if the maintenance width is constant in the up-sampling process, highly become twice, then its corresponding original graph the position of image (x ₀, y ₀) be:

Wherein, w is the width of video sequence;

If in the up-sampling process, keep highly constant, width change twice, then its corresponding original graph the position of image (x ₀, y ₀) be:

Wherein, h is the height of video sequence.

Step S203, the coding distortion D (x) between the macro block behind the calculating up-sampling and the correspondence image zone of two paths of three-dimensional videos.

In this step, calculate behind the up-sampling macro block and the coding distortion between the correspondence image zone in original left view or the right view, concrete computational process is identical with the computational methods of prior art, herein for simplicity, repeats no more.

Step S204 according to coding distortion D (x), determines the coding mode of macro block.

In coding distortion D (x) substitution rate distortion function J (λ)=D (x)+λ R (x), carry out lagrangian optimization, thereby determine the pattern of macro block, wherein, λ is Lagrange factor.

This mode of sampling, the problem of model selection just is converted in the span of λ, and the Lagrangian optimization of sampling finds and satisfies minimum (R, the D) point of J (λ).

Then, according to the coding mode of determining by above-mentioned rate-distortion optimization algorithm the Two bors d's oeuveres sequence is encoded.

Step S103 is reconstructed to obtain the coding result of each road video to the Two bors d's oeuveres sequence of having encoded.

Sequence behind the coding is split into two-way down-sampling reproducing sequence, then, by with step S102 in identical top sampling method, be original resolution with the down-sampling reproducing sequence interpolation of two-way, then produce final left and right sides view coding.

For realizing above-described embodiment, the present invention also proposes a kind of code device of two paths of three-dimensional videos.Be illustrated in figure 3 as the structural representation of code device of the two paths of three-dimensional videos of the embodiment of the invention, this code device comprises: concatenation module 100, coding module 200 and reconstructed module 300.

Wherein, concatenation module 100 is used for two paths of three-dimensional videos is spliced to obtain the Two bors d's oeuveres sequence.Coding module 200 is used for take two paths of three-dimensional videos as reference, and the Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding.Reconstructed module 300 is for the coding result that the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain each road video.

Particularly, as shown in Figure 4, concatenation module 100 can comprise downsampling unit 110 and concatenation unit 120.Downsampling unit 110 is used for respectively every road of two-way video being carried out every row ground or the underground sampling of interlacing.Concatenation unit 120 is used for the two-path video behind the down-sampling is spliced to obtain the Two bors d's oeuveres sequence.

As shown in Figure 5, coding module 200 can comprise up-sampling unit 210, search unit 220 and computing unit 230.Up-sampling unit 210 is used for the macro block of Two bors d's oeuveres sequence is carried out up-sampling.Search unit 220 is used for obtaining macro block behind the up-sampling in the correspondence image zone of two paths of three-dimensional videos.Computing unit 230 is used for calculating the coding distortion between the correspondence image zone of macro block behind the up-sampling and two paths of three-dimensional videos, and determines the coding mode of macro block according to described coding distortion.

The specific works process of each module and unit can be identical with the description in the said method, herein for simplicity, repeats no more.

Although illustrated and described embodiments of the invention, for the ordinary skill in the art, be appreciated that without departing from the principles and spirit of the present invention and can carry out multiple variation, modification, replacement and modification to these embodiment that scope of the present invention is by claims and be equal to and limit.

Claims

1. the coding method of a two paths of three-dimensional videos is characterized in that, may further comprise the steps:

A: two paths of three-dimensional videos is spliced to obtain the Two bors d's oeuveres sequence;

B: take described two paths of three-dimensional videos as reference, described Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding; And

C: the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain the coding result of each road video,

Wherein, described step B further comprises:

B1: the macro block in the described Two bors d's oeuveres sequence is carried out up-sampling;

B2: obtain the correspondence image zone of macro block in described two paths of three-dimensional videos behind the described up-sampling;

B3: the coding distortion between the macro block behind the calculating up-sampling and the correspondence image zone of described two paths of three-dimensional videos; And

B4: according to described coding distortion and following formula, determine the coding mode of described macro block,

min{J(λ)=D(x)+λR(x)}

Wherein, D (x) is described coding distortion, and R (x) is code check, and λ is Lagrange factor.

2. method according to claim 1 is characterized in that, described steps A further comprises:

Respectively described two paths of three-dimensional videos is carried out every row ground or the underground sampling of interlacing; And

Two-way three-dimensional video-frequency behind the described down-sampling is spliced to obtain the Two bors d's oeuveres sequence.

3. method according to claim 1 is characterized in that, wherein, described macro block is being carried out in the process of up-sampling, keeps the width of described macro block or highly constant.

4. the code device of a two paths of three-dimensional videos is characterized in that, comprising:

Concatenation module is used for two paths of three-dimensional videos is spliced to obtain the Two bors d's oeuveres sequence;

Coding module is used for take described two paths of three-dimensional videos as reference, and described Two bors d's oeuveres sequence is carried out the rate-distortion optimization coding; And

Reconstructed module, for the coding result that the Two bors d's oeuveres sequence of having encoded is reconstructed to obtain each road video,

Wherein, described coding module further comprises:

The up-sampling unit is used for the macro block of described Two bors d's oeuveres sequence is carried out up-sampling;

Search unit is used for obtaining macro block behind the described up-sampling in the correspondence image zone of described two paths of three-dimensional videos; With

Computing unit be used for to calculate the coding distortion between the correspondence image zone of macro block behind the up-sampling and described two paths of three-dimensional videos, and determines the coding mode of described macro block according to described coding distortion and following formula,

min{J(λ)=D(x)+λR(x)}

5. device according to claim 4 is characterized in that, described concatenation module further comprises:

Downsampling unit is used for respectively described two paths of three-dimensional videos being carried out every row ground or the underground sampling of interlacing; With

Concatenation unit is used for the two-way three-dimensional video-frequency behind the described down-sampling is spliced to obtain the Two bors d's oeuveres sequence.

6. device according to claim 4 is characterized in that, described up-sampling unit is carrying out in the process of up-sampling the macro block in the described Two bors d's oeuveres sequence, keeps the height of described macro block or width constant.