CN109327703A

CN109327703A - A kind of depth map decoding method based on different sampling blocks

Info

Publication number: CN109327703A
Application number: CN201811349423.9A
Authority: CN
Inventors: 高凯; 朴燕; 孙荣春; 王宇; 臧景峰
Original assignee: Changchun University of Science and Technology
Current assignee: Changchun University of Science and Technology
Priority date: 2018-11-13
Filing date: 2018-11-13
Publication date: 2019-02-12

Abstract

The present invention relates to a kind of depth map decoding methods based on different sampling blocks, belong to the depth map encoding technical field in 3 D stereo video coding standard.End is coded and decoded including multiresolution macro block mode in edge detection, macroblock partition, Intra prediction mode selection, frame, the code stream come will be decoded and carry out block reconstruct, reconstructed blocks are carried out by up-sampling treatment according to coding rule.It only includes this feature of object profile information that advantage, which is for depth map, it is proposed a kind of depth map encoding method based on different sampling blocks, by carrying out adaptive geometric division to the depth macro block comprising discontinuous fluid field, obtain preferable prediction result, to improve the efficiency of depth map encoding, while reducing the complexity of depth map encoding.

Description

A kind of depth map decoding method based on different sampling blocks

Technical field

The invention belongs to the depth map encoding technical fields in 3 D stereo video coding standard, are related to a kind of raising depth The decoding method of graph code efficiency.

Background technique

3 D stereo video refers to that user is shown by 3 D stereo video and sets as the following main video applications It is standby to enjoy true 3 D stereo video content.The relevant technologies of 3 D video, for example, the acquisition of 3 D stereo video, The technologies such as the display of 3 D stereo video coding, 3 D stereo video have received extensive attention.Wherein, depth information is by coloured silk The depth value composition of location of pixels in chromatic graph picture, therefore depth sequence compaction is usually regarded as being greyscale video to be pressed Contracting.In 3D video system and the application of free viewpoint video system, the transmission of data be by a series of color texture video and What geological information was constituted, and the geological information of video is reflected by depth map, in 3D video system, depth map be by Applied to drafting virtual view, therefore the problem of depth sequential coding, research is to be different from traditional 2D color video encoding 's.In three-dimensional video-frequency, video information corresponding to the viewpoint of viewing it is expected in order to obtain user by virtual viewpoint rendering, I Need to obtain depth map corresponding to each video frame.In order to effectively store and transmit the stereo video data of magnanimity, Three-dimensional video-frequency system generallys use multi-view point video encoding method and compresses respectively to multi-view point video and depth map sequence.It is logical It crosses in viewpoint and interview prediction encodes, multi-view point video encoding method can effectively compress superfluous between multichannel depth map It is remaining.

Why important depth information is, because the efficiency of depth information compressed encoding not only directly influences whole system Compression efficiency, and the quality of decoded depth map also directly influences the subjective and objective matter of whole system terminal video picture Amount.

Traditional multi-view point video encoding method is based on a kind of multiple view video coding on H.264/MVC coding standard Expansion scheme.In traditional multiple view video coding, encoder encodes each frame image using macro block as unit.It is each A 16 × 16 small macro blocks can be further divided into 16 × 8,8 × 16 and 8 × 8 sub-blocks and 8 × 4,4 × 8 and 4 × 4 Sub- block.Different sub-blocks and sub- block are referred to as predicting unit.In an encoding process, encoder transports each predicting unit Dynamic estimation, obtains the rate distortion costs of motion compensated prediction.Then the model selection based on rate-distortion optimization is carried out, is obtained each The optimal coding mode of block and prediction residual, and transition coding is carried out to residual error.

It is different from traditional color video, depth map only includes the range information of each pixel, does not include any video Texture information.Therefore, the coded macroblocks in depth map in interior of articles only includes uniform depth texture, is in object edge Coded macroblocks then include two or more discontinuous depth areas.Traditional macro block division can not effectively indicate object edge Edge.Especially under conditions of low encoder bit rate, by dividing obtained sub-block and sub- block mode is less is selected.However, adopting When with traditional multi-view point video encoding method to depth map encoding, there is still a need for transport all macro block modes with encoder Dynamic estimation and mode decision.The process needs to consume a large amount of computing resource, increases the complexity of coding side.

But the component characteristic of depth sequence is different from color texture video, and depth map has structure single, the hierarchy of objectivies Between distinctness and target different levels the features such as clear-cut margin, smooth region and apparent object edge are interlaced out It is existing, the surface of target object is defined according to the distance of camera distance scene objects, and edge then represents the boundary of target, it can Obviously to distinguish target and background.In fact, the presence of these discontinuity zones makes traditional Video Coding Scheme no longer so Efficiently, because traditional Video Coding Scheme is usually for handling low pass vision signal, if using traditional Video coding Scheme encodes depth sequence, then can in depth sequence depth map image locus of discontinuity and smooth area generate it is apparent Blocky artifact effect will lead to the quality decline of decoded depth sequence, not additionally, due to depth sequence after decoding in this way It is to be directly used in display, but be used to carry out the drafting of virtual view, therefore the depth map of quality decline will lead to and synthesize The position of viewpoint is distorted, and influences the subjective and objective effect of virtual view, therefore the academia that is collapsed into of depth sequence is studied One of main problem.

Summary of the invention

The present invention provides a kind of depth map decoding method based on different sampling blocks, in order to obtain higher depth The more efficient coding method of the characteristics of spending figure compression efficiency, needing for depth map design.

The technical solution adopted by the present invention is that, including the following steps:

1) macroblock partition: to currently H.264/MVC coding standard, reading in depth map to be encoded, uses Sobel operator first It carries out edge detection and obtains edge image；Secondly 16 × 16 segmentations are carried out to segmentation block, if texture-free edge in 16 × 16 pieces, Down-sampled is 4 × 4 pieces, such as 16 × 16 pieces of textured edges, is divided further into 48 × 8 pieces；Texture-free side in such as 8 × 8 pieces Edge, down-sampled is 4 × 4 pieces, such as textured edge, is divided further into 44 × 4 pieces；

2) Intra prediction mode selection:

(1) current 4 × 4 pieces are calculated and rebuilds the squared difference and (SSD, Sum of Suqared between 4 × 4 pieces Difference) and coding bit rate；

(2) if 4 × 4 pieces are texture-free piece, using DC prediction mode；

(3) if 4 × 4 pieces is, containing texture block, H.264/MVC middle formula (1) calculate 9 kinds of intra prediction modes using original Rate distortion value is calculated；

J(s,c,m|QP,λ_m)=SSD (s, c, m | QP)+λ_mR(s,c,m|QP) (1)

In formula, QP is the quantization parameter of macro block, λ_mIt is Lagrange's multiplier, related with QP, SSD () is original brightness block s Squared difference between reconstructed block c and, R () is the bit number encoded using mode m；

(4) select the mode with the smallest rate distortion value as best 4 × 4 intra prediction mode；

3) intra-frame macro block pattern-coding:

16 × 16 directly it is down-sampled be that 4 × 4 marks are denoted as 01；

8 × 8 directly it is down-sampled be that 4 × 4 marks are denoted as 10；

8 × 8, which are divided into 44 × 4 marks, is denoted as 11；

The gray value of current block pixel is subtracted each other with predicted value, obtains the difference of present encoding block, difference is carried out Discrete cosine transform, quantization and entropy coding；

4) decoding end will decode the code stream come and carry out block reconstruct, and carries out up-sampling treatment.

The invention has the advantages that only including this feature of object profile information for depth map, propose a kind of based on different samplings The depth map encoding method of block is obtained by carrying out adaptive geometric division to the depth macro block comprising discontinuous fluid field Preferable prediction result to improve the efficiency of depth map encoding, while reducing the complexity of depth map encoding.

Detailed description of the invention

Fig. 1 is the flow chart that the present invention is used in depth map encoding.

Specific embodiment

Include the following steps:

1) edge detection: to currently H.264/MVC coding standard, reading in depth map to be encoded, uses Sobel operator first It carries out edge detection and obtains edge image；

2) macroblock partition: be split depth image for edge whether is contained in edge image segmentation block, first 16 × 16 segmentations are first carried out, if texture-free edge in 16 × 16 edge image blocks, down-sampled to 16 × 16 depth image blocks is 4 16 × 16 depth image blocks are divided further into 48 × 8 pieces by × 4 pieces, such as textured edge；As in 8 × 8 edge image blocks Texture-free edge, by 8 × 8 depth image blocks it is down-sampled be 4 × 4 pieces, such as textured edge, further 8 × 8 depth image block point It is segmented into 44 × 4 pieces；

3) Intra prediction mode selection:

(2) if 4 × 4 pieces are texture-free piece, using DC prediction mode；

(3) the rate distortion value of 9 kinds of intra prediction modes is calculated according to formula (1) respectively

J(s,c,m|QP,λ_m)=SSD (s, c, m | QP)+λ_mR(s,c,m|QP) (1)

4) multiresolution macro block mode encodes in frame:

16 × 16 directly it is down-sampled be that 4 × 4 marks are denoted as 01；

8 × 8 directly it is down-sampled be that 4 × 4 marks are denoted as 10；

8 × 8, which are divided into 44 × 4 marks, is denoted as 11；

5) decoding end will decode the code stream come and carry out block reconstruct, carried out reconstructed blocks at up-sampling according to coding rule Reason.

Claims

1. a kind of depth map decoding method based on different sampling blocks, characterized in that it comprises the following steps:

1) edge detection；

2) macroblock partition；

3) Intra prediction mode selection；

4) multiresolution macro block mode encodes in frame；

5) decoding end will decode the code stream come and carry out block reconstruct, reconstructed blocks are carried out up-sampling treatment according to coding rule.

2. a kind of depth map decoding method based on different sampling blocks according to claim 1, it is characterised in that: step 1) in, to currently H.264/MVC coding standard, depth map to be encoded is read in, edge detection is carried out using Sobel operator first and obtains To edge image.

3. a kind of depth map decoding method based on different sampling blocks according to claim 2, it is characterised in that: step 2) in, be split depth image for edge whether is contained in edge image segmentation block, carry out 16 × 16 points first It cuts, if texture-free edge in 16 × 16 edge image blocks, down-sampled to 16 × 16 depth image blocks is 4 × 4 pieces, such as textured 16 × 16 depth image blocks are divided further into 48 × 8 pieces by edge；Texture-free edge in such as 8 × 8 edge image blocks, by 8 Down-sampled × 8 depth image blocks are 4 × 4 pieces, and such as textured edge, further 8 × 8 depth image block are divided into 44 × 4 pieces.

4. a kind of depth map decoding method based on different sampling blocks according to claim 2, it is characterised in that: step 3) in, selection method is as follows:

(2) if 4 × 4 pieces are texture-free piece, using DC prediction mode；

J(s,c,m|QP,λ_m)=SSD (s, c, m | QP)+λ_mR(s,c,m|QP) (1)

In formula, QP is the quantization parameter of macro block, λ_mLagrange's multiplier, it is related with QP, SSD () be original brightness block s with again Build squared difference between block c and, R () is the bit number encoded using mode m；

(4) select the mode with the smallest rate distortion value as best 4 × 4 intra prediction mode.

5. a kind of depth map decoding method based on different sampling blocks according to claim 2, it is characterised in that: step 4) in, coding method is as follows:

16 × 16 directly it is down-sampled be that 4 × 4 marks are denoted as 01；

8 × 8 directly it is down-sampled be that 4 × 4 marks are denoted as 10；

8 × 8, which are divided into 44 × 4 marks, is denoted as 11；

The gray value of current block pixel is subtracted each other with predicted value, obtains the difference of present encoding block, difference is carried out discrete Cosine transform, quantization and entropy coding.