CN109327703A - Depth map encoding and decoding method based on different sampling blocks - Google Patents

Depth map encoding and decoding method based on different sampling blocks

Info

Publication number
CN109327703A
Authority
CN
China
Prior art keywords
depth map
edge
block
pieces
depth
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811349423.9A
Other languages
Chinese (zh)
Inventor
高凯
朴燕
孙荣春
王宇
臧景峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changchun University of Science and Technology
Original Assignee
Changchun University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changchun University of Science and Technology filed Critical Changchun University of Science and Technology
Priority to CN201811349423.9A priority Critical patent/CN109327703A/en
Publication of CN109327703A publication Critical patent/CN109327703A/en
Pending legal-status Critical Current

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103 Selection of coding mode or of prediction mode
    • H04N19/11 Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
    • H04N19/132 Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146 Data rate or code amount at the encoder output
    • H04N19/147 Data rate or code amount at the encoder output according to rate distortion criteria
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/597 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding

Abstract

The present invention relates to a depth map encoding and decoding method based on different sampling blocks, and belongs to the technical field of depth map coding in 3D stereoscopic video coding standards. The method comprises edge detection, macroblock partitioning, intra prediction mode selection, and intra multi-resolution macroblock mode coding; at the decoding end, blocks are reconstructed from the decoded bitstream and the reconstructed blocks are up-sampled according to the coding rule. The advantage is that, exploiting the fact that a depth map contains only object contour information, a depth map coding method based on different sampling blocks is proposed: by adaptively partitioning the depth macroblocks that contain discontinuous regions, a better prediction result is obtained, which improves the efficiency of depth map coding while reducing its complexity.

Description

Depth map encoding and decoding method based on different sampling blocks
Technical field
The invention belongs to the technical field of depth map coding in 3D stereoscopic video coding standards, and relates to an encoding and decoding method that improves depth map coding efficiency.
Background
3D stereoscopic video is expected to become a major video application, in which users enjoy realistic 3D content through 3D stereoscopic display devices. The related technologies, such as the acquisition, coding, and display of 3D stereoscopic video, have received extensive attention. Depth information consists of the depth values at the pixel positions of the color image, so depth sequence compression is usually treated as the compression of a grayscale video. In 3D video systems and free-viewpoint video systems, the transmitted data consists of a series of color texture videos and geometric information, and the geometric information is carried by depth maps. In a 3D video system, depth maps are used to render virtual views, so the depth sequence coding problem differs from traditional 2D color video coding. In stereoscopic video, obtaining the video for the viewpoint a user wishes to watch through virtual view rendering requires a depth map for every video frame. To store and transmit the massive amount of stereoscopic video data efficiently, stereoscopic video systems generally use multi-view video coding to compress the multi-view video and the depth map sequences separately. Through intra-view and inter-view predictive coding, multi-view video coding can effectively remove the redundancy between the multiple depth map channels.
Depth information is important because the efficiency of depth compression directly affects the compression efficiency of the whole system, and the quality of the decoded depth map directly affects the subjective and objective quality of the video displayed at the terminal.
Traditional multi-view video coding is a multi-view extension built on the H.264/MVC coding standard. In traditional multi-view video coding, the encoder encodes each frame in units of macroblocks. Each 16 × 16 macroblock can be further divided into 16 × 8, 8 × 16, and 8 × 8 sub-blocks, and into 8 × 4, 4 × 8, and 4 × 4 sub-sub-blocks. These sub-blocks and sub-sub-blocks are collectively called prediction units. During encoding, the encoder performs motion estimation on each prediction unit to obtain the rate-distortion cost of motion-compensated prediction. It then performs rate-distortion-optimized mode selection to obtain the optimal coding mode and prediction residual of each block, and transform-codes the residual.
Unlike traditional color video, a depth map contains only the distance information of each pixel and no video texture information. A coded macroblock inside an object therefore contains only uniform depth values, whereas a coded macroblock on an object edge contains two or more discontinuous depth regions. Traditional macroblock partitioning cannot represent object edges efficiently. In particular, at low bit rates the sub-block and sub-sub-block modes obtained by partitioning are rarely selected. Nevertheless, when a depth map is coded with traditional multi-view video coding, the encoder must still perform motion estimation and mode decision for all macroblock modes. This process consumes a large amount of computing resources and increases the complexity of the encoder.
Moreover, the characteristics of depth sequences differ from those of color texture video. A depth map has a simple structure, clearly separated object layers, and sharp edges between layers, and smooth regions alternate with pronounced object edges. The smooth regions describe the surface of a target object according to the distance from the camera to the scene, while the edges represent the boundary of the target and clearly separate it from the background. These discontinuous regions make traditional video coding schemes less efficient, because such schemes are generally designed for low-pass video signals. If a traditional scheme is used to code a depth sequence, visible blocking artifacts appear at the discontinuities and in the smooth regions of the depth images, which lowers the quality of the decoded depth sequence. In addition, the decoded depth sequence is not displayed directly but is used to render virtual views, so a degraded depth map distorts positions in the synthesized view and harms the subjective and objective quality of the virtual view. Depth sequence compression has therefore become one of the main problems studied in academia.
Summary of the invention
The present invention provides a depth map encoding and decoding method based on different sampling blocks. Obtaining higher depth map compression efficiency requires a more efficient coding method designed around the characteristics of depth maps.
The technical solution adopted by the present invention comprises the following steps:
1) Macroblock partitioning: under the current H.264/MVC coding standard, read in the depth map to be encoded and first perform edge detection with the Sobel operator to obtain an edge image. Next, partition into 16 × 16 blocks. If a 16 × 16 block contains no texture edge, down-sample it to a 4 × 4 block; if it contains a texture edge, divide it further into four 8 × 8 blocks. If an 8 × 8 block contains no texture edge, down-sample it to a 4 × 4 block; if it contains a texture edge, divide it further into four 4 × 4 blocks;
2) Intra prediction mode selection:
(1) for the current 4 × 4 block, compute the sum of squared differences (SSD) between it and the reconstructed 4 × 4 block, and the coding bit rate;
(2) if the 4 × 4 block is texture-free, use the DC prediction mode;
(3) if the 4 × 4 block contains texture, compute the rate-distortion value of each of the 9 intra prediction modes using formula (1) of the original H.264/MVC:
$$J(s, c, m \mid QP, \lambda_m) = \mathrm{SSD}(s, c, m \mid QP) + \lambda_m \, R(s, c, m \mid QP) \tag{1}$$
where QP is the quantization parameter of the macroblock, $\lambda_m$ is the Lagrange multiplier, which depends on QP, $\mathrm{SSD}(\cdot)$ is the sum of squared differences between the original luma block s and the reconstructed block c, and $R(\cdot)$ is the number of bits needed to code with mode m;
(4) select the mode with the smallest rate-distortion value as the best 4 × 4 intra prediction mode;
3) Intra macroblock mode coding:
a 16 × 16 block directly down-sampled to 4 × 4 is flagged 01;
an 8 × 8 block directly down-sampled to 4 × 4 is flagged 10;
an 8 × 8 block divided into four 4 × 4 blocks is flagged 11;
the predicted value is subtracted from the gray value of each pixel of the current block to obtain the residual of the current coding block, and the residual undergoes discrete cosine transform, quantization, and entropy coding;
4) At the decoding end, blocks are reconstructed from the decoded bitstream and then up-sampled.
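The partitioning of step 1) and the flags of step 3) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the function names, the threshold of 64, and the use of |gx| + |gy| as the edge strength are hypothetical, since the text only says that a Sobel operator is used.

```python
def sobel_edges(img, threshold=64):
    """Binary edge map of a 2D list of gray values using the 3x3 Sobel operator.

    The threshold and the |gx| + |gy| magnitude are assumptions; border
    pixels are simply left as non-edge.
    """
    h, w = len(img), len(img[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Horizontal gradient: right column minus left column.
            gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y][x - 1] - img[y + 1][x - 1])
            # Vertical gradient: bottom row minus top row.
            gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y - 1][x] - img[y - 1][x + 1])
            if abs(gx) + abs(gy) >= threshold:
                edges[y][x] = 1
    return edges


def has_edge(edges, y0, x0, size):
    """True if any edge pixel lies inside the size x size block at (y0, x0)."""
    return any(edges[y][x]
               for y in range(y0, y0 + size)
               for x in range(x0, x0 + size))


def partition_macroblock(edges, y0, x0):
    """Return (flag, y, x, size) entries for the 16x16 macroblock at (y0, x0).

    01: 16x16 without edges, down-sampled to 4x4
    10: 8x8 without edges, down-sampled to 4x4
    11: 8x8 with edges, divided into four 4x4 blocks
    """
    if not has_edge(edges, y0, x0, 16):
        return [("01", y0, x0, 16)]
    parts = []
    for dy in (0, 8):
        for dx in (0, 8):
            y, x = y0 + dy, x0 + dx
            flag = "11" if has_edge(edges, y, x, 8) else "10"
            parts.append((flag, y, x, 8))
    return parts
```

For a flat macroblock this returns a single 01 entry. For a macroblock crossed by a vertical depth step near its left border, the two left 8 × 8 blocks are flagged 11 and the two right ones 10.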
The advantage of the invention is that, exploiting the fact that a depth map contains only object contour information, a depth map coding method based on different sampling blocks is proposed: by adaptively partitioning the depth macroblocks that contain discontinuous regions, a better prediction result is obtained, which improves the efficiency of depth map coding while reducing its complexity.
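The prediction result mentioned above is chosen by the mode decision of step 2). A minimal sketch of that decision with the cost of formula (1) follows. It assumes the candidate reconstructions and their bit counts are already available from an encoder; the helper names and the candidates dictionary are hypothetical, and the Lagrange multiplier is passed in rather than derived from QP.

```python
def ssd(orig, recon):
    """Sum of squared differences between two equally sized 2D blocks."""
    return sum((o - r) ** 2
               for row_o, row_r in zip(orig, recon)
               for o, r in zip(row_o, row_r))


def rd_cost(orig, recon, bits, lam):
    """Rate-distortion cost J = SSD + lambda * R, as in formula (1)."""
    return ssd(orig, recon) + lam * bits


def select_intra_mode(orig, candidates, lam, has_texture=True):
    """Pick the intra mode for one 4x4 block.

    candidates maps a mode name to (reconstructed 4x4 block, bits used).
    Texture-free blocks skip the search and use DC prediction directly.
    """
    if not has_texture:
        return "DC"
    return min(candidates,
               key=lambda m: rd_cost(orig, candidates[m][0],
                                     candidates[m][1], lam))


# Toy example with two hypothetical candidates for a flat block of value 10.
block = [[10] * 4 for _ in range(4)]
modes = {
    "DC": ([[10] * 4 for _ in range(4)], 4),        # exact, but 4 bits
    "vertical": ([[12] * 4 for _ in range(4)], 1),  # SSD 64, but 1 bit
}
best_low_lambda = select_intra_mode(block, modes, lam=2)
best_high_lambda = select_intra_mode(block, modes, lam=100)
```

With a small multiplier the distortion term dominates and the exact but costlier candidate wins; with a large multiplier the cheaper candidate wins, which is the trade-off that the multiplier's dependence on QP controls.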
Brief description of the drawings
Fig. 1 is a flow chart of the invention as applied to depth map coding.
Detailed description of the embodiments
The method comprises the following steps:
1) Edge detection: under the current H.264/MVC coding standard, read in the depth map to be encoded and first perform edge detection with the Sobel operator to obtain an edge image;
2) Macroblock partitioning: the depth image is partitioned according to whether the co-located block of the edge image contains an edge. Partitioning starts at 16 × 16. If a 16 × 16 edge image block contains no texture edge, the 16 × 16 depth image block is down-sampled to a 4 × 4 block; if it contains a texture edge, the 16 × 16 depth image block is divided further into four 8 × 8 blocks. If an 8 × 8 edge image block contains no texture edge, the 8 × 8 depth image block is down-sampled to a 4 × 4 block; if it contains a texture edge, the 8 × 8 depth image block is divided further into four 4 × 4 blocks;
3) Intra prediction mode selection:
(1) for the current 4 × 4 block, compute the sum of squared differences (SSD) between it and the reconstructed 4 × 4 block, and the coding bit rate;
(2) if the 4 × 4 block is texture-free, use the DC prediction mode;
(3) otherwise, compute the rate-distortion value of each of the 9 intra prediction modes according to formula (1):
$$J(s, c, m \mid QP, \lambda_m) = \mathrm{SSD}(s, c, m \mid QP) + \lambda_m \, R(s, c, m \mid QP) \tag{1}$$
where QP is the quantization parameter of the macroblock, $\lambda_m$ is the Lagrange multiplier, which depends on QP, $\mathrm{SSD}(\cdot)$ is the sum of squared differences between the original luma block s and the reconstructed block c, and $R(\cdot)$ is the number of bits needed to code with mode m;
(4) select the mode with the smallest rate-distortion value as the best 4 × 4 intra prediction mode;
4) Intra multi-resolution macroblock mode coding:
a 16 × 16 block directly down-sampled to 4 × 4 is flagged 01;
an 8 × 8 block directly down-sampled to 4 × 4 is flagged 10;
an 8 × 8 block divided into four 4 × 4 blocks is flagged 11;
the predicted value is subtracted from the gray value of each pixel of the current block to obtain the residual of the current coding block, and the residual undergoes discrete cosine transform, quantization, and entropy coding;
5) At the decoding end, blocks are reconstructed from the decoded bitstream, and the reconstructed blocks are up-sampled according to the coding rule.
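Step 5) relies on the mode flag to tell the decoder how far each reconstructed 4 × 4 block must be up-sampled. The round trip can be sketched as below. The text does not name the resampling filters, so block averaging for down-sampling and pixel replication for up-sampling are assumptions, as are the function names and the flag table.

```python
# Block size that each flag restores at the decoder (flag 11 stays at 4x4).
FLAG_TO_SIZE = {"01": 16, "10": 8, "11": 4}


def downsample_to_4x4(block):
    """Average each factor x factor cell of a 16x16 or 8x8 block (assumed filter)."""
    factor = len(block) // 4
    area = factor * factor
    return [[sum(block[y * factor + dy][x * factor + dx]
                 for dy in range(factor)
                 for dx in range(factor)) // area
             for x in range(4)]
            for y in range(4)]


def upsample_from_4x4(small, flag):
    """Replicate each pixel of a 4x4 block up to the size implied by the flag."""
    size = FLAG_TO_SIZE[flag]
    factor = size // 4
    return [[small[y // factor][x // factor] for x in range(size)]
            for y in range(size)]


# A flat 16x16 block (flag 01) survives the round trip unchanged.
flat16 = [[87] * 16 for _ in range(16)]
restored16 = upsample_from_4x4(downsample_to_4x4(flat16), "01")

# An 8x8 block (flag 10) that is constant within each 2x2 cell also survives.
two_level8 = [[10] * 4 + [30] * 4 for _ in range(8)]
restored8 = upsample_from_4x4(downsample_to_4x4(two_level8), "10")
```

Under these assumptions a block without edges loses nothing in the round trip when it is constant, which is the case the method targets for object interiors; blocks with smooth gradients would be approximated.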

Claims (5)

1. A depth map encoding and decoding method based on different sampling blocks, characterized in that it comprises the following steps:
1) edge detection;
2) macroblock partitioning;
3) intra prediction mode selection;
4) intra multi-resolution macroblock mode coding;
5) at the decoding end, reconstructing blocks from the decoded bitstream and up-sampling the reconstructed blocks according to the coding rule.
2. The depth map encoding and decoding method based on different sampling blocks according to claim 1, characterized in that: in step 1), under the current H.264/MVC coding standard, the depth map to be encoded is read in, and edge detection is first performed with the Sobel operator to obtain an edge image.
3. The depth map encoding and decoding method based on different sampling blocks according to claim 2, characterized in that: in step 2), the depth image is partitioned according to whether the co-located block of the edge image contains an edge; partitioning starts at 16 × 16; if a 16 × 16 edge image block contains no texture edge, the 16 × 16 depth image block is down-sampled to a 4 × 4 block, and if it contains a texture edge, the 16 × 16 depth image block is divided further into four 8 × 8 blocks; if an 8 × 8 edge image block contains no texture edge, the 8 × 8 depth image block is down-sampled to a 4 × 4 block, and if it contains a texture edge, the 8 × 8 depth image block is divided further into four 4 × 4 blocks.
4. The depth map encoding and decoding method based on different sampling blocks according to claim 2, characterized in that: in step 3), the selection method is as follows:
(1) for the current 4 × 4 block, compute the sum of squared differences (SSD) between it and the reconstructed 4 × 4 block, and the coding bit rate;
(2) if the 4 × 4 block is texture-free, use the DC prediction mode;
(3) otherwise, compute the rate-distortion value of each of the 9 intra prediction modes according to formula (1):
$$J(s, c, m \mid QP, \lambda_m) = \mathrm{SSD}(s, c, m \mid QP) + \lambda_m \, R(s, c, m \mid QP) \tag{1}$$
where QP is the quantization parameter of the macroblock, $\lambda_m$ is the Lagrange multiplier, which depends on QP, $\mathrm{SSD}(\cdot)$ is the sum of squared differences between the original luma block s and the reconstructed block c, and $R(\cdot)$ is the number of bits needed to code with mode m;
(4) select the mode with the smallest rate-distortion value as the best 4 × 4 intra prediction mode.
5. The depth map encoding and decoding method based on different sampling blocks according to claim 2, characterized in that: in step 4), the coding method is as follows:
a 16 × 16 block directly down-sampled to 4 × 4 is flagged 01;
an 8 × 8 block directly down-sampled to 4 × 4 is flagged 10;
an 8 × 8 block divided into four 4 × 4 blocks is flagged 11;
the predicted value is subtracted from the gray value of each pixel of the current block to obtain the residual of the current coding block, and the residual undergoes discrete cosine transform, quantization, and entropy coding.
Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811349423.9A CN109327703A (en) 2018-11-13 2018-11-13 Depth map encoding and decoding method based on different sampling blocks

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811349423.9A CN109327703A (en) 2018-11-13 2018-11-13 Depth map encoding and decoding method based on different sampling blocks

Publications (1)

Publication Number Publication Date
CN109327703A (en) 2019-02-12

Family

ID=65259646

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811349423.9A Depth map encoding and decoding method based on different sampling blocks (Pending, CN109327703A (en)) 2018-11-13 2018-11-13

Country Status (1)

Country Link
CN (1) CN109327703A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110519606A (en) * 2019-08-22 2019-11-29 天津大学 Intelligent coding method in a kind of deep video frame
CN110519606B (en) * 2019-08-22 2021-12-07 天津大学 Depth video intra-frame intelligent coding method
WO2021037277A1 (en) * 2019-09-01 2021-03-04 Beijing Bytedance Network Technology Co., Ltd. Rules for video coding using geometric partitioning mode
US11689735B2 (en) 2019-09-01 2023-06-27 Beijing Bytedance Network Technology Co., Ltd. Alignment of prediction weights in video coding
US11758143B2 (en) 2019-10-10 2023-09-12 Beijing Bytedance Network Technology Co., Ltd Motion vector handling in geometry partition mode
WO2021104433A1 (en) * 2019-11-30 2021-06-03 Beijing Bytedance Network Technology Co., Ltd. Simplified inter prediction with geometric partitioning
US11659180B2 (en) 2019-11-30 2023-05-23 Beijing Bytedance Network Technology Co., Ltd Simplified inter prediction with geometric partitioning
US11700378B2 (en) 2019-12-24 2023-07-11 Beijing Bytedance Network Technology Co., Ltd High level syntax for inter prediction with geometric partitioning
CN113613000A (en) * 2021-08-20 2021-11-05 天津大学 Intelligent multi-resolution depth video intra-frame prediction method
CN113613000B (en) * 2021-08-20 2024-04-26 天津大学 Intelligent multi-resolution depth video intra-frame prediction method

Similar Documents

Publication Publication Date Title
US11115681B2 (en) Hybrid video coding supporting intermediate view synthesis
CN111133476B (en) System, apparatus and method for compression and decompression of a point cloud comprising a plurality of points
CN109327703A (en) Depth map encoding and decoding method based on different sampling blocks
JP6312787B2 (en) Moving picture decoding apparatus, moving picture decoding method, moving picture encoding apparatus, moving picture encoding method, moving picture encoded data, and recording medium
Merkle et al. The effects of multiview depth video compression on multiview rendering
CN109525846B (en) Apparatus and method for encoding and decoding
CN109257596B (en) Decoder, encoder and reconstruction, encoding, decoding, transmission and processing method
KR101354387B1 (en) Depth map generation techniques for conversion of 2d video data to 3d video data
KR20050105271A (en) Video encoding
CN103581647A (en) Depth map sequence fractal coding method based on motion vectors of color video
CN107864380B (en) 3D-HEVC fast intra-frame prediction decision method based on DCT
KR20150036261A (en) Apparatus for coding a bit stream representing a three-dimensional video
Kundu Light field compression using homography and 2D warping
Liu et al. Generic segment-wise DC for 3D-HEVC depth intra coding
WO2014183652A1 (en) Method and apparatus for depth video coding using endurable view synthesis distortion
Hamout et al. Fast depth map intra-mode selection for 3D-HEVC intra-coding
Biswas SSIM-based joint-bit allocation for 3D video coding
Zhang et al. Rendering distortion estimation model for 3D high efficiency depth coding
Dou et al. Segment-based view synthesis optimization scheme in 3D-HEVC
Jiang et al. Fast inter-frame prediction in multi-view video coding based on perceptual distortion threshold model
Ahmmed et al. An edge aware motion modeling technique leveraging on the discrete cosine basis oriented motion model and frame super resolution
Zhu et al. View synthesis oriented depth map coding algorithm
Shah et al. Evaluating multi-view plus depth coding solutions for 3D video scenarios
Hanca et al. Segmentation-based intra coding of depth maps using texture information
Oh et al. High priority intra coding method for depth video coding

Legal Events

Date Code Title Description
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190212