CN104661023A

CN104661023A - Image or video coding method based on predistortion and training filter

Info

Publication number: CN104661023A
Application number: CN201510058184.1A
Authority: CN
Inventors: 段绿茵; 徐岩; 雷志春
Original assignee: Tianjin University
Current assignee: Tianjin University
Priority date: 2015-02-04
Filing date: 2015-02-04
Publication date: 2015-05-27
Anticipated expiration: 2035-02-04
Also published as: CN104661023B

Abstract

The invention relates to an image or video compression coding and provides an image or video coding method based on predistortion and filter training, and the method not only can improve the coding efficiency but also avoids a ringing effect so as to guarantee the image quality. The image or video coding method based on predistortion and filter training comprises the following steps: through a low-pass filter, performing predistortion fuzzification treatment on a sequence of each frame of video before coding; at a decoding end, adopting a large amount of images with various image structures to form a training sample library at first; then, performing offline training on a filter set by the classified images; when reconstructing fuzzy images at the decoding end, classifying the image structures by adopting a combined method of ADRC (adaptive dynamic range coding) and the other classification method, finding corresponding optimal filter coefficients in a lookup table according to a classification result, forming optimal filters, performing defuzzification respectively based on image structure types, and finally performing synthesis to obtain defuzzified images. The image or video coding method is mainly applied to image or video compression coding.

Description

Based on predistortion and image or the method for video coding of training filter

Technical field

The present invention relates to image, video compression coding, particularly based on predistortion and image or the method for video coding of training filter.

Background technology

Along with the development of electronic equipment, the Internet and improving constantly of people's demand, digital picture and video data produce just at a terrific speed and propagate, and high definition or ultra high-definition video become development trend.Such as, 3D, HDR (high dynamic range), 4K or 8K video etc., storing or transmitting these video datas all needs efficient compression method to reduce data volume and to ensure the quality of image.

Major video compression standard mainly contain MPEG-2, MPEG-4, H.264/AVC, HEVC etc.These compression and coding standards utilize video sequence spatial coherence and temporal correlation, remove the bulk redundancy information existed in video sequence, only retain a small amount of uncorrelated information and transmit, to reduce code check, to save transmission bandwidth.And receiver utilizes these irrelevant information, according to certain decoding algorithm, original video sequence content can be recovered under the prerequisite ensureing certain picture quality.Estimation in video compression coding and dct transform are the methods of important minimizing signal redundancy.Estimation make use of the local directivity architectural characteristic of image, and DCT make use of the grading structure characteristic that natural image has, i.e. low-pass characteristic, is intended to remove the redundancy in block between pixel.Dct transform by concentration of energy on a small amount of low frequency coefficient, can more easily realize compression in conjunction with quantification and entropy code.These methods all contribute to the minimizing of amount of coded data, and in view of current multi-medium data amount is increasing, under limited storage and bandwidth resources condition, improving video compression coding efficiency further has important realistic meaning.

2007, Lei.Zhichun [1] proposed a kind of video compression coding scheme, and this scheme at coding side predistortion Fuzzy processing video, and transmits ambiguity function in code stream, when decoding end reconstructs by this ambiguity function deconvolution deblurring.The method effectively can reduce video data to be encoded amount and then reduce video stream code rate, but because using deconvolution deblurring to produce ringing effect when reconstructing.

Ringing effect is the inherent shortcoming of deconvolution deblurring method.In image processing field, researcher is devoted to find a kind of new deblurring algorithm to replace deconvolution always.1998, the people such as T.Kondo [2] proposed a kind of least-mean-square filter based on Images Classification, had good effect for image enhaucament.Adaptive dynamic range coding (ADRC) algorithm that picture structure of giving chapter and verse again afterwards carries out classifying is to recover original image [3].2008, the people [4] such as Ling Shao propose the algorithm Recovery image that ADRC combines with another kind of image classification method, and test for the algorithm that ADRC and mean absolute difference (MAD), ADRC and standard deviation (STD), ADRC and dynamic range (DR) combine, effect is better than cascading filter.2014, nightstool Yang Dengren [5] proposes a kind of boundary effect Restrainable algorithms, utilize the convolution pyramid filter group model with symmetry coefficient, one group of filter coefficient is trained in the region of type every in certain particular image, and train the bank of filters obtained to be applied to the continuation region solving other image respective type this, the method computational speed is fast, effectively can suppress the ringing effect of various Frequency Domain Deconvolution algorithm.

In sum, existing research proves that coding side predistortion Fuzzy Processing can reduce code check, training filtered method can realize image deblurring and avoid ringing effect (only for image procossing, not relating to coding) respectively.Not yet find that there is these two kinds of methods combining to get up for image and video coding field at present.

Therefore, the present invention proposes a kind of based on predistortion and the training image of filter or Video Coding Scheme, namely uses predistortion (obfuscation) process to reduce code check, adopt off-line training is good filter to carry out the compression coding scheme of deblurring process Recovery image quality in decoding end at coding side.

List of references

[1] L.Zhichun.Signal coding and decoding with pre-and post-processing. Europe, 06006924.2 [P], 2007-10-03.

[2]T.Kondo and K.Kawaguchi,Adaptive dynamic range encoding method andapparatus.U.S.patent 5444487,Aug.1998.

[3]T.Kondo,Y.Node,T.Fujiwara,and Y.Okumura,Picture conversion apparatus.pictureconversion method,learning apparatus and learning method,U.S.patent 6323905,Nov.2001.

[4]Ling Shao,Hui Zhang,and Gerard de Haan.An Overview and Performance Evaluation ofClassification-Base Least Squares Trained Filters[C].//Transaction on ImageProcessing,2008,17(10):1772-1782.

[5] nightstool is raised, Liu Xuehui, Wu Enhua. the quick Restrainable algorithms [J] of image deconvolution boundary effect. and computer-aided design and graphics journal, 2014,26 (7): 1051-1066.

Summary of the invention

For overcoming the deficiencies in the prior art, providing one to improve code efficiency, avoiding ringing effect again, thus ensure picture quality based on predistortion and the training image of filter or method for video coding.For this reason, the technical scheme that the present invention takes is, based on predistortion and image or the method for video coding of training filter, by low pass filter, first carries out predistortion Fuzzy processing to every frame video sequence before the coding;

In decoding end, first adopt the image composition training sample database in a large number with various picture structure, the difference according to picture structure is classified to zones of different, and the concrete method adopting ADRC to combine with another kind of sorting technique is classified to image;

Then, carry out off-line training with sorted image to bank of filters, adopt the training algorithm of least mean-square error, the filter coefficient of one group of optimum is trained in the region for every type, forms look-up table (LUT), stored in decoding end reconstructed module;

After decoding end reconstructs blurred picture, the method adopting ADRC to combine with another sorting technique is classified to picture structure, corresponding optimal filter coefficients composition optimal filter is found in a lookup table according to classification results, by picture structure classification deblurring respectively, finally synthesis obtains the image after deblurring.

Picture structure specifically comprises smooth region, texture region, fringe region.

Another kind of sorting technique is specially the one in local entropy, mean absolute difference (MAD), standard deviation (STD), dynamic range (DR).

When decoded reconstructed image, if run into the picture structure do not had in training sample database, namely the filter coefficient of this structure corresponding is not had in look-up table, Deconvolution Method is then adopted to replace training filter Recovery image or video, by Deconvolution Method as a kind of alternative scheme outside training filter deblurring algorithm, for recovering original image or video.

Compared with the prior art, technical characterstic of the present invention and effect:

The present invention proposes based on predistortion and image or the Video Coding Scheme of training filter, and the predistortion Fuzzy Processing of coding side can reduce code check, improve code efficiency.The training filter deblurring algorithm of decoding and reconstituting end selects different filters to carry out deblurring recovery original image according to the difference of picture structure, can avoid ringing effect, improve decoded picture or video quality.

Accompanying drawing explanation

Fig. 1 the present invention propose based on predistortion and the training image of filter or Video Coding Scheme.

Embodiment

Multi-medium data amount is increasing, and consumer requires more and more higher to multimedia quality simultaneously.Therefore need to study and can improve code efficiency further and the algorithm ensureing certain coding quality.

Carry out predistortion Fuzzy processing at coding side can reduce data to be encoded amount, improve code efficiency, but challenge is proposed to the deblurring function of decoding end reconstructed module.If transmission ambiguity function can increase code stream expense, and traditional deconvolution image deblurring algorithm can cause ringing effect.Therefore, the predistortion-deblurring scheme being applicable to Video coding is developed significant.

The present invention proposes a kind of based on predistortion and image or the Video Coding Scheme of training filter.This scheme carries out predistortion (obfuscation) process at coding side to image or video, and without the need to transfer encoding end predistortion (obfuscation) function in code stream, select the good different filter of off-line training to carry out deblurring in decoding end according to different images structure.Can either code efficiency be improved, avoid ringing effect again, ensure that picture quality.

In existing encoding scheme, usually, will carry out low-pass filtering before the coding to remove high-frequency noise, the mode of this low-pass filtering does not have a significant effect to picture quality.For reducing data to be encoded amount further, the present invention first carries out predistortion Fuzzy processing to every frame video sequence before the coding, it can be made to be realized by the low pass filter that size is 5 × 5, and then carry out compressed encoding.The fuzzy high fdrequency component of image that makes significantly reduces, and compression efficiency is higher, but Fuzzy processing makes image fault, needs to adopt deblurring algorithm Recovery image quality when decoding end reconstructs.

The algorithm of training filter deblurring is used for decoding end reconstructed module by the present invention.First adopt and there is various picture structure (smooth region in a large number, texture region, fringe region etc.) image composition training sample database, difference according to picture structure is classified to zones of different, usually, the different structure of method to image of adaptive dynamic range coding (ADRC) can be adopted to classify, but the method does not have the difference of differentiate between images high-contrast and low contrast structure and does not consider the problem that object detail Partial shrinkage noise is identical, the method that ADRC therefore can be adopted to combine with another kind of sorting technique is classified to image.Such as, the method etc. that combines of ADRC and local entropy, ADRC and mean absolute difference (MAD), ADRC and standard deviation (STD), ADRC and dynamic range (DR).

Then, carry out off-line training with sorted image to bank of filters, can adopt the training algorithm of least mean-square error, the filter coefficient of one group of optimum is trained in the region for every type, form look-up table (LUT), stored in decoding end reconstructed module.

After decoding end reconstructs blurred picture, the method adopting ADRC to combine with another sorting technique (as local entropy) is classified to picture structure, corresponding optimal filter coefficients composition optimal filter is found in a lookup table according to classification results, by picture structure classification deblurring respectively, finally synthesis obtains the image after deblurring.

In addition, when decoded reconstructed image, if run into the picture structure (namely not having the filter coefficient of this structure corresponding in look-up table) do not had in training sample database, Deconvolution Method can be adopted to replace training filter Recovery image or video, namely Deconvolution Method can as a kind of alternative scheme outside training filter deblurring algorithm, for recovering original image or video.

The present invention proposes a kind of based on predistortion and image or the Video Coding Scheme of training filter.Wherein, in filter training, the method adopting adaptive dynamic range coding (ADRC) and local entropy (or ADRC and mean absolute difference (MAD), ADRC and standard deviation (STD), ADRC and dynamic range (DR)) to combine is classified for the different structure of image, adopt the training algorithm of least mean-square error afterwards, one group of filter coefficient is trained in region for every type, form look-up table (LUT), stored in decoding end reconstructed module.In the implementation process of whole scheme, first, at coding side, pre-distortion is carried out to reduce code check to input picture or video sequence; Then according to existing encoding scheme, encoding and decoding are carried out to the image after predistortion or video; Finally, when decoded reconstructed image, using the filter that off-line training completes, according to searching best filter factor in the look-up table that do not coexist (LUT) of picture structure, completing the reconstruction of image or video.

Claims

1., based on predistortion and image or a method for video coding of training filter, it is characterized in that, comprise the steps:

At coding side: by low pass filter, first predistortion Fuzzy processing is carried out to every frame video sequence before the coding;

2. according to claim 1 based on predistortion and image or the method for video coding of training filter, it is characterized in that, picture structure specifically comprises smooth region, texture region, fringe region.

3. according to claim 1 based on predistortion and image or the method for video coding of training filter, it is characterized in that, another kind of sorting technique is specially the one in local entropy, mean absolute difference (MAD), standard deviation (STD), dynamic range (DR).

4. according to claim 1 based on predistortion and image or the method for video coding of training filter, it is characterized in that, when decoded reconstructed image, if run into the picture structure do not had in training sample database, namely the filter coefficient of this structure corresponding is not had in look-up table, Deconvolution Method is then adopted to replace training filter Recovery image or video, by Deconvolution Method as a kind of alternative scheme outside training filter deblurring algorithm, for recovering original image or video.