CN104661023A - Image or video coding method based on predistortion and training filter - Google Patents

Image or video coding method based on predistortion and training filter Download PDF

Info

Publication number
CN104661023A
CN104661023A CN201510058184.1A CN201510058184A CN104661023A CN 104661023 A CN104661023 A CN 104661023A CN 201510058184 A CN201510058184 A CN 201510058184A CN 104661023 A CN104661023 A CN 104661023A
Authority
CN
China
Prior art keywords
image
filter
training
predistortion
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510058184.1A
Other languages
Chinese (zh)
Other versions
CN104661023B (en
Inventor
段绿茵
徐岩
雷志春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201510058184.1A priority Critical patent/CN104661023B/en
Publication of CN104661023A publication Critical patent/CN104661023A/en
Application granted granted Critical
Publication of CN104661023B publication Critical patent/CN104661023B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention relates to an image or video compression coding and provides an image or video coding method based on predistortion and filter training, and the method not only can improve the coding efficiency but also avoids a ringing effect so as to guarantee the image quality. The image or video coding method based on predistortion and filter training comprises the following steps: through a low-pass filter, performing predistortion fuzzification treatment on a sequence of each frame of video before coding; at a decoding end, adopting a large amount of images with various image structures to form a training sample library at first; then, performing offline training on a filter set by the classified images; when reconstructing fuzzy images at the decoding end, classifying the image structures by adopting a combined method of ADRC (adaptive dynamic range coding) and the other classification method, finding corresponding optimal filter coefficients in a lookup table according to a classification result, forming optimal filters, performing defuzzification respectively based on image structure types, and finally performing synthesis to obtain defuzzified images. The image or video coding method is mainly applied to image or video compression coding.

Description

Based on predistortion and image or the method for video coding of training filter
Technical field
The present invention relates to image, video compression coding, particularly based on predistortion and image or the method for video coding of training filter.
Background technology
Along with the development of electronic equipment, the Internet and improving constantly of people's demand, digital picture and video data produce just at a terrific speed and propagate, and high definition or ultra high-definition video become development trend.Such as, 3D, HDR (high dynamic range), 4K or 8K video etc., storing or transmitting these video datas all needs efficient compression method to reduce data volume and to ensure the quality of image.
Major video compression standard mainly contain MPEG-2, MPEG-4, H.264/AVC, HEVC etc.These compression and coding standards utilize video sequence spatial coherence and temporal correlation, remove the bulk redundancy information existed in video sequence, only retain a small amount of uncorrelated information and transmit, to reduce code check, to save transmission bandwidth.And receiver utilizes these irrelevant information, according to certain decoding algorithm, original video sequence content can be recovered under the prerequisite ensureing certain picture quality.Estimation in video compression coding and dct transform are the methods of important minimizing signal redundancy.Estimation make use of the local directivity architectural characteristic of image, and DCT make use of the grading structure characteristic that natural image has, i.e. low-pass characteristic, is intended to remove the redundancy in block between pixel.Dct transform by concentration of energy on a small amount of low frequency coefficient, can more easily realize compression in conjunction with quantification and entropy code.These methods all contribute to the minimizing of amount of coded data, and in view of current multi-medium data amount is increasing, under limited storage and bandwidth resources condition, improving video compression coding efficiency further has important realistic meaning.
2007, Lei.Zhichun [1] proposed a kind of video compression coding scheme, and this scheme at coding side predistortion Fuzzy processing video, and transmits ambiguity function in code stream, when decoding end reconstructs by this ambiguity function deconvolution deblurring.The method effectively can reduce video data to be encoded amount and then reduce video stream code rate, but because using deconvolution deblurring to produce ringing effect when reconstructing.
Ringing effect is the inherent shortcoming of deconvolution deblurring method.In image processing field, researcher is devoted to find a kind of new deblurring algorithm to replace deconvolution always.1998, the people such as T.Kondo [2] proposed a kind of least-mean-square filter based on Images Classification, had good effect for image enhaucament.Adaptive dynamic range coding (ADRC) algorithm that picture structure of giving chapter and verse again afterwards carries out classifying is to recover original image [3].2008, the people [4] such as Ling Shao propose the algorithm Recovery image that ADRC combines with another kind of image classification method, and test for the algorithm that ADRC and mean absolute difference (MAD), ADRC and standard deviation (STD), ADRC and dynamic range (DR) combine, effect is better than cascading filter.2014, nightstool Yang Dengren [5] proposes a kind of boundary effect Restrainable algorithms, utilize the convolution pyramid filter group model with symmetry coefficient, one group of filter coefficient is trained in the region of type every in certain particular image, and train the bank of filters obtained to be applied to the continuation region solving other image respective type this, the method computational speed is fast, effectively can suppress the ringing effect of various Frequency Domain Deconvolution algorithm.
In sum, existing research proves that coding side predistortion Fuzzy Processing can reduce code check, training filtered method can realize image deblurring and avoid ringing effect (only for image procossing, not relating to coding) respectively.Not yet find that there is these two kinds of methods combining to get up for image and video coding field at present.
Therefore, the present invention proposes a kind of based on predistortion and the training image of filter or Video Coding Scheme, namely uses predistortion (obfuscation) process to reduce code check, adopt off-line training is good filter to carry out the compression coding scheme of deblurring process Recovery image quality in decoding end at coding side.
List of references
[1] L.Zhichun.Signal coding and decoding with pre-and post-processing. Europe, 06006924.2 [P], 2007-10-03.
[2]T.Kondo and K.Kawaguchi,Adaptive dynamic range encoding method andapparatus.U.S.patent 5444487,Aug.1998.
[3]T.Kondo,Y.Node,T.Fujiwara,and Y.Okumura,Picture conversion apparatus.pictureconversion method,learning apparatus and learning method,U.S.patent 6323905,Nov.2001.
[4]Ling Shao,Hui Zhang,and Gerard de Haan.An Overview and Performance Evaluation ofClassification-Base Least Squares Trained Filters[C].//Transaction on ImageProcessing,2008,17(10):1772-1782.
[5] nightstool is raised, Liu Xuehui, Wu Enhua. the quick Restrainable algorithms [J] of image deconvolution boundary effect. and computer-aided design and graphics journal, 2014,26 (7): 1051-1066.
Summary of the invention
For overcoming the deficiencies in the prior art, providing one to improve code efficiency, avoiding ringing effect again, thus ensure picture quality based on predistortion and the training image of filter or method for video coding.For this reason, the technical scheme that the present invention takes is, based on predistortion and image or the method for video coding of training filter, by low pass filter, first carries out predistortion Fuzzy processing to every frame video sequence before the coding;
In decoding end, first adopt the image composition training sample database in a large number with various picture structure, the difference according to picture structure is classified to zones of different, and the concrete method adopting ADRC to combine with another kind of sorting technique is classified to image;
Then, carry out off-line training with sorted image to bank of filters, adopt the training algorithm of least mean-square error, the filter coefficient of one group of optimum is trained in the region for every type, forms look-up table (LUT), stored in decoding end reconstructed module;
After decoding end reconstructs blurred picture, the method adopting ADRC to combine with another sorting technique is classified to picture structure, corresponding optimal filter coefficients composition optimal filter is found in a lookup table according to classification results, by picture structure classification deblurring respectively, finally synthesis obtains the image after deblurring.
Picture structure specifically comprises smooth region, texture region, fringe region.
Another kind of sorting technique is specially the one in local entropy, mean absolute difference (MAD), standard deviation (STD), dynamic range (DR).
When decoded reconstructed image, if run into the picture structure do not had in training sample database, namely the filter coefficient of this structure corresponding is not had in look-up table, Deconvolution Method is then adopted to replace training filter Recovery image or video, by Deconvolution Method as a kind of alternative scheme outside training filter deblurring algorithm, for recovering original image or video.
Compared with the prior art, technical characterstic of the present invention and effect:
The present invention proposes based on predistortion and image or the Video Coding Scheme of training filter, and the predistortion Fuzzy Processing of coding side can reduce code check, improve code efficiency.The training filter deblurring algorithm of decoding and reconstituting end selects different filters to carry out deblurring recovery original image according to the difference of picture structure, can avoid ringing effect, improve decoded picture or video quality.
Accompanying drawing explanation
Fig. 1 the present invention propose based on predistortion and the training image of filter or Video Coding Scheme.
Embodiment
Multi-medium data amount is increasing, and consumer requires more and more higher to multimedia quality simultaneously.Therefore need to study and can improve code efficiency further and the algorithm ensureing certain coding quality.
Carry out predistortion Fuzzy processing at coding side can reduce data to be encoded amount, improve code efficiency, but challenge is proposed to the deblurring function of decoding end reconstructed module.If transmission ambiguity function can increase code stream expense, and traditional deconvolution image deblurring algorithm can cause ringing effect.Therefore, the predistortion-deblurring scheme being applicable to Video coding is developed significant.
The present invention proposes a kind of based on predistortion and image or the Video Coding Scheme of training filter.This scheme carries out predistortion (obfuscation) process at coding side to image or video, and without the need to transfer encoding end predistortion (obfuscation) function in code stream, select the good different filter of off-line training to carry out deblurring in decoding end according to different images structure.Can either code efficiency be improved, avoid ringing effect again, ensure that picture quality.
In existing encoding scheme, usually, will carry out low-pass filtering before the coding to remove high-frequency noise, the mode of this low-pass filtering does not have a significant effect to picture quality.For reducing data to be encoded amount further, the present invention first carries out predistortion Fuzzy processing to every frame video sequence before the coding, it can be made to be realized by the low pass filter that size is 5 × 5, and then carry out compressed encoding.The fuzzy high fdrequency component of image that makes significantly reduces, and compression efficiency is higher, but Fuzzy processing makes image fault, needs to adopt deblurring algorithm Recovery image quality when decoding end reconstructs.
The algorithm of training filter deblurring is used for decoding end reconstructed module by the present invention.First adopt and there is various picture structure (smooth region in a large number, texture region, fringe region etc.) image composition training sample database, difference according to picture structure is classified to zones of different, usually, the different structure of method to image of adaptive dynamic range coding (ADRC) can be adopted to classify, but the method does not have the difference of differentiate between images high-contrast and low contrast structure and does not consider the problem that object detail Partial shrinkage noise is identical, the method that ADRC therefore can be adopted to combine with another kind of sorting technique is classified to image.Such as, the method etc. that combines of ADRC and local entropy, ADRC and mean absolute difference (MAD), ADRC and standard deviation (STD), ADRC and dynamic range (DR).
Then, carry out off-line training with sorted image to bank of filters, can adopt the training algorithm of least mean-square error, the filter coefficient of one group of optimum is trained in the region for every type, form look-up table (LUT), stored in decoding end reconstructed module.
After decoding end reconstructs blurred picture, the method adopting ADRC to combine with another sorting technique (as local entropy) is classified to picture structure, corresponding optimal filter coefficients composition optimal filter is found in a lookup table according to classification results, by picture structure classification deblurring respectively, finally synthesis obtains the image after deblurring.
In addition, when decoded reconstructed image, if run into the picture structure (namely not having the filter coefficient of this structure corresponding in look-up table) do not had in training sample database, Deconvolution Method can be adopted to replace training filter Recovery image or video, namely Deconvolution Method can as a kind of alternative scheme outside training filter deblurring algorithm, for recovering original image or video.
The present invention proposes a kind of based on predistortion and image or the Video Coding Scheme of training filter.Wherein, in filter training, the method adopting adaptive dynamic range coding (ADRC) and local entropy (or ADRC and mean absolute difference (MAD), ADRC and standard deviation (STD), ADRC and dynamic range (DR)) to combine is classified for the different structure of image, adopt the training algorithm of least mean-square error afterwards, one group of filter coefficient is trained in region for every type, form look-up table (LUT), stored in decoding end reconstructed module.In the implementation process of whole scheme, first, at coding side, pre-distortion is carried out to reduce code check to input picture or video sequence; Then according to existing encoding scheme, encoding and decoding are carried out to the image after predistortion or video; Finally, when decoded reconstructed image, using the filter that off-line training completes, according to searching best filter factor in the look-up table that do not coexist (LUT) of picture structure, completing the reconstruction of image or video.

Claims (4)

1., based on predistortion and image or a method for video coding of training filter, it is characterized in that, comprise the steps:
At coding side: by low pass filter, first predistortion Fuzzy processing is carried out to every frame video sequence before the coding;
In decoding end, first adopt the image composition training sample database in a large number with various picture structure, the difference according to picture structure is classified to zones of different, and the concrete method adopting ADRC to combine with another kind of sorting technique is classified to image;
Then, carry out off-line training with sorted image to bank of filters, adopt the training algorithm of least mean-square error, the filter coefficient of one group of optimum is trained in the region for every type, forms look-up table (LUT), stored in decoding end reconstructed module;
After decoding end reconstructs blurred picture, the method adopting ADRC to combine with another sorting technique is classified to picture structure, corresponding optimal filter coefficients composition optimal filter is found in a lookup table according to classification results, by picture structure classification deblurring respectively, finally synthesis obtains the image after deblurring.
2. according to claim 1 based on predistortion and image or the method for video coding of training filter, it is characterized in that, picture structure specifically comprises smooth region, texture region, fringe region.
3. according to claim 1 based on predistortion and image or the method for video coding of training filter, it is characterized in that, another kind of sorting technique is specially the one in local entropy, mean absolute difference (MAD), standard deviation (STD), dynamic range (DR).
4. according to claim 1 based on predistortion and image or the method for video coding of training filter, it is characterized in that, when decoded reconstructed image, if run into the picture structure do not had in training sample database, namely the filter coefficient of this structure corresponding is not had in look-up table, Deconvolution Method is then adopted to replace training filter Recovery image or video, by Deconvolution Method as a kind of alternative scheme outside training filter deblurring algorithm, for recovering original image or video.
CN201510058184.1A 2015-02-04 2015-02-04 Image or method for video coding based on predistortion and training wave filter Expired - Fee Related CN104661023B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510058184.1A CN104661023B (en) 2015-02-04 2015-02-04 Image or method for video coding based on predistortion and training wave filter

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510058184.1A CN104661023B (en) 2015-02-04 2015-02-04 Image or method for video coding based on predistortion and training wave filter

Publications (2)

Publication Number Publication Date
CN104661023A true CN104661023A (en) 2015-05-27
CN104661023B CN104661023B (en) 2018-03-09

Family

ID=53251651

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510058184.1A Expired - Fee Related CN104661023B (en) 2015-02-04 2015-02-04 Image or method for video coding based on predistortion and training wave filter

Country Status (1)

Country Link
CN (1) CN104661023B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110149554A (en) * 2019-05-31 2019-08-20 Oppo广东移动通信有限公司 Method, apparatus, electronic equipment and the storage medium of video image processing
CN110598584A (en) * 2019-08-26 2019-12-20 天津大学 Convolutional neural network face recognition algorithm based on wavelet transform and DCT
CN112805990A (en) * 2018-11-15 2021-05-14 深圳市欢太科技有限公司 Video processing method and device, electronic equipment and computer readable storage medium
CN113723472A (en) * 2021-08-09 2021-11-30 北京大学 Image classification method based on dynamic filtering equal-variation convolution network model

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0601878A1 (en) * 1992-12-10 1994-06-15 Sony Corporation Adaptive dynamic range encoding method and apparatus
US6323905B1 (en) * 1997-12-25 2001-11-27 Sony Corporation Picture conversion apparatus picture conversion method learning apparatus and learning method
CN101325655A (en) * 2007-06-15 2008-12-17 索尼株式会社 Image signal processing apparatus and method, image display and output apparatus
US20090097542A1 (en) * 2006-03-31 2009-04-16 Sony Deutschland Gmbh Signal coding and decoding with pre- and post-processing
US20090161947A1 (en) * 2007-12-21 2009-06-25 Sony Corporation Image processing device and method, learning device and method, program, and recording medium
CN102413330A (en) * 2007-06-12 2012-04-11 浙江大学 Texture-adaptive video coding/decoding system
CN103475876A (en) * 2013-08-27 2013-12-25 北京工业大学 Learning-based low-bit-rate compression image super-resolution reconstruction method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0601878A1 (en) * 1992-12-10 1994-06-15 Sony Corporation Adaptive dynamic range encoding method and apparatus
US6323905B1 (en) * 1997-12-25 2001-11-27 Sony Corporation Picture conversion apparatus picture conversion method learning apparatus and learning method
US20090097542A1 (en) * 2006-03-31 2009-04-16 Sony Deutschland Gmbh Signal coding and decoding with pre- and post-processing
CN102413330A (en) * 2007-06-12 2012-04-11 浙江大学 Texture-adaptive video coding/decoding system
CN101325655A (en) * 2007-06-15 2008-12-17 索尼株式会社 Image signal processing apparatus and method, image display and output apparatus
US20090161947A1 (en) * 2007-12-21 2009-06-25 Sony Corporation Image processing device and method, learning device and method, program, and recording medium
CN103475876A (en) * 2013-08-27 2013-12-25 北京工业大学 Learning-based low-bit-rate compression image super-resolution reconstruction method

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112805990A (en) * 2018-11-15 2021-05-14 深圳市欢太科技有限公司 Video processing method and device, electronic equipment and computer readable storage medium
CN110149554A (en) * 2019-05-31 2019-08-20 Oppo广东移动通信有限公司 Method, apparatus, electronic equipment and the storage medium of video image processing
CN110149554B (en) * 2019-05-31 2021-06-15 Oppo广东移动通信有限公司 Video image processing method and device, electronic equipment and storage medium
CN110598584A (en) * 2019-08-26 2019-12-20 天津大学 Convolutional neural network face recognition algorithm based on wavelet transform and DCT
CN113723472A (en) * 2021-08-09 2021-11-30 北京大学 Image classification method based on dynamic filtering equal-variation convolution network model
CN113723472B (en) * 2021-08-09 2023-11-24 北京大学 Image classification method based on dynamic filtering constant-variation convolutional network model

Also Published As

Publication number Publication date
CN104661023B (en) 2018-03-09

Similar Documents

Publication Publication Date Title
CN107197260A (en) Video coding post-filter method based on convolutional neural networks
CN104661023B (en) Image or method for video coding based on predistortion and training wave filter
CN113055674B (en) Compressed video quality enhancement method based on two-stage multi-frame cooperation
US7697611B2 (en) Method for processing motion information
CN105898300A (en) Improved transformation coefficient sign data hiding method based on recovered transformation coefficient
CN101668196A (en) Low code rate image compression method based on down sampling and interpolation
CN113592746B (en) Method for enhancing quality of compressed video by fusing space-time information from coarse to fine
Wang et al. Compressed screen content image super resolution
CN115442613A (en) Interframe information-based noise removal method using GAN
Liu et al. Convolutional neural network-based residue super-resolution for video coding
CN108933942B (en) Filtering method of compressed video and filtering device for compressed video
CN105141967A (en) Fast adaptive loop filter algorithm based on just noticeable distortion (JND) model
Qi et al. CNN-based post-processing filter for video compression with multi-scale feature representation
JP6174966B2 (en) Image coding apparatus, image coding method, and program
Zhang et al. WCDANN: A Lightweight CNN Post-Processing Filter for VVC-Based Video Compression
Ding et al. CARNet: Compression Artifact Reduction for Point Cloud Attribute
Man et al. Deep learning-assisted video compression framework
Ning et al. Video Reversible Data Hiding Based on Motion Vector
CN113507607B (en) Compressed video multi-frame quality enhancement method without motion compensation
CN103841421A (en) USDC data processing method
Darazi et al. Adaptive lifting scheme-based method for joint coding 3D-stereo images with luminance correction and optimized prediction
Tang et al. Chaotic moving video quality enhancement based on deep in-loop filtering
CN114554213B (en) Motion adaptive and detail-focused compressed video quality enhancement method
Nayan et al. Baseline JPEG-like DWT CODEC for disparity compensated residual coding of stereo images
CN115496683A (en) Compressed video quality enhancement method based on holoknowledge network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180309

Termination date: 20210204