CN104219526A - HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion - Google Patents

HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion Download PDF

Info

Publication number
CN104219526A
CN104219526A CN201410440120.3A CN201410440120A CN104219526A CN 104219526 A CN104219526 A CN 104219526A CN 201410440120 A CN201410440120 A CN 201410440120A CN 104219526 A CN104219526 A CN 104219526A
Authority
CN
China
Prior art keywords
distortion
perceptual quality
perceptual
rate
perceptible
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410440120.3A
Other languages
Chinese (zh)
Other versions
CN104219526B (en
Inventor
周芸
于洋
王辉淇
李敬娜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
Academy of Broadcasting Science Research Institute
Original Assignee
Beijing University of Posts and Telecommunications
Academy of Broadcasting Science Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications, Academy of Broadcasting Science Research Institute filed Critical Beijing University of Posts and Telecommunications
Priority to CN201410440120.3A priority Critical patent/CN104219526B/en
Publication of CN104219526A publication Critical patent/CN104219526A/en
Application granted granted Critical
Publication of CN104219526B publication Critical patent/CN104219526B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

本发明涉及一种基于恰可察感知质量判决准则的HEVC率失真优化算法,其技术特点是:分析每一帧中每个宏块的运动模式及静态纹理特征,获得当前宏块的感知质量类型,得到图像显著区域;计算基于视觉显著性区域的恰可察失真阈值;计算基于恰可察失真模型的感知质量;根据基于恰可察失真模型的感知质量进行率失真优化。本发明设计合理,其采用基于恰可察感知质量判决准则进行HEVC率失真优化,能够克服均方误差MSE作为衡量视频失真评价标准的不足,使得最终的编码效果更加符合人眼的主观感知质量,同时,在主观质量不降低的前提下容忍更多的噪声,去除不必要的感知冗余,从而提高了压缩效率,降低了编码后文件的码率。

The present invention relates to an HEVC rate-distortion optimization algorithm based on just perceptible perceptual quality judgment criteria, and its technical characteristics are: analyzing the motion mode and static texture feature of each macroblock in each frame, and obtaining the perceptual quality type of the current macroblock , get the salient area of the image; calculate the just detectable distortion threshold based on the visual salient area; calculate the perceptual quality based on the just perceptible distortion model; perform rate-distortion optimization according to the perceptual quality based on the just perceptible distortion model. The present invention has a reasonable design, and adopts the HEVC rate-distortion optimization based on just perceptible perceptual quality judgment criteria, which can overcome the deficiency of mean square error (MSE) as an evaluation criterion for measuring video distortion, so that the final coding effect is more in line with the subjective perceptual quality of human eyes. At the same time, more noise is tolerated without reducing the subjective quality, and unnecessary perceptual redundancy is removed, thereby improving compression efficiency and reducing the code rate of the encoded file.

Description

Based on the HEVC rate-distortion optimization algorithm just can examining perceived quality decision rule
Technical field
The invention belongs to technical field of video coding, especially a kind of HEVC rate-distortion optimization algorithm based on perceived quality decision rule just can be examined.
Background technology
In recent years, along with the fast development of national economy, the progress of technology and people's improving constantly video quality demands, high definition/ultra high-definition video coding technique becomes as the basic core technology of the business such as future home movie theatre, digital broadcast television, Internet video, high-definition movie the focus that industry pays close attention to.For high definition/ultra high-definition video communication, existing video encoding standard compares distance certain in addition at compression ratio with actual application demand.For this reason, International Organization for standardization ISO/IEC (MPEG) and ITU-T starts the planning of generation digital video compression standard---Video coding (High Efficiency Video Coding efficiently, HEVC), target is on H.264/AVC high-grade basis, and compression efficiency is enhanced about more than once.
Efficient Video coding (HEVC) always has two links in loop filtering process: block-eliminating effect filtering and self adaptation sampling point compensate SAO.Wherein, self adaptation sampling point compensation SAO can be divided into banded compensation (Band Offset, BO) and the large class of edge compensation (Edge offset, EO) two further.Edge compensation algorithm (EO) compensates mainly for the profile of object each in image, needs from level, vertical, left-leaning unity slope and selects four class adjacent encoder blocks of Right deviation unity slope a kind ofly to carry out comparing of the value of current pixel point and the value of adjacent two pixels.Banded backoff algorithm (BO) is mainly used in compensating the color of object inside each in image and lines information, its division compensating type is completely based on the amplitude of pixel itself, that is image pixel intensities is divided into 32 grades from 0 to maximum by HEVC, the selection of percent of pass aberration optimizing, wherein the pixel compensation of 4 successives finally will write code stream.
HEVC encoder, according to picture material, adopts the method for rate-distortion optimization, chooses best coding mode in alternative modes numerous with interframe in frame.Although to a certain extent, rate distortion mode adjudging can make cataloged procedure become complicated, just because of the application of rate-distortion optimization technology, encoder can obtain optimum prediction information as far as possible, thus ensure that picture quality, improves the overall performance of encoder.
The rate distortion framework of conventional video coding comprises HEVC and all uses mean square error MSE to calculate as distortion value.Although in most cases MSE can reflect the real quality of image, still have certain situation to be that MSE does not reflect, such as salt-pepper noise can produce huge interference to picture, and the MSE of whole two field picture may not be very large.
Summary of the invention
The object of the invention is to overcome the deficiencies in the prior art, provide a kind of reasonable in design, subjective visual quality is high and can remove more perception redundancies based on the HEVC rate-distortion optimization algorithm just can examining perceived quality decision rule.
The present invention solves existing technical problem and takes following technical scheme to realize:
Based on the HEVC rate-distortion optimization algorithm just can examining perceived quality decision rule, comprise the following steps:
Step 1, before the coding side of efficient video codec carries out mode adjudging, analyze the motor pattern of each macro block in each frame and static textural characteristics, obtain the perceived quality type of current macro, and obtain salient region of image according to different motion states;
Step 2, to calculate view-based access control model salient region according to specific image region just can examine distortion threshold;
Step 3, according to view-based access control model salient region just can examine distortion threshold calculate based on the perceived quality just can examining distortion model;
Step 4, basis carry out rate-distortion optimization based on the perceived quality just can examining distortion model.
And described step 1 perceived quality type adopts following Mathematical Modeling to obtain:
mp k t = Normal Pattern if max { P k t } = p 0 , k t , s k t ≠ SS Aliased Pattern if max { P k t } = p 1 , k t , s k t ≠ SS Hysteresis Pattern if max { P k t } ≠ p 2 , k t , s k t = SS Background if max { P k t } = p 2 , k t , s k t ≠ SS
In formula, for the perceived quality type of current macro, Normal Pattern is normal perceived quality type, and Aliased Pattern is distortion-aware quality type, and Hysteresis Pattern is delayed perceived quality type, and Background is static perceived quality type, with be respectively the probability of normal condition, distortion status, inactive state appearance, for with vector form, for status attribute descriptor, SS represents inactive state.
And described salient region of image is the region that the macro block of Normal Pattern type and the macro block of Hysteresis Pattern type combine formation.
And the distortion threshold of just can examining of described step 2 view-based access control model salient region calculates according to the following equation:
In above-mentioned formula, FJND be view-based access control model salient region just can examine distortion threshold, T basic(k, n, i, j), F lum, F contrast, F temporaland F foveabasic threshold value, intensity modifier value, contrast correction value, time-domain correction value and marking area correction value respectively.
And described step 3 is obtained by PSNR-HA and PSNR-HMA two kinds of distortion criterion weightings based on the perceived quality just can examining distortion model, and its computational methods are as follows:
(1) for given reference block A and distortion block B, the difference of both calculating with the mean value of reference block A and distortion block B coefficient respectively;
(2) correction matrix C=B+Delt is obtained;
(3) correction factor is calculated ρ = Σ ( A - A ‾ ) ( C - C ‾ ) Σ ( C - C ‾ ) 2 ;
(4) revised macro block D=C × ρ is calculated;
(5) calculating just can examine the revised distortion value MSE of distortion model hVS, computational methods are as follows:
Wherein coeff o(i, j) and coeff d(i, j) represents the pixel value of reference block and reconstructed block correspondence position respectively, and what jnd (i, j) then represented second correspondence position calculated just can examine distortion threshold;
(6) if M 1> M 2then
M 1 = M 2 + ( M 1 - M 2 ) coef 1 , &rho; < 1 ( M 1 - M 2 ) coef 2 , &rho; &GreaterEqual; 1 ; Finally obtain
MSE HVS=M 1+Delt 2×coef3
In above formula, coef1, coef2, coef3 represent the modifying factor to perceptual error, get 0.002,0.25 and 0.04 respectively according to experiment experience;
(7) PSNR-HA is obtained according to the above-mentioned perceived quality distortion calculated:
PSNR - HA = 10 log 10 ( 255 2 M )
For coloured image, M is Y, Cr, Cb luminance component, the average distortion that the perceptual distortion weighting of two color components obtains, as shown in the formula calculating:
M=(M Y+M Cb×coef4+M Cr×coef4)/(1+2×coef4)
M y, M cb, M cra luminance component Y respectively, the perceptual distortion of two color components Cb, Cr, weight coefficient coef4 is 0.5;
The correction of PSNR-HMA is at calculating MSE hVStime do corresponding correction with distortion model just can be examined.
And the method that described step 4 carries out rate-distortion optimization is: just will can examine perceived quality based on salient region of image and melt and be combined on R-λ model, and obtain as drag:
dJ dR = dQ dR - &lambda; = 0
In above formula, J is cost function, then λ=dQ/dR, for each macro block, and calculation cost function:
j 1=q 1(QP 1)-λ·r 1(QP 1)for?coding?block#1
j 2=q 2(QP 2)-λ·r 2(QP 2)for?coding?block#2
……
j N=q N(QP N)-λ·r N(QP N)for?coding?block#N
max { J } = max ( &Sigma; n = 1 N q n ( QP n ) - &lambda; &CenterDot; &Sigma; n = 1 N r i ( QP n ) )
In above formula, QP is quantization step, compares by search the coding mode obtaining making max{J} minimum.
Advantage of the present invention and good effect are:
The present invention is reasonable in design, it adopts and carries out HEVC rate-distortion optimization based on just examining perceived quality decision rule, mean square error MSE can be overcome as the deficiency weighing video distortion evaluation criterion, final encoding efficiency is made more to meet the subjective perceptual quality of human eye, meanwhile, more noise can be tolerated under the prerequisite that the video after coding does not reduce at subjective quality, remove unnecessary perception redundancy, thus improve compression efficiency, reduce the code check of the rear file of coding.
Accompanying drawing explanation
Fig. 1 is general frame figure of the present invention;
Fig. 2 is the video interception that embodiment provides;
Fig. 3 is the remarkable figure to obtaining after Fig. 2 process.
Embodiment
Below in conjunction with accompanying drawing, the embodiment of the present invention is further described.
Based on the HEVC rate-distortion optimization algorithm just can examining perceived quality decision rule, as shown in Figure 1, comprise the following steps:
Step 1, before the coding side of efficient video codec carries out mode adjudging, analyze the motor pattern of each macro block in each frame and static textural characteristics, draw the perceived quality type of current macro, and obtain salient region of image according to different motion states.
In this step, for each coding unit CU of each frame, and its each divided block, first set up the status attribute descriptor of complete set (representing the state of the kth macro block in a frame of t), as follows:
s k t = Normal state ( NS ) if MV k t &NotEqual; 0 , R k t &le; 1 Aliased state ( AS ) if MV k t &NotEqual; 0 , R k t > 1 Stationary state if MV k t = 0 Wherein
R k t = Temporal residual energy Texture energy = &Sigma; q &Element; B k t { B k t ( q ) - B k t ( q + MV k t ) } 2 &Sigma; q &Element; B k t { B k t ( q ) } 2 - 1 N &times; N { &Sigma; q &Element; B k t ( B k t ( q ) ) } 2
In formula, NS, AS, SS represent normal condition, distortion status, inactive state respectively, represent motion vector, the ratio representing current block residual energy and texture energy ratio, represent the kth macro block in a frame of t, the size of q to be corresponding quantization parameter N × N be macro block.
With vector form represent status attribute descriptor as follows:
I k t = [ 0,1,1 ] T if s k t = NS [ 1 , 0,1 ] T if s k t = AS L k t = &Sigma; r = t - T t I k t = [ L 0 , k t , L 1 , k t , L 2 , k t ] T [ 1,1,0 ] T if s k t = SS
Wherein three values corresponding three kinds of states situation about not occurring respectively, namely 0 represents and does not occur, and 1 represents and occurs.
At a time, represent the weight of the probability that NS, AS and SS occur respectively, three uses vector form represent; with represent the probability that in three, state occurs, be expressed as vector form as follows:
P k t &equiv; p 0 , k t p 1 , k t p 2 , k t = 1 | W k t | &omega; 0 , k t &omega; 1 , k t &omega; 2 , k t , Wherein W k t = &omega; 0 , k t &omega; 1 , k t &omega; 2 , k t = &omega; 0 , k t &CenterDot; exp - &eta;L k t ( 0 ) &omega; 1 , k t &CenterDot; exp - &eta;L k t ( 1 ) &omega; 2 , k t &CenterDot; exp - &eta;L k t ( 2 )
η is a constant being greater than 1, represents iterative rate;
Draw each macro block probability vector according to above-mentioned parameter, judge the perceived quality type belonging to it according to following criterion as follows:
mp k t = Normal Pattern if max { P k t } = p 0 , k t , s k t &NotEqual; SS Aliased Pattern if max { P k t } = p 1 , k t , s k t &NotEqual; SS Hysteresis Pattern if max { P k t } &NotEqual; p 2 , k t , s k t = SS Background if max { P k t } = p 2 , k t , s k t &NotEqual; SS
Background in formula represents that picture still is motionless, is background parts;
Think in this method and only belong to Normal Pattern and Hysteresis Pattern two kinds of situations both attracting attentivenesss, the details differentiating picture can be divided again.The region that the macro block belonging to these two kinds of patterns is combined and salient region.Fig. 2 gives a video interception, and Fig. 3 is the remarkable figure obtained by this step.
Step 2, to calculate view-based access control model salient region according to specific image region just can examine distortion threshold.
This method can attract the attentiveness of beholder for moving region in video content, and the visual sensitivity of human eye outwards declines gradually along with vision central fovea, saliency region and tradition combine so just can be examined distortion model, further excavation video-aware redundancy, improves compression efficiency under the prerequisite ensureing viewing quality.
The distortion threshold FJND that just can examine of view-based access control model salient region calculates according to the following equation:
In above-mentioned formula, T basic(k, n, i, j), F lum, F contrast, F temporaland F foveabe basic threshold value, intensity modifier value, contrast correction value, time-domain correction value and marking area correction value respectively, wherein former three is for the threshold value of still image and modifying factor, after two modifying factors for video.The calculating of each factor is respectively:
(1) basic threshold value T basic(k, n, i, j):
In formula, (n, i, j) represents (i, the j) of n-th piece individual position, and s illustrates that space adds up the parameter of effect, gets 0.25 by empirical value; A, b, c, r are that constant equals 1.33,0.11,0.18 and 0.6 respectively.φ iand φ jit is the normalization factor of dct transform &phi; m = 1 / N , m = 0 2 / N , m > 0 , ω ijit is frequency &omega; ij = 1 2 N ( i / &theta; x ) 2 + ( j / &theta; y ) 2 ,
(2) intensity modifier value F lumfor:
F Lum = ( 60 - I &OverBar; ) / 150 + 1 I &OverBar; &le; 60 1 60 < I &OverBar; < 170 ( I &OverBar; - 170 ) / 425 + 1 I &GreaterEqual; 170
In formula, represent the mean flow rate of this macro block.
(3) contrast correction value F contrastfor:
F Contrast = &psi; , for ( i 2 + j 2 ) &le; 16 in Plane and Edgeblock &psi; &CenterDot; min ( 4 , max ( 1 , ( C ( n , i , j ) T Basic ( n , i , j ) &CenterDot; F Lum ( n ) ) 0.36 ) ) , others
For the macro block F that plane, edge also have texture tri-kinds dissimilar contrastaccording to above-mentioned formulae discovery.First, candy operator is used to find out marginal points all in image; Then, edge calculation dot density function ρ edge=Σ edge/N 2; Then according to the value of two formulae discovery ψ below:
Block _ type = Plane , &rho; edge &le; 0.1 Edge , 0.1 < &rho; edge &le; 0.2 Texture , &rho; edge > 0.2 , &psi; = 1 , for Plane and Edge block 2.25 , for ( i 2 + j 2 ) &le; 16 in Texture block 1.25 , for ( i 2 + j 2 ) > 16 in Texture block
So just can obtain the contrast correction factor.
(4) time-domain correction value F temporalfor:
F in above formula ttemporal frequency, f sit is spatial frequency.
(5) marking area correction value F foveafor:
F Fovea ( k , n , i , j , v , e ) = W f &kappa; ( bg ( k , i , j ) ) ( v , e )
In above formula, κ (bg (k, i, j)) is background luminance function, and this background luminance function is as follows:
&kappa; ( bg ( k , i , j ) ) = 0.5 + 1 2 &pi; &sigma; exp ( - ( log 2 ( bg ( k , i , j ) + 1 ) - &mu; ) 2 2 &sigma; 2 ) &mu; = 7 , &sigma; = 0.8
W f ( v , e ) = 1 + ( 1 - f m ( v , e ) f m ( v , 0 ) ) &epsiv; , &epsiv; = 1.0
f m(v,e)=min(f c(e),f d(v))
F cdetermined by contrast sensitivity function, be set to maximum 1.0 herein; f dbe display cut-off frequency, be set to the half of monitor resolution.Parameter e represents the distance of the saliency regional center that this position obtains to Part I.
Step 3, according to view-based access control model salient region just can examine distortion threshold calculate based on the perceived quality just can examining distortion model.
The present invention uses the calculating replacing distortion in original coding framework based on the perceived quality evaluation BMMF that just can examine distortion model, and this perceived quality evaluation BMMF is calculated as follows and obtains:
BMMF x ( n ) = &Sigma; n = 1 3 w xy &CenterDot; Q ^ xy ( B y o , B y d )
Wherein x is the index of judgement matrix, namely belongs to which macro block; Y represents the type belonging to this macro block, and Class1 represents smooth, and type 2 represents edge, and type 3 represents texture, with represent reference macroblock respectively and rebuild macro block; Q xyrepresent the value of x macro block under y type evaluation criterion;
BMMF x ( n ) = &Sigma; n = 1 3 w xy &CenterDot; Q ^ xy ( B y o , B y d )
Q in above-mentioned formula is based on the perceived quality just can examining distortion model (perceived quality decision rule), and this perceived quality is obtained by PSNR-HA and PSNR-HMA two kinds of distortion criterion weightings; then represent that distortion evaluation of estimate obtains weighted average, w xyit is weight.For PSNR-HA and PSNR-HMA, this method has done following correction on the existing basis just can examining distortion model, and concrete modification method comprises following process:
(1) for given reference block A and distortion block B, the difference of both calculating with the mean value of reference block A and distortion block B coefficient respectively;
(2) correction matrix C=B+Delt is obtained;
(3) correction factor is calculated &rho; = &Sigma; ( A - A &OverBar; ) ( C - C &OverBar; ) &Sigma; ( C - C &OverBar; ) 2 ;
(4) revised macro block D=C × ρ is calculated;
(5) calculating just can examine the revised distortion value MSE of distortion model hVS, as follows:
Wherein coeff o(i, j) and coeff d(i, j) represents the pixel value of reference block and reconstructed block correspondence position respectively.What jnd (i, j) then represented second correspondence position calculated just can examine distortion threshold, and the distortion being greater than this thresholding is that human eye can be examined, and the distortion being less than this thresholding to be human eye imperceptible, at utmost can excavate perception redundancy like this;
(6) if M 1> M 2then
M 1 = M 2 + ( M 1 - M 2 ) coef 1 , &rho; < 1 ( M 1 - M 2 ) coef 2 , &rho; &GreaterEqual; 1 ; Finally obtain
MSE HVS=M 1+Delt 2×coef3
In above-mentioned two formula, coef1, coef2, coef3 represent the modifying factor to perceptual error, get 0.002,0.25 and 0.04 respectively according to experiment experience;
(7) PSNR-HA can be obtained according to the above-mentioned perceived quality distortion calculated, as follows:
PSNR - HA = 10 log 10 ( 255 2 M )
For coloured image, M is Y, Cr, Cb luminance component, the average distortion that the perceptual distortion weighting of two color components obtains, as shown in the formula calculating:
M=(M Y+M Cb×coef4+M Cr×coef4)/(1+2×coef4)
M y, M cb, M crbe a luminance component Y respectively, the perceptual distortion of two color components Cb, Cr, weight coefficient coef4 is 0.5.
Correction and the PSNR-HA of PSNR-HMA are similar, are all at calculating MSE hVStime do corresponding correction with distortion model just can be examined, do not repeating at this.
Step 4, basis carry out rate-distortion optimization based on the perceived quality just can examining distortion model, find best coding mode.
Traditional rate-distortion optimization model is based on R-D model, and in HEVC, have employed R-λ model, and it is more accurate than R-D model.Just can examine perceived quality based on salient region of image incorporate wherein by what obtain in step 3 on the basis of R-λ model below, more be met the model of subjective quality, as follows:
dJ dR = dQ dR - &lambda; = 0
In above formula, J is cost function, then λ=dQ/dR.For each macro block, calculation cost function:
j 1=q 1(QP 1)-λ·r 1(QP 1)for?coding?block#1
j 2=q 2(QP 2)-λ·r 2(QP 2)for?coding?block#2
……
j N=q N(QP N)-λ·r N(QP N)for?coding?block#N
max { J } = max ( &Sigma; n = 1 N q n ( QP n ) - &lambda; &CenterDot; &Sigma; n = 1 N r i ( QP n ) )
In above formula, QP is quantization step, and the target of optimization compares by searching for seemingly the coding mode obtaining making max{J} minimum.
Do a test according to method of the present invention below, experiment effect of the present invention is described.
Test environment: Visual Studio2010;
Cycle tests: select the video test sequence of three kinds of sizes as follows from HEVC official cycle tests:
832x480:BQMall,Basketball-Drill
1280x720:Johnny,FourPeople
1920x1080:Basketball-Drive,BQterrace
Test result is as follows:
Table one Y-PSNR (dB)
Table two code check (kbps)
Table three subjective testing standard
Table four subjective test results
Experiment conclusion
This experimental result chooses six HEVC standard cycle testss.Can to find out compared with HEVC reference software HM10.0 under the identical quantization parameter of same sequence (QP) value that Y-PSNR (PSNR) will low 1.13dB ~ 4.12dB according to table one, this can tolerate more noise under prerequisite that this technology remains unchanged at Subjective video quality is described.Under can obtaining identical QP value according to table two, compare original HEVC and use the present invention can obtain less bit rate, improve compression ratio.
In subjective test, original HM10.0 compressed video is placed on left side, compressed video of the present invention is put in right side, asks 30 observers to test according to the standard of table three, obtains the result of table four after statistical average.Can find out that compressed video of the present invention is generally better than HEVC primitive technology, especially under the condition of larger QP value qualitatively supervisor; And when little QP value because the distortion after primitive technology compression is very little, so the two subjective difference can be ignored substantially.
Through objective and subjective experiment, test result all shows compared with original HEVC coding techniques, the present invention can overcome mean square error MSE as the deficiency weighing video distortion evaluation criterion, more noise is tolerated under the prerequisite that subjective quality does not reduce, remove unnecessary perception redundancy, thus raising compression efficiency, reduce the code check of the rear file of coding.
It is emphasized that; embodiment of the present invention is illustrative; instead of it is determinate; therefore the present invention includes the embodiment be not limited to described in embodiment; every other execution modes drawn by those skilled in the art's technical scheme according to the present invention, belong to the scope of protection of the invention equally.

Claims (6)

1.一种基于显著图及恰可察感知质量判决准则的HEVC率失真优化算法,其特征在于包括以下步骤:1. A HEVC rate-distortion optimization algorithm based on saliency graph and perceivable quality judgment criterion, characterized in that it comprises the following steps: 步骤1、在高效视频编解码器的编码端进行模式判决前,分析每一帧中每个宏块的运动模式及静态纹理特征,获得当前宏块的感知质量类型,并根据不同的运动状态得到图像显著区域;Step 1. Before the mode decision is made at the encoding end of the high-efficiency video codec, analyze the motion mode and static texture features of each macroblock in each frame to obtain the perceptual quality type of the current macroblock, and obtain according to different motion states salient areas of the image; 步骤2、根据显著图像区域计算基于视觉显著性区域的恰可察失真阈值;Step 2. Calculating the just detectable distortion threshold based on the visually salient region according to the salient image region; 步骤3、根据基于视觉显著性区域的恰可察失真阈值计算基于恰可察失真模型的感知质量;Step 3, calculating the perceptual quality based on the just detectable distortion model according to the just detectable distortion threshold based on the visual salient region; 步骤4、根据基于恰可察失真模型的感知质量进行率失真优化。Step 4. Perform rate-distortion optimization according to the perceptual quality based on the just-observable distortion model. 2.根据权利要求1所述的基于恰可察感知质量判决准则的HEVC率失真优化算法,其特征在于:所述步骤1感知质量类型采用以下数学模型获得:2. The HEVC rate-distortion optimization algorithm based on just perceptual quality judgment criterion according to claim 1, characterized in that: said step 1 perceptual quality type adopts the following mathematical model to obtain: mpmp kk tt == Normal PatternNormal Pattern ifif maxmax {{ PP kk tt }} == pp 00 ,, kk tt ,, sthe s kk tt &NotEqual;&NotEqual; SSSS Aliased PatternAliased Pattern ifif maxmax {{ PP kk tt }} == pp 11 ,, kk tt ,, sthe s kk tt &NotEqual;&NotEqual; SSSS Hysteresis PatternHysteresis Pattern ifif maxmax {{ PP kk tt }} &NotEqual;&NotEqual; pp 22 ,, kk tt ,, sthe s kk tt == SSSS Backgroundbackground ifif maxmax {{ PP kk tt }} == pp 22 ,, kk tt ,, sthe s kk tt &NotEqual;&NotEqual; SSSS 式中,为当前宏块的感知质量类型,Normal Pattern为正常感知质量类型,Aliased Pattern为失真感知质量类型,Hysteresis Pattern为滞后感知质量类型,Background为静止感知质量类型,分别为正常状态、失真状态、静止状态出现的概率,的向量形式,为状态属性描述符,SS表示静止状态。In the formula, is the perceptual quality type of the current macroblock, Normal Pattern is the normal perceptual quality type, Aliased Pattern is the distortion perceptual quality type, Hysteresis Pattern is the hysteresis perceptual quality type, Background is the static perceptual quality type, and are the probability of normal state, distorted state and static state, respectively, for and in vector form, It is a state attribute descriptor, and SS represents a static state. 3.根据权利要求2所述的基于恰可察感知质量判决准则的HEVC率失真优化算法,其特征在于:所述的图像显著区域为Normal Pattern类型的宏块和Hysteresis Pattern类型的宏块组合在一起构成的区域。3. The HEVC rate-distortion optimization algorithm based on just perceptual quality judgment criterion according to claim 2, characterized in that: the salient area of the image is a macroblock of the Normal Pattern type and a macroblock of the Hysteresis Pattern type combined in area together. 4.根据权利要求1所述的基于恰可察感知质量判决准则的HEVC率失真优化算法,其特征在于:所述步骤2基于视觉显著性区域的恰可察失真阈值按照下述公式计算得到:4. The HEVC rate-distortion optimization algorithm based on just perceptible perceptual quality judgment criterion according to claim 1, characterized in that: said step 2 is based on the just perceptible distortion threshold of the visually salient region calculated according to the following formula: 上述公式中,FJND为基于视觉显著性区域的恰可察失真阈值,TBasic(k,n,i,j)、FLum、FContrast、FTemporal和FFovea分别是基本阈值、亮度修正值、对比度修正值、时间域修正值和显著区域修正值。In the above formula, FJND is the just detectable distortion threshold based on the visual salience area, T Basic (k,n,i,j), F Lum , F Contrast , F Temporal and F Fovea are the basic threshold, brightness correction value, Contrast correction value, time domain correction value and salient area correction value. 5.根据权利要求1所述的基于恰可察感知质量判决准则的HEVC率失真优化算法,其特征在于:所述步骤3基于恰可察失真模型的感知质量由PSNR-HA和PSNR-HMA两种失真准则加权而得,其计算方法如下:5. The HEVC rate-distortion optimization algorithm based on just perceptual quality judgment criterion according to claim 1, characterized in that: said step 3 is based on the perceptual quality of just perceptible distortion model by PSNR-HA and PSNR-HMA It is obtained by weighting the distortion criteria, and its calculation method is as follows: (1)对于给定的参考块A和失真块B,计算二者的差异分别是参考块A和失真块B系数的平均值;(1) For a given reference block A and distortion block B, calculate the difference between the two and are the average values of the coefficients of the reference block A and the distortion block B, respectively; (2)得到修正矩阵C=B+Delt;(2) obtain correction matrix C=B+Delt; (3)计算修正系数 &rho; = &Sigma; ( A - A &OverBar; ) ( C - C &OverBar; ) &Sigma; ( C - C &OverBar; ) 2 ; (3) Calculate the correction factor &rho; = &Sigma; ( A - A &OverBar; ) ( C - C &OverBar; ) &Sigma; ( C - C &OverBar; ) 2 ; (4)计算修正后的宏块D=C×ρ;(4) Calculating the corrected macroblock D=C×ρ; (5)计算恰可察失真模型修正后的失真值MSEHVS,计算方法如下:(5) Calculate the distortion value MSE HVS corrected by the just observable distortion model, the calculation method is as follows: 其中coeffo(i,j)和coeffd(i,j)分别代表参考块和重建块对应位置的像素值,jnd(i,j)则表示第二部计算的对应位置的恰可察失真门限;Among them, coeff o (i, j) and coeff d (i, j) represent the pixel values of the corresponding positions of the reference block and the reconstruction block, respectively, and jnd (i, j) represents the just detectable distortion threshold of the corresponding position calculated in the second part ; (6)如果M1>M2(6) If M 1 >M 2 then M 1 = M 2 + ( M 1 - M 2 ) coef 1 , &rho; < 1 ( M 1 - M 2 ) coef 2 , &rho; &GreaterEqual; 1 ; 最终得到 m 1 = m 2 + ( m 1 - m 2 ) coef 1 , &rho; < 1 ( m 1 - m 2 ) coef 2 , &rho; &Greater Equal; 1 ; finally got MSEHVS=M1+Delt2×coef3MSE HVS = M 1 +Delt 2 ×coef3 上式中,coef1、coef2、coef3表示对感知误差的修正因子,按照实验经验分别取0.002、0.25和0.04;In the above formula, coef1, coef2, and coef3 represent correction factors for perceptual errors, which are respectively 0.002, 0.25, and 0.04 according to experimental experience; (7)根据上述计算得到的感知质量失真得到PSNR-HA:(7) According to the perceptual quality distortion obtained from the above calculation, PSNR-HA is obtained: PSNRPSNR -- HAHA == 1010 loglog 1010 (( 255255 22 Mm )) 对于彩色图像,M是Y,Cr,Cb一个亮度分量,两个颜色分量的感知失真加权得到的平均失真,如下式计算得到:For a color image, M is a brightness component of Y, Cr, and Cb, and the average distortion obtained by weighting the perceptual distortion of the two color components is calculated as follows: M=(MY+MCb×coef4+MCr×coef4)/(1+2×coef4)M=(M Y +M Cb ×coef4+M Cr ×coef4)/(1+2×coef4) MY、MCb、MCr分别是一个亮度分量Y,两个颜色分量Cb、Cr的感知失真,加权系数coef4是0.5;M Y , M Cb , and M Cr are respectively a luminance component Y and the perceptual distortion of two color components Cb and Cr, and the weighting coefficient coef4 is 0.5; PSNR-HMA的修正是在计算MSEHVS时用恰可察失真模型做相应修正。The correction of PSNR-HMA is to use the just detectable distortion model to make corresponding corrections when calculating MSE HVS . 6.根据权利要求1所述的基于恰可察感知质量判决准则的HEVC率失真优化算法,其特征在于:所述步骤4进行率失真优化的方法为:将基于图像显著区域恰可察感知质量融结合在R-λ模型上,得到如下模型:6. The HEVC rate-distortion optimization algorithm based on just perceptible perceptual quality judgment criterion according to claim 1, characterized in that: the method for performing rate-distortion optimization in step 4 is: the just perceptible perceptual quality based on the salient region of the image Combined with the R-λ model, the following model is obtained: dJj dRd == dQwxya dRd -- &lambda;&lambda; == 00 上式中J是代价函数,则λ=dQ/dR,针对每一个宏块,计算代价函数:In the above formula, J is the cost function, then λ=dQ/dR, for each macroblock, calculate the cost function: j1=q1(QP1)-λ·r1(QP1)for coding block#1j 1 =q 1 (QP 1 )-λ·r 1 (QP 1 ) for coding block#1 j2=q2(QP2)-λ·r2(QP2)for coding block#2j 2 =q 2 (QP 2 )-λ·r 2 (QP 2 ) for coding block#2 ……... jN=qN(QPN)-λ·rN(QPN)for coding block#Nj N =q N (QP N )-λ r N (QP N ) for coding block#N maxmax {{ JJ }} == maxmax (( &Sigma;&Sigma; nno == 11 NN qq nno (( QPQP nno )) -- &lambda;&lambda; &CenterDot;&Center Dot; &Sigma;&Sigma; nno == 11 NN rr ii (( QPQP nno )) )) 上式中,QP是量化步长,通过搜索比较得到使max{J}最小的编码模式。In the above formula, QP is the quantization step size, and the coding mode that minimizes max{J} is obtained by searching and comparing.
CN201410440120.3A 2014-09-01 2014-09-01 HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion Expired - Fee Related CN104219526B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410440120.3A CN104219526B (en) 2014-09-01 2014-09-01 HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410440120.3A CN104219526B (en) 2014-09-01 2014-09-01 HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion

Publications (2)

Publication Number Publication Date
CN104219526A true CN104219526A (en) 2014-12-17
CN104219526B CN104219526B (en) 2017-05-24

Family

ID=52100604

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410440120.3A Expired - Fee Related CN104219526B (en) 2014-09-01 2014-09-01 HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion

Country Status (1)

Country Link
CN (1) CN104219526B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113810555A (en) * 2021-09-17 2021-12-17 福建省二建建设集团有限公司 A Video Quality Evaluation Method Based on Least Perceivable Difference and Block Effect
CN118101945A (en) * 2024-04-28 2024-05-28 石家庄铁道大学 Perceptual video coding method combining saliency and just-noticeable distortion
CN113810555B (en) * 2021-09-17 2025-02-18 福建省二建建设集团有限公司 A video quality assessment method based on just noticeable difference and blocking effect

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101710995A (en) * 2009-12-10 2010-05-19 武汉大学 Video coding system based on vision characteristic
CN102420988A (en) * 2011-12-02 2012-04-18 上海大学 Multi-view video coding system utilizing visual characteristics
CN103051901A (en) * 2013-01-14 2013-04-17 北京华兴宏视技术发展有限公司 Video data coding device and video data encoding method
CN103124347A (en) * 2012-10-22 2013-05-29 上海大学 Method for guiding multi-view video coding quantization process by visual perception characteristics
CN103220517A (en) * 2012-01-20 2013-07-24 索尼公司 Quantization matrix design for HEVC standard
CN103873861A (en) * 2014-02-24 2014-06-18 西南交通大学 Coding mode selection method for HEVC (high efficiency video coding)
US20140169451A1 (en) * 2012-12-13 2014-06-19 Mitsubishi Electric Research Laboratories, Inc. Perceptually Coding Images and Videos
CN103918271A (en) * 2011-06-01 2014-07-09 王舟 Method and system for perceptual video coding based on structural similarity

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101710995A (en) * 2009-12-10 2010-05-19 武汉大学 Video coding system based on vision characteristic
CN103918271A (en) * 2011-06-01 2014-07-09 王舟 Method and system for perceptual video coding based on structural similarity
CN102420988A (en) * 2011-12-02 2012-04-18 上海大学 Multi-view video coding system utilizing visual characteristics
CN103220517A (en) * 2012-01-20 2013-07-24 索尼公司 Quantization matrix design for HEVC standard
CN103124347A (en) * 2012-10-22 2013-05-29 上海大学 Method for guiding multi-view video coding quantization process by visual perception characteristics
US20140169451A1 (en) * 2012-12-13 2014-06-19 Mitsubishi Electric Research Laboratories, Inc. Perceptually Coding Images and Videos
CN103051901A (en) * 2013-01-14 2013-04-17 北京华兴宏视技术发展有限公司 Video data coding device and video data encoding method
CN103873861A (en) * 2014-02-24 2014-06-18 西南交通大学 Coding mode selection method for HEVC (high efficiency video coding)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113810555A (en) * 2021-09-17 2021-12-17 福建省二建建设集团有限公司 A Video Quality Evaluation Method Based on Least Perceivable Difference and Block Effect
CN113810555B (en) * 2021-09-17 2025-02-18 福建省二建建设集团有限公司 A video quality assessment method based on just noticeable difference and blocking effect
CN118101945A (en) * 2024-04-28 2024-05-28 石家庄铁道大学 Perceptual video coding method combining saliency and just-noticeable distortion

Also Published As

Publication number Publication date
CN104219526B (en) 2017-05-24

Similar Documents

Publication Publication Date Title
CN105306944B (en) Chromatic component Forecasting Methodology in hybrid video coding standard
US8787454B1 (en) Method and apparatus for data compression using content-based features
CN106028026A (en) Effective objective video quality evaluation method based on temporal-spatial structure
CN104219525B (en) Perception method for video coding based on conspicuousness and minimum discernable distortion
CN104378636B (en) A kind of video encoding method and device
CN103607590B (en) Efficient video coding based on structural similarity perceives Rate-distortion optimization method
CN106412572B (en) A kind of video flowing coding quality evaluation method based on kinetic characteristic
CN104469386B (en) A kind of perception method for encoding stereo video of the proper appreciable error model based on DOF
CN103096079B (en) A kind of multi-view video rate control based on proper discernable distortion
CN110199521A (en) Low complex degree hybrid domain for damaging Video coding cooperates with in-loop filter
CN107767363A (en) It is a kind of based on natural scene without refer to high-dynamics image quality evaluation algorithm
CN105635743A (en) Minimum noticeable distortion method and system based on saliency detection and total variation
EP4175297A1 (en) Point cloud quality assessment method, encoder, decoder, and storage medium
Yang et al. Optimized-SSIM based quantization in optical remote sensing image compression
CN104602028A (en) Entire frame loss error concealment method for B frame of stereoscopic video
CN108521572B (en) A Residual Filtering Method Based on Pixel Domain JND Model
CN105844640A (en) Color image quality evaluation method based on gradient
CN104219526A (en) HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion
CN105141967B (en) Based on the quick self-adapted loop circuit filtering method that can just perceive distortion model
Zhang et al. Towards accurate visual information estimation with entropy of primitive
CN112235574B (en) A Cascading Method of Quantization Parameters for High Efficiency Video Coding Standard for Site Inspection Tool Images
CN112019840A (en) Video quality evaluation method combining JND and VMAF
Wang et al. Quality assessment for MPEG-2 video streams using a neural network model
Leszczuk et al. Freely available large-scale video quality assessment database in full-hd resolution with h. 264 coding
CN114549302A (en) An image super-resolution reconstruction method and system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170524

Termination date: 20210901

CF01 Termination of patent right due to non-payment of annual fee