CN104219526A

CN104219526A - HEVC rate distortion optimization algorithm based on just-noticeable perception quality judging criterion

Info

Publication number: CN104219526A
Application number: CN201410440120.3A
Authority: CN
Inventors: 周芸; 于洋; 王辉淇; 李敬娜
Original assignee: Beijing University of Posts and Telecommunications; Academy of Broadcasting Science Research Institute
Current assignee: Beijing University of Posts and Telecommunications; Academy of Broadcasting Science Research Institute
Priority date: 2014-09-01
Filing date: 2014-09-01
Publication date: 2014-12-17
Anticipated expiration: 2034-09-01
Also published as: CN104219526B

Abstract

The present invention relates to an HEVC rate-distortion optimization algorithm based on just perceptible perceptual quality judgment criteria, and its technical characteristics are: analyzing the motion mode and static texture feature of each macroblock in each frame, and obtaining the perceptual quality type of the current macroblock , get the salient area of the image; calculate the just detectable distortion threshold based on the visual salient area; calculate the perceptual quality based on the just perceptible distortion model; perform rate-distortion optimization according to the perceptual quality based on the just perceptible distortion model. The present invention has a reasonable design, and adopts the HEVC rate-distortion optimization based on just perceptible perceptual quality judgment criteria, which can overcome the deficiency of mean square error (MSE) as an evaluation criterion for measuring video distortion, so that the final coding effect is more in line with the subjective perceptual quality of human eyes. At the same time, more noise is tolerated without reducing the subjective quality, and unnecessary perceptual redundancy is removed, thereby improving compression efficiency and reducing the code rate of the encoded file.

Description

Based on the HEVC rate-distortion optimization algorithm just can examining perceived quality decision rule

Technical field

The invention belongs to technical field of video coding, especially a kind of HEVC rate-distortion optimization algorithm based on perceived quality decision rule just can be examined.

Background technology

In recent years, along with the fast development of national economy, the progress of technology and people's improving constantly video quality demands, high definition/ultra high-definition video coding technique becomes as the basic core technology of the business such as future home movie theatre, digital broadcast television, Internet video, high-definition movie the focus that industry pays close attention to.For high definition/ultra high-definition video communication, existing video encoding standard compares distance certain in addition at compression ratio with actual application demand.For this reason, International Organization for standardization ISO/IEC (MPEG) and ITU-T starts the planning of generation digital video compression standard---Video coding (High Efficiency Video Coding efficiently, HEVC), target is on H.264/AVC high-grade basis, and compression efficiency is enhanced about more than once.

Efficient Video coding (HEVC) always has two links in loop filtering process: block-eliminating effect filtering and self adaptation sampling point compensate SAO.Wherein, self adaptation sampling point compensation SAO can be divided into banded compensation (Band Offset, BO) and the large class of edge compensation (Edge offset, EO) two further.Edge compensation algorithm (EO) compensates mainly for the profile of object each in image, needs from level, vertical, left-leaning unity slope and selects four class adjacent encoder blocks of Right deviation unity slope a kind ofly to carry out comparing of the value of current pixel point and the value of adjacent two pixels.Banded backoff algorithm (BO) is mainly used in compensating the color of object inside each in image and lines information, its division compensating type is completely based on the amplitude of pixel itself, that is image pixel intensities is divided into 32 grades from 0 to maximum by HEVC, the selection of percent of pass aberration optimizing, wherein the pixel compensation of 4 successives finally will write code stream.

HEVC encoder, according to picture material, adopts the method for rate-distortion optimization, chooses best coding mode in alternative modes numerous with interframe in frame.Although to a certain extent, rate distortion mode adjudging can make cataloged procedure become complicated, just because of the application of rate-distortion optimization technology, encoder can obtain optimum prediction information as far as possible, thus ensure that picture quality, improves the overall performance of encoder.

The rate distortion framework of conventional video coding comprises HEVC and all uses mean square error MSE to calculate as distortion value.Although in most cases MSE can reflect the real quality of image, still have certain situation to be that MSE does not reflect, such as salt-pepper noise can produce huge interference to picture, and the MSE of whole two field picture may not be very large.

Summary of the invention

The object of the invention is to overcome the deficiencies in the prior art, provide a kind of reasonable in design, subjective visual quality is high and can remove more perception redundancies based on the HEVC rate-distortion optimization algorithm just can examining perceived quality decision rule.

The present invention solves existing technical problem and takes following technical scheme to realize:

Based on the HEVC rate-distortion optimization algorithm just can examining perceived quality decision rule, comprise the following steps:

Step 1, before the coding side of efficient video codec carries out mode adjudging, analyze the motor pattern of each macro block in each frame and static textural characteristics, obtain the perceived quality type of current macro, and obtain salient region of image according to different motion states;

Step 2, to calculate view-based access control model salient region according to specific image region just can examine distortion threshold;

Step 3, according to view-based access control model salient region just can examine distortion threshold calculate based on the perceived quality just can examining distortion model;

Step 4, basis carry out rate-distortion optimization based on the perceived quality just can examining distortion model.

And described step 1 perceived quality type adopts following Mathematical Modeling to obtain:

{mp}_{k}^{t} = \{\begin{matrix} Normal Pattern & if & \max {P_{k}^{t}} = p_{0, k}^{t}, s_{k}^{t} &NotEqual; SS \\ Aliased Pattern & if & \max {P_{k}^{t}} = p_{1, k}^{t}, s_{k}^{t} &NotEqual; SS \\ Hysteresis Pattern & if & \max {P_{k}^{t}} &NotEqual; p_{2, k}^{t}, s_{k}^{t} = SS \\ Background & if & \max {P_{k}^{t}} = p_{2, k}^{t}, s_{k}^{t} &NotEqual; SS \end{matrix}

In formula, for the perceived quality type of current macro, Normal Pattern is normal perceived quality type, and Aliased Pattern is distortion-aware quality type, and Hysteresis Pattern is delayed perceived quality type, and Background is static perceived quality type, with be respectively the probability of normal condition, distortion status, inactive state appearance, for with vector form, for status attribute descriptor, SS represents inactive state.

And described salient region of image is the region that the macro block of Normal Pattern type and the macro block of Hysteresis Pattern type combine formation.

And the distortion threshold of just can examining of described step 2 view-based access control model salient region calculates according to the following equation:

In above-mentioned formula, FJND be view-based access control model salient region just can examine distortion threshold, T _basic(k, n, i, j), F _lum, F _contrast, F _temporaland F _foveabasic threshold value, intensity modifier value, contrast correction value, time-domain correction value and marking area correction value respectively.

And described step 3 is obtained by PSNR-HA and PSNR-HMA two kinds of distortion criterion weightings based on the perceived quality just can examining distortion model, and its computational methods are as follows:

(1) for given reference block A and distortion block B, the difference of both calculating with the mean value of reference block A and distortion block B coefficient respectively;

(2) correction matrix C=B+Delt is obtained;

(3) correction factor is calculated

ρ = \frac{Σ (A - \overset{&OverBar;}{A}) (C - \overset{&OverBar;}{C})}{Σ {(C - \overset{&OverBar;}{C})}^{2}};

(4) revised macro block D=C × ρ is calculated;

(5) calculating just can examine the revised distortion value MSE of distortion model _hVS, computational methods are as follows:

Wherein coeff _o(i, j) and coeff _d(i, j) represents the pixel value of reference block and reconstructed block correspondence position respectively, and what jnd (i, j) then represented second correspondence position calculated just can examine distortion threshold;

(6) if M ₁> M ₂then

M_{1} = M_{2} + \{\begin{matrix} (M_{1} - M_{2}) coef 1, ρ < 1 \\ (M_{1} - M_{2}) coef 2, ρ &GreaterEqual; 1 \end{matrix};

Finally obtain

MSE _HVS＝M ₁+Delt ²×coef3

In above formula, coef1, coef2, coef3 represent the modifying factor to perceptual error, get 0.002,0.25 and 0.04 respectively according to experiment experience;

(7) PSNR-HA is obtained according to the above-mentioned perceived quality distortion calculated:

PSNR - HA = 10 \log_{10} (\frac{255^{2}}{M})

For coloured image, M is Y, Cr, Cb luminance component, the average distortion that the perceptual distortion weighting of two color components obtains, as shown in the formula calculating:

M＝(M _Y+M _Cb×coef4+M _Cr×coef4)/(1+2×coef4)

M _y, M _cb, M _cra luminance component Y respectively, the perceptual distortion of two color components Cb, Cr, weight coefficient coef4 is 0.5;

The correction of PSNR-HMA is at calculating MSE _hVStime do corresponding correction with distortion model just can be examined.

And the method that described step 4 carries out rate-distortion optimization is: just will can examine perceived quality based on salient region of image and melt and be combined on R-λ model, and obtain as drag:

\frac{dJ}{dR} = \frac{dQ}{dR} - λ = 0

In above formula, J is cost function, then λ=dQ/dR, for each macro block, and calculation cost function:

j ₁＝q ₁(QP ₁)-λ·r ₁(QP ₁)for?coding?block#1

j ₂＝q ₂(QP ₂)-λ·r ₂(QP ₂)for?coding?block#2

……

j _N＝q _N(QP _N)-λ·r _N(QP _N)for?coding?block#N

\max {J} = \max (Σ_{n = 1}^{N} q_{n} ({QP}_{n}) - λ \cdot Σ_{n = 1}^{N} r_{i} ({QP}_{n}))

In above formula, QP is quantization step, compares by search the coding mode obtaining making max{J} minimum.

Advantage of the present invention and good effect are:

The present invention is reasonable in design, it adopts and carries out HEVC rate-distortion optimization based on just examining perceived quality decision rule, mean square error MSE can be overcome as the deficiency weighing video distortion evaluation criterion, final encoding efficiency is made more to meet the subjective perceptual quality of human eye, meanwhile, more noise can be tolerated under the prerequisite that the video after coding does not reduce at subjective quality, remove unnecessary perception redundancy, thus improve compression efficiency, reduce the code check of the rear file of coding.

Accompanying drawing explanation

Fig. 1 is general frame figure of the present invention;

Fig. 2 is the video interception that embodiment provides;

Fig. 3 is the remarkable figure to obtaining after Fig. 2 process.

Embodiment

Below in conjunction with accompanying drawing, the embodiment of the present invention is further described.

Based on the HEVC rate-distortion optimization algorithm just can examining perceived quality decision rule, as shown in Figure 1, comprise the following steps:

Step 1, before the coding side of efficient video codec carries out mode adjudging, analyze the motor pattern of each macro block in each frame and static textural characteristics, draw the perceived quality type of current macro, and obtain salient region of image according to different motion states.

In this step, for each coding unit CU of each frame, and its each divided block, first set up the status attribute descriptor of complete set (representing the state of the kth macro block in a frame of t), as follows:

s_{k}^{t} = \{\begin{matrix} Normal state (NS) & if & {MV}_{k}^{t} &NotEqual; 0, R_{k}^{t} \leq 1 \\ Aliased state (AS) & if & {MV}_{k}^{t} &NotEqual; 0, R_{k}^{t} > 1 \\ Stationary state & if & {MV}_{k}^{t} = 0 \end{matrix}

Wherein

R_{k}^{t} = \frac{Temporal residual energy}{Texture energy} = \frac{Σ_{q &Element; B_{k}^{t}} {B_{k}^{t} (q) - B_{k}^{t} (q + {MV}_{k}^{t})}^{2}}{Σ_{q &Element; B_{k}^{t}} {B_{k}^{t} (q)}^{2} - \frac{1}{N \times N} {Σ_{q &Element; B_{k}^{t}} (B_{k}^{t} (q))}^{2}}

In formula, NS, AS, SS represent normal condition, distortion status, inactive state respectively, represent motion vector, the ratio representing current block residual energy and texture energy ratio, represent the kth macro block in a frame of t, the size of q to be corresponding quantization parameter N × N be macro block.

With vector form represent status attribute descriptor as follows:

I_{k}^{t} = \{\begin{matrix} {[0,1,1]}^{T} & if & s_{k}^{t} = NS \\ {[1, 0,1]}^{T} & if & s_{k}^{t} = AS & L_{k}^{t} = Σ_{r = t - T}^{t} I_{k}^{t} = {[L_{0, k}^{t}, L_{1, k}^{t}, L_{2, k}^{t}]}^{T} \\ {[1,1,0]}^{T} & if & s_{k}^{t} = SS \end{matrix}

Wherein three values corresponding three kinds of states situation about not occurring respectively, namely 0 represents and does not occur, and 1 represents and occurs.

At a time, represent the weight of the probability that NS, AS and SS occur respectively, three uses vector form represent; with represent the probability that in three, state occurs, be expressed as vector form as follows:

P_{k}^{t} &equiv; [\begin{matrix} p_{0, k}^{t} \\ p_{1, k}^{t} \\ p_{2, k}^{t} \end{matrix}] = \frac{1}{| W_{k}^{t} |} [\begin{matrix} ω_{0, k}^{t} \\ ω_{1, k}^{t} \\ ω_{2, k}^{t} \end{matrix}],

Wherein

W_{k}^{t} = [\begin{matrix} ω_{0, k}^{t} \\ ω_{1, k}^{t} \\ ω_{2, k}^{t} \end{matrix}] = [\begin{matrix} ω_{0, k}^{t} \cdot \exp^{- {ηL}_{k}^{t} (0)} \\ ω_{1, k}^{t} \cdot \exp^{- {ηL}_{k}^{t} (1)} \\ ω_{2, k}^{t} \cdot \exp^{- {ηL}_{k}^{t} (2)} \end{matrix}]

η is a constant being greater than 1, represents iterative rate;

Draw each macro block probability vector according to above-mentioned parameter, judge the perceived quality type belonging to it according to following criterion as follows:

{mp}_{k}^{t} = \{\begin{matrix} Normal Pattern & if & \max {P_{k}^{t}} = p_{0, k}^{t}, s_{k}^{t} &NotEqual; SS \\ Aliased Pattern & if & \max {P_{k}^{t}} = p_{1, k}^{t}, s_{k}^{t} &NotEqual; SS \\ Hysteresis Pattern & if & \max {P_{k}^{t}} &NotEqual; p_{2, k}^{t}, s_{k}^{t} = SS \\ Background & if & \max {P_{k}^{t}} = p_{2, k}^{t}, s_{k}^{t} &NotEqual; SS \end{matrix}

Background in formula represents that picture still is motionless, is background parts;

Think in this method and only belong to Normal Pattern and Hysteresis Pattern two kinds of situations both attracting attentivenesss, the details differentiating picture can be divided again.The region that the macro block belonging to these two kinds of patterns is combined and salient region.Fig. 2 gives a video interception, and Fig. 3 is the remarkable figure obtained by this step.

Step 2, to calculate view-based access control model salient region according to specific image region just can examine distortion threshold.

This method can attract the attentiveness of beholder for moving region in video content, and the visual sensitivity of human eye outwards declines gradually along with vision central fovea, saliency region and tradition combine so just can be examined distortion model, further excavation video-aware redundancy, improves compression efficiency under the prerequisite ensureing viewing quality.

The distortion threshold FJND that just can examine of view-based access control model salient region calculates according to the following equation:

In above-mentioned formula, T _basic(k, n, i, j), F _lum, F _contrast, F _temporaland F _foveabe basic threshold value, intensity modifier value, contrast correction value, time-domain correction value and marking area correction value respectively, wherein former three is for the threshold value of still image and modifying factor, after two modifying factors for video.The calculating of each factor is respectively:

(1) basic threshold value T _basic(k, n, i, j):

In formula, (n, i, j) represents (i, the j) of n-th piece individual position, and s illustrates that space adds up the parameter of effect, gets 0.25 by empirical value; A, b, c, r are that constant equals 1.33,0.11,0.18 and 0.6 respectively.φ _iand φ _jit is the normalization factor of dct transform

φ_{m} = \{\begin{matrix} \sqrt{1 / N}, m = 0 \\ \sqrt{2 / N}, m > 0 \end{matrix},

ω _ijit is frequency

ω_{ij} = \frac{1}{2 N} \sqrt{{(i / θ_{x})}^{2} + {(j / θ_{y})}^{2}},

(2) intensity modifier value F _lumfor:

F_{Lum} = \{\begin{matrix} (60 - \overset{&OverBar;}{I}) / 150 + 1 & \overset{&OverBar;}{I} \leq 60 \\ 1 & 60 < \overset{&OverBar;}{I} < 170 \\ (\overset{&OverBar;}{I} - 170) / 425 + 1 & I &GreaterEqual; 170 \end{matrix}

In formula, represent the mean flow rate of this macro block.

(3) contrast correction value F _contrastfor:

F_{Contrast} = \{\begin{matrix} ψ, for (i^{2} + j^{2}) \leq 16 in Plane and Edgeblock \\ ψ \cdot \min (4, \max (1, {(\frac{C (n, i, j)}{T_{Basic} (n, i, j) \cdot F_{Lum} (n)})}^{0.36})), others \end{matrix}

For the macro block F that plane, edge also have texture tri-kinds dissimilar _contrastaccording to above-mentioned formulae discovery.First, candy operator is used to find out marginal points all in image; Then, edge calculation dot density function ρ _edge=Σ edge/N ²; Then according to the value of two formulae discovery ψ below:

Block_type = \{\begin{matrix} Plane, & ρ_{edge} \leq 0.1 \\ Edge, & 0.1 < ρ_{edge} \leq 0.2 \\ Texture, & ρ_{edge} > 0.2 \end{matrix}, ψ = \{\begin{matrix} 1, & for & Plane and Edge block \\ 2.25, & for & (i^{2} + j^{2}) \leq 16 in Texture block \\ 1.25, & for & (i^{2} + j^{2}) > 16 in Texture block \end{matrix}

So just can obtain the contrast correction factor.

(4) time-domain correction value F _temporalfor:

F in above formula _ttemporal frequency, f _sit is spatial frequency.

(5) marking area correction value F _foveafor:

F_{Fovea} (k, n, i, j, v, e) = W_{f}^{κ (bg (k, i, j))} (v, e)

In above formula, κ (bg (k, i, j)) is background luminance function, and this background luminance function is as follows:

κ (bg (k, i, j)) = 0.5 + \frac{1}{\sqrt{2 π} σ} \exp (- \frac{{(\log_{2} (bg (k, i, j) + 1) - μ)}^{2}}{{2 σ}^{2}}) μ = 7, σ = 0.8

W_{f} (v, e) = 1 + {(1 - \frac{f_{m} (v, e)}{f_{m} (v, 0)})}^{ϵ}, ϵ = 1.0

f _m(v,e)＝min(f _c(e),f _d(v))

F _cdetermined by contrast sensitivity function, be set to maximum 1.0 herein; f _dbe display cut-off frequency, be set to the half of monitor resolution.Parameter e represents the distance of the saliency regional center that this position obtains to Part I.

Step 3, according to view-based access control model salient region just can examine distortion threshold calculate based on the perceived quality just can examining distortion model.

The present invention uses the calculating replacing distortion in original coding framework based on the perceived quality evaluation BMMF that just can examine distortion model, and this perceived quality evaluation BMMF is calculated as follows and obtains:

{BMMF}_{x} (n) = Σ_{n = 1}^{3} w_{xy} \cdot {\hat{Q}}_{xy} (B_{y}^{o}, B_{y}^{d})

Wherein x is the index of judgement matrix, namely belongs to which macro block; Y represents the type belonging to this macro block, and Class1 represents smooth, and type 2 represents edge, and type 3 represents texture, with represent reference macroblock respectively and rebuild macro block; Q _xyrepresent the value of x macro block under y type evaluation criterion;

{BMMF}_{x} (n) = Σ_{n = 1}^{3} w_{xy} \cdot {\hat{Q}}_{xy} (B_{y}^{o}, B_{y}^{d})

Q in above-mentioned formula is based on the perceived quality just can examining distortion model (perceived quality decision rule), and this perceived quality is obtained by PSNR-HA and PSNR-HMA two kinds of distortion criterion weightings; then represent that distortion evaluation of estimate obtains weighted average, w _xyit is weight.For PSNR-HA and PSNR-HMA, this method has done following correction on the existing basis just can examining distortion model, and concrete modification method comprises following process:

(2) correction matrix C=B+Delt is obtained;

(3) correction factor is calculated

ρ = \frac{Σ (A - \overset{&OverBar;}{A}) (C - \overset{&OverBar;}{C})}{Σ {(C - \overset{&OverBar;}{C})}^{2}};

(4) revised macro block D=C × ρ is calculated;

(5) calculating just can examine the revised distortion value MSE of distortion model _hVS, as follows:

Wherein coeff _o(i, j) and coeff _d(i, j) represents the pixel value of reference block and reconstructed block correspondence position respectively.What jnd (i, j) then represented second correspondence position calculated just can examine distortion threshold, and the distortion being greater than this thresholding is that human eye can be examined, and the distortion being less than this thresholding to be human eye imperceptible, at utmost can excavate perception redundancy like this;

(6) if M ₁> M ₂then

M_{1} = M_{2} + \{\begin{matrix} (M_{1} - M_{2}) coef 1, ρ < 1 \\ (M_{1} - M_{2}) coef 2, ρ &GreaterEqual; 1 \end{matrix};

Finally obtain

MSE _HVS＝M ₁+Delt ²×coef3

In above-mentioned two formula, coef1, coef2, coef3 represent the modifying factor to perceptual error, get 0.002,0.25 and 0.04 respectively according to experiment experience;

(7) PSNR-HA can be obtained according to the above-mentioned perceived quality distortion calculated, as follows:

PSNR - HA = 10 \log_{10} (\frac{255^{2}}{M})

M＝(M _Y+M _Cb×coef4+M _Cr×coef4)/(1+2×coef4)

M _y, M _cb, M _crbe a luminance component Y respectively, the perceptual distortion of two color components Cb, Cr, weight coefficient coef4 is 0.5.

Correction and the PSNR-HA of PSNR-HMA are similar, are all at calculating MSE _hVStime do corresponding correction with distortion model just can be examined, do not repeating at this.

Step 4, basis carry out rate-distortion optimization based on the perceived quality just can examining distortion model, find best coding mode.

Traditional rate-distortion optimization model is based on R-D model, and in HEVC, have employed R-λ model, and it is more accurate than R-D model.Just can examine perceived quality based on salient region of image incorporate wherein by what obtain in step 3 on the basis of R-λ model below, more be met the model of subjective quality, as follows:

\frac{dJ}{dR} = \frac{dQ}{dR} - λ = 0

In above formula, J is cost function, then λ=dQ/dR.For each macro block, calculation cost function:

j ₁＝q ₁(QP ₁)-λ·r ₁(QP ₁)for?coding?block#1

j ₂＝q ₂(QP ₂)-λ·r ₂(QP ₂)for?coding?block#2

……

j _N＝q _N(QP _N)-λ·r _N(QP _N)for?coding?block#N

\max {J} = \max (Σ_{n = 1}^{N} q_{n} ({QP}_{n}) - λ \cdot Σ_{n = 1}^{N} r_{i} ({QP}_{n}))

In above formula, QP is quantization step, and the target of optimization compares by searching for seemingly the coding mode obtaining making max{J} minimum.

Do a test according to method of the present invention below, experiment effect of the present invention is described.

Test environment: Visual Studio2010;

Cycle tests: select the video test sequence of three kinds of sizes as follows from HEVC official cycle tests:

832x480：BQMall，Basketball-Drill

1280x720：Johnny，FourPeople

1920x1080：Basketball-Drive，BQterrace

Test result is as follows:

Table one Y-PSNR (dB)

Table two code check (kbps)

Table three subjective testing standard

Table four subjective test results

Experiment conclusion

This experimental result chooses six HEVC standard cycle testss.Can to find out compared with HEVC reference software HM10.0 under the identical quantization parameter of same sequence (QP) value that Y-PSNR (PSNR) will low 1.13dB ~ 4.12dB according to table one, this can tolerate more noise under prerequisite that this technology remains unchanged at Subjective video quality is described.Under can obtaining identical QP value according to table two, compare original HEVC and use the present invention can obtain less bit rate, improve compression ratio.

In subjective test, original HM10.0 compressed video is placed on left side, compressed video of the present invention is put in right side, asks 30 observers to test according to the standard of table three, obtains the result of table four after statistical average.Can find out that compressed video of the present invention is generally better than HEVC primitive technology, especially under the condition of larger QP value qualitatively supervisor; And when little QP value because the distortion after primitive technology compression is very little, so the two subjective difference can be ignored substantially.

Through objective and subjective experiment, test result all shows compared with original HEVC coding techniques, the present invention can overcome mean square error MSE as the deficiency weighing video distortion evaluation criterion, more noise is tolerated under the prerequisite that subjective quality does not reduce, remove unnecessary perception redundancy, thus raising compression efficiency, reduce the code check of the rear file of coding.

It is emphasized that; embodiment of the present invention is illustrative; instead of it is determinate; therefore the present invention includes the embodiment be not limited to described in embodiment; every other execution modes drawn by those skilled in the art's technical scheme according to the present invention, belong to the scope of protection of the invention equally.

Claims

1. A HEVC rate-distortion optimization algorithm based on saliency graph and perceivable quality judgment criterion, characterized in that it comprises the following steps:

Step 1. Before the mode decision is made at the encoding end of the high-efficiency video codec, analyze the motion mode and static texture features of each macroblock in each frame to obtain the perceptual quality type of the current macroblock, and obtain according to different motion states salient areas of the image;

Step 2. Calculating the just detectable distortion threshold based on the visually salient region according to the salient image region;

Step 3, calculating the perceptual quality based on the just detectable distortion model according to the just detectable distortion threshold based on the visual salient region;

Step 4. Perform rate-distortion optimization according to the perceptual quality based on the just-observable distortion model.

2. The HEVC rate-distortion optimization algorithm based on just perceptual quality judgment criterion according to claim 1, characterized in that: said step 1 perceptual quality type adopts the following mathematical model to obtain:

{mp mp}_{k k}^{t t} = = \{\begin{matrix} Normal Pattern Normal Pattern & if if & max max {{{P P}_{k k}^{t t}}} = = {p p}_{00,, k k}^{t t},, {s the s}_{k k}^{t t} &NotEqual; &NotEqual; SS SS \\ Aliased Pattern Aliased Pattern & if if & max max {{{P P}_{k k}^{t t}}} = = {p p}_{11,, k k}^{t t},, {s the s}_{k k}^{t t} &NotEqual; &NotEqual; SS SS \\ Hysteresis Pattern Hysteresis Pattern & if if & max max {{{P P}_{k k}^{t t}}} &NotEqual; &NotEqual; {p p}_{22,, k k}^{t t},, {s the s}_{k k}^{t t} = = SS SS \\ Background background & if if & max max {{{P P}_{k k}^{t t}}} = = {p p}_{22,, k k}^{t t},, {s the s}_{k k}^{t t} &NotEqual; &NotEqual; SS SS \end{matrix}

In the formula, is the perceptual quality type of the current macroblock, Normal Pattern is the normal perceptual quality type, Aliased Pattern is the distortion perceptual quality type, Hysteresis Pattern is the hysteresis perceptual quality type, Background is the static perceptual quality type, and are the probability of normal state, distorted state and static state, respectively, for and in vector form, It is a state attribute descriptor, and SS represents a static state.

3. The HEVC rate-distortion optimization algorithm based on just perceptual quality judgment criterion according to claim 2, characterized in that: the salient area of the image is a macroblock of the Normal Pattern type and a macroblock of the Hysteresis Pattern type combined in area together.

4. The HEVC rate-distortion optimization algorithm based on just perceptible perceptual quality judgment criterion according to claim 1, characterized in that: said step 2 is based on the just perceptible distortion threshold of the visually salient region calculated according to the following formula:

In the above formula, FJND is the just detectable distortion threshold based on the visual salience area, T _Basic (k,n,i,j), F _Lum , F _Contrast , F _Temporal and F _Fovea are the basic threshold, brightness correction value, Contrast correction value, time domain correction value and salient area correction value.

5. The HEVC rate-distortion optimization algorithm based on just perceptual quality judgment criterion according to claim 1, characterized in that: said step 3 is based on the perceptual quality of just perceptible distortion model by PSNR-HA and PSNR-HMA It is obtained by weighting the distortion criteria, and its calculation method is as follows:

(1) For a given reference block A and distortion block B, calculate the difference between the two and are the average values of the coefficients of the reference block A and the distortion block B, respectively;

(2) obtain correction matrix C=B+Delt;

(3) Calculate the correction factor

ρ = \frac{Σ (A - \overset{&OverBar;}{A}) (C - \overset{&OverBar;}{C})}{Σ {(C - \overset{&OverBar;}{C})}^{2}};

(4) Calculating the corrected macroblock D=C×ρ;

(5) Calculate the distortion value MSE _HVS corrected by the just observable distortion model, the calculation method is as follows:

Among them, coeff _o (i, j) and coeff _d (i, j) represent the pixel values of the corresponding positions of the reference block and the reconstruction block, respectively, and jnd (i, j) represents the just detectable distortion threshold of the corresponding position calculated in the second part ;

(6) If M ₁ >M ₂ then

m_{1} = m_{2} + \{\begin{matrix} (m_{1} - m_{2}) coef 1, ρ < 1 \\ (m_{1} - m_{2}) coef 2, ρ &Greater Equal; 1 \end{matrix};

finally got

MSE _HVS = M ₁ +Delt ² ×coef3

In the above formula, coef1, coef2, and coef3 represent correction factors for perceptual errors, which are respectively 0.002, 0.25, and 0.04 according to experimental experience;

(7) According to the perceptual quality distortion obtained from the above calculation, PSNR-HA is obtained:

PSNR PSNR - - HA HA = = 1010 {log log}_{1010} ((\frac{255255^{22}}{M m}))

For a color image, M is a brightness component of Y, Cr, and Cb, and the average distortion obtained by weighting the perceptual distortion of the two color components is calculated as follows:

M＝(M _Y +M _Cb ×coef4+M _Cr ×coef4)/(1+2×coef4)

M _Y , M _Cb , and M _Cr are respectively a luminance component Y and the perceptual distortion of two color components Cb and Cr, and the weighting coefficient coef4 is 0.5;

The correction of PSNR-HMA is to use the just detectable distortion model to make corresponding corrections when calculating MSE _HVS .

6. The HEVC rate-distortion optimization algorithm based on just perceptible perceptual quality judgment criterion according to claim 1, characterized in that: the method for performing rate-distortion optimization in step 4 is: the just perceptible perceptual quality based on the salient region of the image Combined with the R-λ model, the following model is obtained:

\frac{dJ j}{dR d} = = \frac{dQ wxya}{dR d} - - λ λ = = 00

In the above formula, J is the cost function, then λ=dQ/dR, for each macroblock, calculate the cost function:

j ₁ ＝q ₁ (QP ₁ )-λ·r ₁ (QP ₁ ) for coding block#1

j ₂ ＝q ₂ (QP ₂ )-λ·r ₂ (QP ₂ ) for coding block#2

...

j _N ＝q _N (QP _N )-λ r _N (QP _N ) for coding block#N

max max {{J J}} = = max max (({Σ Σ}_{n no = = 11}^{N N} {q q}_{n no} (({QP QP}_{n no})) - - λ λ \cdot &Center Dot; {Σ Σ}_{n no = = 11}^{N N} {r r}_{i i} (({QP QP}_{n no}))))

In the above formula, QP is the quantization step size, and the coding mode that minimizes max{J} is obtained by searching and comparing.