CN102903107B - Three-dimensional picture quality objective evaluation method based on feature fusion - Google Patents

Three-dimensional picture quality objective evaluation method based on feature fusion

Info

Publication number
CN102903107B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210357956.8A
Other languages
Chinese (zh)
Other versions
CN102903107A (en
Inventor
邵枫
段芬芳
蒋刚毅
郁梅
李福翠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Qizhen Information Technology Service Co ltd
Original Assignee
Ningbo University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ningbo University filed Critical Ningbo University
Priority to CN201210357956.8A
Publication of CN102903107A
Application granted
Publication of CN102903107B


Landscapes

  • Image Processing (AREA)

Abstract

The invention discloses an objective quality evaluation method for three-dimensional (stereoscopic) images based on feature fusion. First, a single eye diagram is computed for the original undistorted stereoscopic image and for the distorted stereoscopic image to be evaluated. From the mean and standard deviation of the pixel values around each pixel point in the two single eye diagrams, an objective evaluation metric value is obtained for each pixel point in the single eye diagram of the distorted stereoscopic image to be evaluated. Saliency maps of the two single eye diagrams and a distortion map between them are then computed and used to fuse the objective evaluation metric values of the pixel points in the single eye diagram of the distorted stereoscopic image, yielding an image quality objective evaluation predicted value for the distorted stereoscopic image to be evaluated. The method has the advantages that the obtained single eye diagrams simulate the binocular stereoscopic fusion process well, and that fusing the saliency maps and the distortion map effectively improves the correlation between the objective evaluation results and subjective perception.

Description

Stereo image quality objective evaluation method based on feature fusion
Technical Field
The invention relates to an image quality evaluation method, in particular to a stereo image quality objective evaluation method based on feature fusion.
Background
With the rapid development of image coding and stereoscopic display technology, stereoscopic imaging has received increasingly wide attention and application and has become a current research hotspot. Stereoscopic imaging exploits the binocular parallax principle of the human visual system: the left and right viewpoint images of the same scene are received independently by the left and right eyes and fused by the brain into binocular parallax, so that a stereoscopic image with a sense of depth and realism is perceived. Because of the influence of acquisition systems and of storage, compression and transmission equipment, a series of distortions are inevitably introduced into stereoscopic images, and compared with a single-channel image a stereoscopic image must preserve the image quality of two channels simultaneously, so quality evaluation of stereoscopic images is of great significance. However, there is currently no effective objective method for evaluating the quality of stereoscopic images. Establishing an effective objective quality evaluation model for stereoscopic images is therefore of great importance.
Existing objective evaluation methods for stereoscopic image quality simply apply planar image quality evaluation methods directly to stereoscopic images. However, the process by which the left and right viewpoint images are fused to produce the stereoscopic effect is not a simple superposition of the two images and is difficult to express with a simple mathematical model. How to effectively simulate binocular stereoscopic fusion during quality evaluation, and how to extract effective feature information with which to fuse the evaluation results so that the objective results better match the human visual system, are therefore problems that need to be studied and solved in objective stereoscopic image quality evaluation.
Disclosure of Invention
The invention aims to solve the technical problem of providing a three-dimensional image quality objective evaluation method based on feature fusion, which can effectively improve the correlation between objective evaluation results and subjective perception.
The technical scheme adopted by the invention for solving the technical problems is as follows: a stereo image quality objective evaluation method based on feature fusion is characterized in that the processing process is as follows: firstly, obtaining a single eye diagram of an original undistorted stereo image according to even symmetric frequency response and odd symmetric frequency response of each pixel point in a left viewpoint image and a right viewpoint image of the original undistorted stereo image at different scales and directions and a parallax image between the left viewpoint image and the right viewpoint image of the original undistorted stereo image; obtaining a single eye diagram of the distorted stereo image to be evaluated according to even symmetric frequency response and odd symmetric frequency response of each pixel point in the left viewpoint image and the right viewpoint image of the distorted stereo image to be evaluated in different scales and directions and a parallax image between the left viewpoint image and the right viewpoint image of the original undistorted stereo image; secondly, obtaining an objective evaluation metric value of each pixel point in the single eye diagram of the distorted stereo image to be evaluated according to the mean value and the standard deviation of the pixel values of each pixel point in the two single eye diagrams; thirdly, obtaining a corresponding saliency map according to the amplitude and the phase of the single eye map of the original undistorted stereo image; obtaining a corresponding saliency map according to the amplitude and the phase of the single eye map of the distorted stereo image to be evaluated; then, according to the two saliency maps and the distortion map between the two single eye maps, fusing objective evaluation metric values of each pixel point in the single eye map of the distorted three-dimensional image to be evaluated to obtain an objective evaluation prediction value of the image quality of the distorted three-dimensional image to be evaluated; and finally, obtaining the image quality objective evaluation predicted value of the distorted three-dimensional images with different distortion types and different distortion degrees according to the processing process.
The invention relates to a method for objectively evaluating the quality of a stereo image based on feature fusion, which comprises the following specific steps:
① Let S_org be the original undistorted stereo image and S_dis the distorted stereo image to be evaluated. Denote the left viewpoint image of S_org as {L_org(x, y)}, the right viewpoint image of S_org as {R_org(x, y)}, the left viewpoint image of S_dis as {L_dis(x, y)} and the right viewpoint image of S_dis as {R_dis(x, y)}, where (x, y) denotes the coordinate position of a pixel point in the left and right viewpoint images, 1 ≤ x ≤ W, 1 ≤ y ≤ H, W denotes the width of the left and right viewpoint images, H denotes their height, L_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {L_org(x, y)}, R_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {R_org(x, y)}, L_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {L_dis(x, y)}, and R_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {R_dis(x, y)};
② According to the even symmetric frequency response and odd symmetric frequency response of each pixel point in {L_org(x, y)}, {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)} at different scales and in different directions, obtain the amplitude of each pixel point in {L_org(x, y)}, {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)}. Then, from the amplitudes of the pixel points in {L_org(x, y)} and {R_org(x, y)} and the pixel values of the parallax image between {L_org(x, y)} and {R_org(x, y)}, calculate the single eye diagram of S_org, denoted {CM_org(x, y)}; and from the amplitudes of the pixel points in {L_dis(x, y)} and {R_dis(x, y)} and the pixel values of the parallax image between {L_org(x, y)} and {R_org(x, y)}, calculate the single eye diagram of S_dis, denoted {CM_dis(x, y)}, where CM_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {CM_org(x, y)} and CM_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {CM_dis(x, y)};
③ According to the mean and standard deviation of the pixel values around each pixel point in {CM_org(x, y)} and {CM_dis(x, y)}, calculate the objective evaluation metric value of each pixel point in {CM_dis(x, y)}; the objective evaluation metric value of the pixel point at coordinate position (x, y) in {CM_dis(x, y)} is denoted Q_image(x, y);
④ According to the amplitude and phase of {CM_org(x, y)}, calculate the saliency map of {CM_org(x, y)}, denoted {SM_org(x, y)}; according to the amplitude and phase of {CM_dis(x, y)}, calculate the saliency map of {CM_dis(x, y)}, denoted {SM_dis(x, y)}, where SM_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {SM_org(x, y)} and SM_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {SM_dis(x, y)};
⑤ Calculate the distortion map between {CM_org(x, y)} and {CM_dis(x, y)}, denoted {DM(x, y)}; the pixel value of the pixel point at coordinate position (x, y) in {DM(x, y)} is denoted DM(x, y), with DM(x, y) = (CM_org(x, y) − CM_dis(x, y))²;
⑥ According to {SM_org(x, y)}, {SM_dis(x, y)} and {DM(x, y)}, fuse the objective evaluation metric values of all pixel points in {CM_dis(x, y)} to obtain the image quality objective evaluation predicted value of S_dis, denoted Q:

$$Q=\left[\frac{\sum_{(x,y)\in\Omega} Q_{image}(x,y)\times SM(x,y)}{\sum_{(x,y)\in\Omega} SM(x,y)}\right]^{\gamma}\times\left[\frac{\sum_{(x,y)\in\Omega} Q_{image}(x,y)\times DM(x,y)}{\sum_{(x,y)\in\Omega} DM(x,y)}\right]^{\beta},$$

where Ω denotes the pixel domain, SM(x, y) = max(SM_org(x, y), SM_dis(x, y)), max() is the maximum-value function, and γ and β are weight coefficients;
⑦ Using n original undistorted stereo images, establish a set of distorted stereo images of different distortion types and different distortion degrees, the set containing a plurality of distorted stereo images; obtain by a subjective quality evaluation method the mean subjective score difference of each distorted stereo image in the set, denoted DMOS, with DMOS = 100 − MOS, where MOS denotes the mean opinion score, DMOS ∈ [0, 100], and n ≥ 1;
calculating S according to the steps from the first step to the sixth stepdisOperation of the image quality objective evaluation prediction valueAnd calculating the image quality objective evaluation prediction value of each distorted stereo image in the distorted stereo image set respectively.
The concrete process of the second step is as follows:
②-1. Filter {L_org(x, y)} to obtain the even symmetric frequency response and odd symmetric frequency response of each pixel point in {L_org(x, y)} at different scales and in different directions; the even symmetric frequency response of the pixel point at coordinate position (x, y) in {L_org(x, y)} at scale α and direction θ is denoted e_{α,θ}(x, y) and its odd symmetric frequency response is denoted o_{α,θ}(x, y), where α denotes the scale factor of the filter used for the filtering, 1 ≤ α ≤ 4, and θ denotes the direction factor of the filter used for the filtering, 1 ≤ θ ≤ 4;
②-2. From the even symmetric and odd symmetric frequency responses of each pixel point in {L_org(x, y)} at the different scales and directions, calculate the amplitude of each pixel point in {L_org(x, y)}; the amplitude of the pixel point at coordinate position (x, y) in {L_org(x, y)} is denoted GE^L_org(x, y):

$$GE^{L}_{org}(x,y)=\sum_{\theta=1}^{4}\sum_{\alpha=1}^{4}\sqrt{e_{\alpha,\theta}(x,y)^{2}+o_{\alpha,\theta}(x,y)^{2}};$$
②-3. Following the operations of steps ②-1 to ②-2 for obtaining the amplitude of each pixel point in {L_org(x, y)}, obtain in the same manner the amplitude of each pixel point in {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)}; the amplitude of the pixel point at coordinate position (x, y) in {R_org(x, y)} is denoted GE^R_org(x, y), the amplitude of the pixel point at coordinate position (x, y) in {L_dis(x, y)} is denoted GE^L_dis(x, y), and the amplitude of the pixel point at coordinate position (x, y) in {R_dis(x, y)} is denoted GE^R_dis(x, y);
②-4. Calculate the parallax image between {L_org(x, y)} and {R_org(x, y)} by a block matching method, denoted {d^L_org(x, y)}, where d^L_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {d^L_org(x, y)};
②-5. From the amplitudes of the pixel points in {L_org(x, y)} and {R_org(x, y)} and the pixel values of {d^L_org(x, y)}, calculate the single eye diagram of S_org, denoted {CM_org(x, y)}; the pixel value of the pixel point at coordinate position (x, y) in {CM_org(x, y)} is denoted CM_org(x, y):

$$CM_{org}(x,y)=\frac{GE^{L}_{org}(x,y)\times L_{org}(x,y)+GE^{R}_{org}(x-d^{L}_{org}(x,y),y)\times R_{org}(x-d^{L}_{org}(x,y),y)}{GE^{L}_{org}(x,y)+GE^{R}_{org}(x-d^{L}_{org}(x,y),y)},$$

where GE^R_org(x − d^L_org(x, y), y) denotes the amplitude of the pixel point at coordinate position (x − d^L_org(x, y), y) in {R_org(x, y)} and R_org(x − d^L_org(x, y), y) denotes the pixel value of the pixel point at coordinate position (x − d^L_org(x, y), y) in {R_org(x, y)};
②-6. From the amplitudes of the pixel points in {L_dis(x, y)} and {R_dis(x, y)} and the pixel values of {d^L_org(x, y)}, calculate the single eye diagram of S_dis, denoted {CM_dis(x, y)}; the pixel value of the pixel point at coordinate position (x, y) in {CM_dis(x, y)} is denoted CM_dis(x, y):

$$CM_{dis}(x,y)=\frac{GE^{L}_{dis}(x,y)\times L_{dis}(x,y)+GE^{R}_{dis}(x-d^{L}_{org}(x,y),y)\times R_{dis}(x-d^{L}_{org}(x,y),y)}{GE^{L}_{dis}(x,y)+GE^{R}_{dis}(x-d^{L}_{org}(x,y),y)},$$

where GE^R_dis(x − d^L_org(x, y), y) denotes the amplitude of the pixel point at coordinate position (x − d^L_org(x, y), y) in {R_dis(x, y)} and R_dis(x − d^L_org(x, y), y) denotes the pixel value of the pixel point at coordinate position (x − d^L_org(x, y), y) in {R_dis(x, y)}.
In step ②-1, the filter used to filter {L_org(x, y)} is a log-Gabor filter.
The concrete process of the step III is as follows:
③-1. Calculate the mean and standard deviation of the pixel values in a neighborhood of each pixel point in {CM_org(x, y)} and {CM_dis(x, y)}. The mean and standard deviation for the pixel point at coordinate position (x₁, y₁) in {CM_org(x, y)} are denoted μ_org(x₁, y₁) and σ_org(x₁, y₁), and those for the pixel point at coordinate position (x₁, y₁) in {CM_dis(x, y)} are denoted μ_dis(x₁, y₁) and σ_dis(x₁, y₁):

$$\mu_{org}(x_1,y_1)=\frac{\sum_{(x_2,y_2)\in N(x_1,y_1)} CM_{org}(x_2,y_2)}{M},\qquad \sigma_{org}(x_1,y_1)=\sqrt{\frac{\sum_{(x_2,y_2)\in N(x_1,y_1)}\left(CM_{org}(x_2,y_2)-\mu_{org}(x_1,y_1)\right)^{2}}{M}},$$

$$\mu_{dis}(x_1,y_1)=\frac{\sum_{(x_2,y_2)\in N(x_1,y_1)} CM_{dis}(x_2,y_2)}{M},\qquad \sigma_{dis}(x_1,y_1)=\sqrt{\frac{\sum_{(x_2,y_2)\in N(x_1,y_1)}\left(CM_{dis}(x_2,y_2)-\mu_{dis}(x_1,y_1)\right)^{2}}{M}},$$

where 1 ≤ x₁ ≤ W, 1 ≤ y₁ ≤ H, N(x₁, y₁) denotes the 8 × 8 neighborhood window centered on the pixel point at coordinate position (x₁, y₁), (x₂, y₂) ranges over the pixel points in N(x₁, y₁), M denotes the number of pixel points in N(x₁, y₁), CM_org(x₂, y₂) denotes the pixel value of the pixel point at coordinate position (x₂, y₂) in {CM_org(x, y)}, and CM_dis(x₂, y₂) denotes the pixel value of the pixel point at coordinate position (x₂, y₂) in {CM_dis(x, y)};
③-2. From the means and standard deviations of the pixel values of the pixel points in {CM_org(x, y)} and {CM_dis(x, y)}, calculate the objective evaluation metric value of each pixel point in {CM_dis(x, y)}; the objective evaluation metric value of the pixel point at coordinate position (x₁, y₁) in {CM_dis(x, y)} is denoted Q_image(x₁, y₁):

$$Q_{image}(x_1,y_1)=\frac{4\times\left(\mu_{org}(x_1,y_1)\times\mu_{dis}(x_1,y_1)\right)\times\left(\sigma_{org}(x_1,y_1)\times\sigma_{dis}(x_1,y_1)\right)+C}{\left(\mu_{org}(x_1,y_1)^{2}+\mu_{dis}(x_1,y_1)^{2}\right)\times\left(\sigma_{org}(x_1,y_1)^{2}+\sigma_{dis}(x_1,y_1)^{2}\right)+C},$$

where C is a control parameter.
The specific process of the step IV is as follows:
④-1. Perform a discrete Fourier transform on {CM_org(x, y)} to obtain the amplitude and phase of {CM_org(x, y)}, denoted {M_org(u, v)} and {A_org(u, v)} respectively, where u and v denote coordinate positions in the transform domain of the amplitude or phase, 1 ≤ u ≤ W, 1 ≤ v ≤ H, M_org(u, v) denotes the amplitude value at coordinate position (u, v) in {M_org(u, v)}, and A_org(u, v) denotes the phase value at coordinate position (u, v) in {A_org(u, v)};
④-2. Calculate the high-frequency component of the amplitude {M_org(u, v)}, denoted {R_org(u, v)}; the high-frequency amplitude value at coordinate position (u, v) in {R_org(u, v)} is denoted R_org(u, v), with R_org(u, v) = log(M_org(u, v)) − h_m(u, v) * log(M_org(u, v)), where log() is the natural logarithm (base e, e ≈ 2.718281828), "*" is the convolution operator, and h_m(u, v) denotes an m × m mean filter;
④-3. Perform an inverse discrete Fourier transform according to {R_org(u, v)} and {A_org(u, v)}, and take the resulting inverse-transformed image as the saliency map of {CM_org(x, y)}, denoted {SM_org(x, y)}, where SM_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {SM_org(x, y)};
④-4. Following the operations of steps ④-1 to ④-3 for obtaining the saliency map of {CM_org(x, y)}, obtain in the same manner the saliency map of {CM_dis(x, y)}, denoted {SM_dis(x, y)}, where SM_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {SM_dis(x, y)}.
Compared with the prior art, the invention has the advantages that:
1) The method calculates the single eye diagram of the original undistorted stereo image and the single eye diagram of the distorted stereo image to be evaluated, and evaluates the single eye diagram of the distorted stereo image directly; this effectively simulates the binocular stereo fusion process and avoids linearly weighting the objective evaluation metric values of the left and right viewpoint images.
2) By calculating the saliency maps of the single eye diagram of the original undistorted stereo image and of the single eye diagram of the distorted stereo image to be evaluated, together with the distortion map between the two single eye diagrams, and using them to fuse the objective evaluation metric values of the pixel points in the single eye diagram of the distorted stereo image to be evaluated, the evaluation result conforms better to human visual perception, which effectively improves the correlation between the objective evaluation results and subjective perception.
Drawings
FIG. 1 is a block diagram of an overall implementation of the method of the present invention;
fig. 2a is a left viewpoint image of Akko (640 × 480 size) stereo image;
fig. 2b is a right viewpoint image of an Akko (size 640 × 480) stereoscopic image;
fig. 3a is a left viewpoint image of an altmobit (size 1024 × 768) stereoscopic image;
fig. 3b is a right view image of an altmobit (size 1024 × 768) stereoscopic image;
fig. 4a is a left viewpoint image of a balloon (size 1024 × 768) stereoscopic image;
fig. 4b is a right viewpoint image of a balloon (size 1024 × 768) stereoscopic image;
fig. 5a is a left viewpoint image of a Doorflower (size 1024 × 768) stereoscopic image;
fig. 5b is a right viewpoint image of a Doorflower (size 1024 × 768) stereoscopic image;
fig. 6a is a left view image of a Kendo (size 1024 × 768) stereoscopic image;
fig. 6b is a right view image of a Kendo (size 1024 × 768) stereoscopic image;
fig. 7a is a left view image of a LeaveLaptop (size 1024 × 768) stereoscopic image;
fig. 7b is a right view image of a LeaveLaptop (size 1024 × 768) stereoscopic image;
fig. 8a is a left viewpoint image of a lovedual 1 (size 1024 × 768) stereoscopic image;
fig. 8b is a right viewpoint image of a lovedual 1 (size 1024 × 768) stereoscopic image;
fig. 9a is a left viewpoint image of a Newspaper (size 1024 × 768) stereoscopic image;
fig. 9b is a right viewpoint image of a Newspaper (size 1024 × 768) stereoscopic image;
FIG. 10a is a left viewpoint image of Puppy (size 720 × 480) stereo image;
FIG. 10b is a right viewpoint image of Puppy (size 720 × 480) stereoscopic image;
fig. 11a is a left viewpoint image of a Soccer2 (size 720 × 480) stereoscopic image;
fig. 11b is a right viewpoint image of a Soccer2 (size 720 × 480) stereoscopic image;
fig. 12a is a left viewpoint image of a Horse (size 720 × 480) stereoscopic image;
fig. 12b is a right view image of a Horse (size 720 × 480) stereoscopic image;
fig. 13a is a left viewpoint image of an Xmas (size 640 × 480) stereoscopic image;
fig. 13b is a right view image of an Xmas (size 640 × 480) stereoscopic image;
fig. 14 is a scatter plot of the difference between the objective evaluation prediction value of image quality and the average subjective score for each distorted stereoscopic image in the set of distorted stereoscopic images.
Detailed Description
The invention is described in further detail below with reference to the accompanying drawings and embodiments.
The invention provides a method for objectively evaluating the quality of a stereo image based on feature fusion, the overall implementation block diagram of which is shown in figure 1, and the processing process is as follows: firstly, obtaining a single eye diagram of an original undistorted stereo image according to even symmetric frequency response and odd symmetric frequency response of each pixel point in a left viewpoint image and a right viewpoint image of the original undistorted stereo image at different scales and directions and a parallax image between the left viewpoint image and the right viewpoint image of the original undistorted stereo image; obtaining a single eye diagram of the distorted stereo image to be evaluated according to even symmetric frequency response and odd symmetric frequency response of each pixel point in the left viewpoint image and the right viewpoint image of the distorted stereo image to be evaluated in different scales and directions and a parallax image between the left viewpoint image and the right viewpoint image of the original undistorted stereo image; secondly, obtaining an objective evaluation metric value of each pixel point in the single eye diagram of the distorted stereo image to be evaluated according to the mean value and the standard deviation of the pixel values of each pixel point in the two single eye diagrams; thirdly, obtaining a corresponding saliency map according to the amplitude and the phase of the single eye map of the original undistorted stereo image; obtaining a corresponding saliency map according to the amplitude and the phase of the single eye map of the distorted stereo image to be evaluated; then, according to the two saliency maps and the distortion map between the two single eye maps, fusing objective evaluation metric values of each pixel point in the single eye map of the distorted three-dimensional image to be evaluated to obtain an objective evaluation prediction value of the image quality of the distorted three-dimensional image to be evaluated; and finally, obtaining the image quality objective evaluation predicted value of the distorted three-dimensional images with different distortion types and different distortion degrees according to the processing process.
The method specifically comprises the following steps:
① Let S_org be the original undistorted stereo image and S_dis the distorted stereo image to be evaluated. Denote the left viewpoint image of S_org as {L_org(x, y)}, the right viewpoint image of S_org as {R_org(x, y)}, the left viewpoint image of S_dis as {L_dis(x, y)} and the right viewpoint image of S_dis as {R_dis(x, y)}, where (x, y) denotes the coordinate position of a pixel point in the left and right viewpoint images, 1 ≤ x ≤ W, 1 ≤ y ≤ H, W denotes the width of the left and right viewpoint images, H denotes their height, L_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {L_org(x, y)}, R_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {R_org(x, y)}, L_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {L_dis(x, y)}, and R_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {R_dis(x, y)}.
② According to the even symmetric frequency response and odd symmetric frequency response of each pixel point in {L_org(x, y)}, {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)} at different scales and in different directions, obtain the amplitude of each pixel point in {L_org(x, y)}, {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)}. Then, from the amplitudes of the pixel points in {L_org(x, y)} and {R_org(x, y)} and the pixel values of the parallax image between {L_org(x, y)} and {R_org(x, y)}, calculate the single eye diagram (cyclopean map) of S_org, denoted {CM_org(x, y)}; and from the amplitudes of the pixel points in {L_dis(x, y)} and {R_dis(x, y)} and the pixel values of the parallax image between {L_org(x, y)} and {R_org(x, y)}, calculate the single eye diagram of S_dis, denoted {CM_dis(x, y)}, where CM_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {CM_org(x, y)} and CM_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {CM_dis(x, y)}.
In this embodiment, the specific process of step two is:
②-1. Filter {L_org(x, y)} to obtain the even symmetric frequency response and odd symmetric frequency response of each pixel point in {L_org(x, y)} at different scales and in different directions; the even symmetric frequency response of the pixel point at coordinate position (x, y) in {L_org(x, y)} at scale α and direction θ is denoted e_{α,θ}(x, y) and its odd symmetric frequency response is denoted o_{α,θ}(x, y), where α denotes the scale factor of the filter used for the filtering, 1 ≤ α ≤ 4, and θ denotes the direction factor of the filter used for the filtering, 1 ≤ θ ≤ 4.
Here, the filter used to filter {L_org(x, y)} is a log-Gabor filter.
②-2. From the even symmetric and odd symmetric frequency responses of each pixel point in {L_org(x, y)} at the different scales and directions, calculate the amplitude of each pixel point in {L_org(x, y)}; the amplitude of the pixel point at coordinate position (x, y) in {L_org(x, y)} is denoted GE^L_org(x, y):

$$GE^{L}_{org}(x,y)=\sum_{\theta=1}^{4}\sum_{\alpha=1}^{4}\sqrt{e_{\alpha,\theta}(x,y)^{2}+o_{\alpha,\theta}(x,y)^{2}}.$$
②-3. Following the operations of steps ②-1 to ②-2 for obtaining the amplitude of each pixel point in {L_org(x, y)}, obtain in the same manner the amplitude of each pixel point in {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)}; the amplitude of the pixel point at coordinate position (x, y) in {R_org(x, y)} is denoted GE^R_org(x, y), the amplitude of the pixel point at coordinate position (x, y) in {L_dis(x, y)} is denoted GE^L_dis(x, y), and the amplitude of the pixel point at coordinate position (x, y) in {R_dis(x, y)} is denoted GE^R_dis(x, y). For example, the amplitude of each pixel point in {L_dis(x, y)} is obtained as follows: 1) filter {L_dis(x, y)} to obtain the even symmetric and odd symmetric frequency responses of each pixel point in {L_dis(x, y)} at different scales and in different directions, the even symmetric frequency response of the pixel point at coordinate position (x, y) being denoted e'_{α,θ}(x, y) and its odd symmetric frequency response o'_{α,θ}(x, y), where α denotes the scale factor of the filter used for the filtering, 1 ≤ α ≤ 4, and θ denotes the direction factor of the filter used for the filtering, 1 ≤ θ ≤ 4; 2) from these responses, calculate the amplitude of each pixel point in {L_dis(x, y)}; the amplitude of the pixel point at coordinate position (x, y) in {L_dis(x, y)} is denoted GE^L_dis(x, y):

$$GE^{L}_{dis}(x,y)=\sum_{\theta=1}^{4}\sum_{\alpha=1}^{4}\sqrt{e'_{\alpha,\theta}(x,y)^{2}+o'_{\alpha,\theta}(x,y)^{2}}.$$
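For illustration, a minimal NumPy sketch of steps ②-1 to ②-3 is given below; it builds a 4-scale, 4-direction log-Gabor filter bank in the frequency domain, takes the real and imaginary parts of each filtered result as the even and odd symmetric responses, and accumulates their energies into the amplitude GE(x, y). The filter design parameters (minimum wavelength, scale spacing, radial and angular bandwidths) are illustrative assumptions and are not values fixed by the method itself.

import numpy as np

def log_gabor_amplitude(img, n_scales=4, n_orients=4,
                        min_wavelength=6.0, mult=2.0,
                        sigma_f=0.55, sigma_theta=np.pi / 8):
    h, w = img.shape
    F = np.fft.fft2(img.astype(np.float64))

    # Normalised frequency grid (DC at the corner, matching np.fft conventions).
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    radius = np.hypot(fx, fy)
    radius[0, 0] = 1.0                       # avoid log(0) at DC
    angle = np.arctan2(-fy, fx)              # orientation of each frequency sample

    amplitude = np.zeros((h, w))
    for s in range(n_scales):
        f0 = 1.0 / (min_wavelength * mult ** s)          # centre frequency of scale s
        radial = np.exp(-(np.log(radius / f0) ** 2) / (2 * np.log(sigma_f) ** 2))
        radial[0, 0] = 0.0                                # zero DC response
        for o in range(n_orients):
            theta0 = o * np.pi / n_orients
            dtheta = np.arctan2(np.sin(angle - theta0), np.cos(angle - theta0))
            angular = np.exp(-(dtheta ** 2) / (2 * sigma_theta ** 2))
            response = np.fft.ifft2(F * radial * angular)
            e = response.real                             # even symmetric response e_{alpha,theta}
            od = response.imag                            # odd symmetric response o_{alpha,theta}
            amplitude += np.sqrt(e ** 2 + od ** 2)        # accumulate over scales and directions
    return amplitude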
②-4. Calculate the parallax image between {L_org(x, y)} and {R_org(x, y)} by a block matching method, denoted {d^L_org(x, y)}, where d^L_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {d^L_org(x, y)}.
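The block matching of step ②-4 can be sketched as follows, assuming greyscale inputs, an 8 × 8 block size, a purely horizontal sum-of-absolute-differences search and a maximum disparity of 64 pixels; these search parameters are assumptions for the sketch, since the method only states that a block matching method is used.

import numpy as np

def block_matching_disparity(left, right, block=8, max_disp=64):
    h, w = left.shape
    disp = np.zeros((h, w), dtype=np.int32)
    for by in range(0, h - block + 1, block):
        for bx in range(0, w - block + 1, block):
            ref = left[by:by + block, bx:bx + block].astype(np.float64)
            best_d, best_cost = 0, np.inf
            for d in range(0, min(max_disp, bx) + 1):          # horizontal search in the right view
                cand = right[by:by + block, bx - d:bx - d + block].astype(np.float64)
                cost = np.abs(ref - cand).sum()                # sum of absolute differences
                if cost < best_cost:
                    best_cost, best_d = cost, d
            disp[by:by + block, bx:bx + block] = best_d        # assign the block's best disparity
    return disp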
②-5. From the amplitudes of the pixel points in {L_org(x, y)} and {R_org(x, y)} and the pixel values of {d^L_org(x, y)}, calculate the single eye diagram of S_org, denoted {CM_org(x, y)}; the pixel value of the pixel point at coordinate position (x, y) in {CM_org(x, y)} is denoted CM_org(x, y):

$$CM_{org}(x,y)=\frac{GE^{L}_{org}(x,y)\times L_{org}(x,y)+GE^{R}_{org}(x-d^{L}_{org}(x,y),y)\times R_{org}(x-d^{L}_{org}(x,y),y)}{GE^{L}_{org}(x,y)+GE^{R}_{org}(x-d^{L}_{org}(x,y),y)},$$

where GE^R_org(x − d^L_org(x, y), y) denotes the amplitude of the pixel point at coordinate position (x − d^L_org(x, y), y) in {R_org(x, y)} and R_org(x − d^L_org(x, y), y) denotes the pixel value of the pixel point at coordinate position (x − d^L_org(x, y), y) in {R_org(x, y)}.
②-6. From the amplitudes of the pixel points in {L_dis(x, y)} and {R_dis(x, y)} and the pixel values of {d^L_org(x, y)}, calculate the single eye diagram of S_dis, denoted {CM_dis(x, y)}; the pixel value of the pixel point at coordinate position (x, y) in {CM_dis(x, y)} is denoted CM_dis(x, y):

$$CM_{dis}(x,y)=\frac{GE^{L}_{dis}(x,y)\times L_{dis}(x,y)+GE^{R}_{dis}(x-d^{L}_{org}(x,y),y)\times R_{dis}(x-d^{L}_{org}(x,y),y)}{GE^{L}_{dis}(x,y)+GE^{R}_{dis}(x-d^{L}_{org}(x,y),y)},$$

where GE^R_dis(x − d^L_org(x, y), y) denotes the amplitude of the pixel point at coordinate position (x − d^L_org(x, y), y) in {R_dis(x, y)} and R_dis(x − d^L_org(x, y), y) denotes the pixel value of the pixel point at coordinate position (x − d^L_org(x, y), y) in {R_dis(x, y)}.
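A minimal sketch of the fusion in steps ②-5 and ②-6 is shown below; the same function can build {CM_org(x, y)} from the original pair and {CM_dis(x, y)} from the distorted pair, in both cases using the disparity map {d^L_org(x, y)} of the original pair. Clipping out-of-range shifted coordinates to the image border is an assumption of the sketch, since the border handling is not specified.

import numpy as np

def cyclopean_map(L, R, GE_L, GE_R, disp):
    h, w = L.shape
    ys, xs = np.mgrid[0:h, 0:w]
    xr = np.clip(xs - disp, 0, w - 1)       # x - d^L_org(x, y), clipped to the image width
    ge_r = GE_R[ys, xr]                     # amplitude of the disparity-shifted right-view pixel
    r = R[ys, xr]                           # pixel value of the disparity-shifted right-view pixel
    eps = 1e-12                             # guard against a zero denominator
    return (GE_L * L + ge_r * r) / (GE_L + ge_r + eps)

# CM_org is built from (L_org, R_org, GE^L_org, GE^R_org, d^L_org); CM_dis is built from
# (L_dis, R_dis, GE^L_dis, GE^R_dis) with the same disparity map d^L_org, as described above.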
③ According to the mean and standard deviation of the pixel values around each pixel point in {CM_org(x, y)} and {CM_dis(x, y)}, calculate the objective evaluation metric value of each pixel point in {CM_dis(x, y)}; the objective evaluation metric value of the pixel point at coordinate position (x, y) in {CM_dis(x, y)} is denoted Q_image(x, y), and the objective evaluation metric values of all pixel points in {CM_dis(x, y)} are collectively denoted {Q_image(x, y)}.
In this embodiment, the specific process of step three is:
③-1. Calculate the mean and standard deviation of the pixel values in a neighborhood of each pixel point in {CM_org(x, y)} and {CM_dis(x, y)}. The mean and standard deviation for the pixel point at coordinate position (x₁, y₁) in {CM_org(x, y)} are denoted μ_org(x₁, y₁) and σ_org(x₁, y₁), and those for the pixel point at coordinate position (x₁, y₁) in {CM_dis(x, y)} are denoted μ_dis(x₁, y₁) and σ_dis(x₁, y₁):

$$\mu_{org}(x_1,y_1)=\frac{\sum_{(x_2,y_2)\in N(x_1,y_1)} CM_{org}(x_2,y_2)}{M},\qquad \sigma_{org}(x_1,y_1)=\sqrt{\frac{\sum_{(x_2,y_2)\in N(x_1,y_1)}\left(CM_{org}(x_2,y_2)-\mu_{org}(x_1,y_1)\right)^{2}}{M}},$$

$$\mu_{dis}(x_1,y_1)=\frac{\sum_{(x_2,y_2)\in N(x_1,y_1)} CM_{dis}(x_2,y_2)}{M},\qquad \sigma_{dis}(x_1,y_1)=\sqrt{\frac{\sum_{(x_2,y_2)\in N(x_1,y_1)}\left(CM_{dis}(x_2,y_2)-\mu_{dis}(x_1,y_1)\right)^{2}}{M}},$$

where 1 ≤ x₁ ≤ W, 1 ≤ y₁ ≤ H, N(x₁, y₁) denotes the 8 × 8 neighborhood window centered on the pixel point at coordinate position (x₁, y₁), (x₂, y₂) ranges over the pixel points in N(x₁, y₁), M denotes the number of pixel points in N(x₁, y₁), CM_org(x₂, y₂) denotes the pixel value of the pixel point at coordinate position (x₂, y₂) in {CM_org(x, y)}, and CM_dis(x₂, y₂) denotes the pixel value of the pixel point at coordinate position (x₂, y₂) in {CM_dis(x, y)}.
③-2. From the means and standard deviations of the pixel values of the pixel points in {CM_org(x, y)} and {CM_dis(x, y)}, calculate the objective evaluation metric value of each pixel point in {CM_dis(x, y)}; the objective evaluation metric value of the pixel point at coordinate position (x₁, y₁) in {CM_dis(x, y)} is denoted Q_image(x₁, y₁):

$$Q_{image}(x_1,y_1)=\frac{4\times\left(\mu_{org}(x_1,y_1)\times\mu_{dis}(x_1,y_1)\right)\times\left(\sigma_{org}(x_1,y_1)\times\sigma_{dis}(x_1,y_1)\right)+C}{\left(\mu_{org}(x_1,y_1)^{2}+\mu_{dis}(x_1,y_1)^{2}\right)\times\left(\sigma_{org}(x_1,y_1)^{2}+\sigma_{dis}(x_1,y_1)^{2}\right)+C},$$

where C is a control parameter; in this embodiment, C = 0.01.
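A minimal sketch of step ③ is given below; it uses a sliding 8 × 8 mean filter to obtain the local means and standard deviations of the two single eye diagrams and combines them into the per-pixel metric Q_image with C = 0.01. The use of SciPy's uniform_filter (with its default reflective border handling) is an implementation assumption rather than something fixed by the embodiment.

import numpy as np
from scipy.ndimage import uniform_filter

def q_image_map(cm_org, cm_dis, win=8, C=0.01):
    # Local means over the sliding window N(x1, y1).
    mu_o = uniform_filter(cm_org, size=win)
    mu_d = uniform_filter(cm_dis, size=win)
    # Local variances via E[X^2] - E[X]^2; clip small negatives from rounding.
    var_o = uniform_filter(cm_org ** 2, size=win) - mu_o ** 2
    var_d = uniform_filter(cm_dis ** 2, size=win) - mu_d ** 2
    sigma_o = np.sqrt(np.maximum(var_o, 0.0))
    sigma_d = np.sqrt(np.maximum(var_d, 0.0))
    # Per-pixel objective evaluation metric Q_image(x1, y1).
    num = 4.0 * (mu_o * mu_d) * (sigma_o * sigma_d) + C
    den = (mu_o ** 2 + mu_d ** 2) * (sigma_o ** 2 + sigma_d ** 2) + C
    return num / den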
④ According to the spectral residual characteristic of {CM_org(x, y)}, i.e. according to the amplitude and phase of {CM_org(x, y)}, calculate the saliency map of {CM_org(x, y)}, denoted {SM_org(x, y)}; according to the spectral residual characteristic of {CM_dis(x, y)}, i.e. according to the amplitude and phase of {CM_dis(x, y)}, calculate the saliency map of {CM_dis(x, y)}, denoted {SM_dis(x, y)}, where SM_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {SM_org(x, y)} and SM_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {SM_dis(x, y)}.
In this embodiment, the specific process of step iv is:
④-1. Perform a discrete Fourier transform on {CM_org(x, y)} to obtain the amplitude and phase of {CM_org(x, y)}, denoted {M_org(u, v)} and {A_org(u, v)} respectively, where u and v denote coordinate positions in the transform domain of the amplitude or phase, 1 ≤ u ≤ W, 1 ≤ v ≤ H, M_org(u, v) denotes the amplitude value at coordinate position (u, v) in {M_org(u, v)}, and A_org(u, v) denotes the phase value at coordinate position (u, v) in {A_org(u, v)}.
④-2. Calculate the high-frequency component of the amplitude {M_org(u, v)}, denoted {R_org(u, v)}; the high-frequency amplitude value at coordinate position (u, v) in {R_org(u, v)} is denoted R_org(u, v), with R_org(u, v) = log(M_org(u, v)) − h_m(u, v) * log(M_org(u, v)), where log() is the natural logarithm (base e, e ≈ 2.718281828), "*" is the convolution operator, and h_m(u, v) denotes an m × m mean filter; in this embodiment, m = 3.
④-3. Perform an inverse discrete Fourier transform according to {R_org(u, v)} and {A_org(u, v)}, and take the resulting inverse-transformed image as the saliency map of {CM_org(x, y)}, denoted {SM_org(x, y)}, where SM_org(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {SM_org(x, y)}.
④-4. Following the operations of steps ④-1 to ④-3 for obtaining the saliency map of {CM_org(x, y)}, obtain in the same manner the saliency map of {CM_dis(x, y)}, denoted {SM_dis(x, y)}, where SM_dis(x, y) denotes the pixel value of the pixel point at coordinate position (x, y) in {SM_dis(x, y)}.
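A minimal sketch of the spectral-residual saliency computation of step ④ follows; it takes the DFT of a single eye diagram, smooths the log-amplitude with a 3 × 3 mean filter (m = 3 as in this embodiment), keeps the original phase, and inverse-transforms. Recombining the residual and phase by exponentiation before the inverse DFT follows the usual spectral-residual formulation and is an assumption here, since the text only states that the inverse transform is taken according to {R_org(u, v)} and {A_org(u, v)}.

import numpy as np
from scipy.ndimage import uniform_filter

def spectral_residual_saliency(cm, m=3):
    F = np.fft.fft2(cm)
    amplitude = np.abs(F)                                   # M(u, v)
    phase = np.angle(F)                                     # A(u, v)
    log_amp = np.log(amplitude + 1e-12)
    residual = log_amp - uniform_filter(log_amp, size=m)    # R(u, v) = log M - h_m * log M
    sal = np.fft.ifft2(np.exp(residual + 1j * phase))       # inverse DFT of residual + phase
    return np.abs(sal)                                      # saliency map SM(x, y)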
⑤ Calculate the distortion map between {CM_org(x, y)} and {CM_dis(x, y)}, denoted {DM(x, y)}; the pixel value of the pixel point at coordinate position (x, y) in {DM(x, y)} is denoted DM(x, y), with DM(x, y) = (CM_org(x, y) − CM_dis(x, y))².
⑥ According to {SM_org(x, y)}, {SM_dis(x, y)} and {DM(x, y)}, fuse the objective evaluation metric values of all pixel points in {CM_dis(x, y)} to obtain the image quality objective evaluation predicted value of S_dis, denoted Q:

$$Q=\left[\frac{\sum_{(x,y)\in\Omega} Q_{image}(x,y)\times SM(x,y)}{\sum_{(x,y)\in\Omega} SM(x,y)}\right]^{\gamma}\times\left[\frac{\sum_{(x,y)\in\Omega} Q_{image}(x,y)\times DM(x,y)}{\sum_{(x,y)\in\Omega} DM(x,y)}\right]^{\beta},$$

where Ω denotes the pixel domain, SM(x, y) = max(SM_org(x, y), SM_dis(x, y)), max() is the maximum-value function, and γ and β are weight coefficients; in this embodiment, γ = 1.601 and β = 0.501.
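A minimal sketch of steps ⑤ and ⑥ is given below, pooling the per-pixel metric Q_image into the overall predicted value Q with the saliency-weighted and distortion-weighted averages and the exponents γ = 1.601 and β = 0.501 of this embodiment.

import numpy as np

def pooled_quality(q_image, sm_org, sm_dis, cm_org, cm_dis, gamma=1.601, beta=0.501):
    sm = np.maximum(sm_org, sm_dis)          # SM(x, y) = max(SM_org(x, y), SM_dis(x, y))
    dm = (cm_org - cm_dis) ** 2              # DM(x, y), the distortion map of step ⑤
    q_sm = (q_image * sm).sum() / sm.sum()   # saliency-weighted average of Q_image
    q_dm = (q_image * dm).sum() / dm.sum()   # distortion-weighted average of Q_image
    return (q_sm ** gamma) * (q_dm ** beta)  # objective evaluation predicted value Q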
⑦. Using n original undistorted stereoscopic images, establish a set of distorted stereoscopic images under different distortion types and different distortion degrees, the set containing a plurality of distorted stereoscopic images; with a subjective quality evaluation method, obtain the difference mean opinion score of each distorted stereoscopic image in the set, denoted DMOS, where DMOS = 100 − MOS, MOS denotes the mean opinion score, DMOS ∈ [0, 100], and n ≥ 1.
In this embodiment, 12 undistorted stereoscopic images (n = 12), formed by the image pairs of Fig. 2a/2b, Fig. 3a/3b, Fig. 4a/4b, Fig. 5a/5b, Fig. 6a/6b, Fig. 7a/7b, Fig. 8a/8b, Fig. 9a/9b, Fig. 10a/10b, Fig. 11a/11b, Fig. 12a/12b and Fig. 13a/13b, are used to establish the distorted stereoscopic image set with different distortion types and distortion degrees. The set contains 252 distorted stereoscopic images covering 4 distortion types: 60 JPEG-compressed, 60 JPEG2000-compressed, 60 Gaussian-blurred and 72 H.264-coded distorted stereoscopic images.
Following the operations of steps ① to ⑥ for calculating the image quality objective evaluation predicted value of S_dis, the image quality objective evaluation predicted value of each distorted stereoscopic image in the distorted stereoscopic image set is calculated in the same manner.
The correlation between the image quality objective evaluation predicted values obtained in this embodiment and the difference mean opinion scores is analysed on the 252 distorted stereoscopic images derived from the 12 undistorted stereoscopic images of Fig. 2a to Fig. 13b under different degrees of JPEG compression, JPEG2000 compression, Gaussian blur and H.264 coding distortion. Four objective parameters commonly used to assess image quality evaluation methods are adopted as evaluation indices: the Pearson linear correlation coefficient (PLCC), the Spearman rank-order correlation coefficient (SROCC), the Kendall rank-order correlation coefficient (KROCC) and the root mean squared error (RMSE). Under nonlinear regression conditions, PLCC and RMSE reflect the accuracy of the objective evaluation model for distorted stereoscopic images, while SROCC and KROCC reflect its monotonicity. The image quality objective evaluation predicted values calculated by the method are fitted to the subjective scores with a five-parameter logistic function; higher PLCC, SROCC and KROCC values and a lower RMSE value indicate better correlation between the objective evaluation method and the difference mean opinion scores. The Pearson, Spearman and Kendall correlation coefficients and the RMSE between the predicted values and the subjective scores obtained with and without the method of the invention are compared in Tables 1, 2, 3 and 4. As the tables show, the correlation between the final predicted values obtained by the method of the invention and the difference mean opinion scores is very high, indicating that the objective evaluation results agree closely with human subjective perception and demonstrating the effectiveness of the method.
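The performance analysis described above can be reproduced with a short script: fit the five-parameter logistic mapping from objective scores to DMOS, then report PLCC, SROCC, KROCC and RMSE. A minimal sketch, assuming q_obj and dmos are 1-D arrays of per-image objective scores and subjective DMOS values; the exact logistic expression and the initial parameter guesses are assumptions, since the text only names a "five-parameter Logistic function".

```python
import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import pearsonr, spearmanr, kendalltau

def logistic5(q, b1, b2, b3, b4, b5):
    # VQEG-style five-parameter logistic mapping (assumed form)
    return b1 * (0.5 - 1.0 / (1.0 + np.exp(b2 * (q - b3)))) + b4 * q + b5

def evaluate(q_obj, dmos):
    """Nonlinear regression followed by PLCC, SROCC, KROCC and RMSE."""
    q_obj, dmos = np.asarray(q_obj, float), np.asarray(dmos, float)
    p0 = [np.max(dmos), 1.0, np.mean(q_obj), 1.0, np.mean(dmos)]
    params, _ = curve_fit(logistic5, q_obj, dmos, p0=p0, maxfev=20000)
    q_fit = logistic5(q_obj, *params)
    plcc = pearsonr(q_fit, dmos)[0]                   # accuracy after regression
    rmse = float(np.sqrt(np.mean((q_fit - dmos) ** 2)))
    srocc = spearmanr(q_obj, dmos)[0]                 # monotonicity (rank-based)
    krocc = kendalltau(q_obj, dmos)[0]
    return plcc, srocc, krocc, rmse
```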
Fig. 14 shows the scatter plot of the image quality objective evaluation predicted value against the difference mean opinion score for each distorted stereoscopic image in the set; the more concentrated the scatter, the better the consistency between the objective evaluation results and subjective perception. As Fig. 14 shows, the scatter obtained with the method of the invention is more concentrated and fits the subjective evaluation data more closely.
TABLE 1 Pearson correlation coefficient comparison between objective evaluation prediction value and subjective score of image quality for distorted stereoscopic images obtained without and with the method of the present invention
TABLE 2 comparison of Spearman correlation coefficient between objective evaluation prediction value and subjective score of image quality for distorted stereo images obtained without and with the method of the present invention
TABLE 3 comparison of Kendall correlation coefficient between objective evaluation prediction value and subjective score of image quality for distorted stereo images obtained without and with the method of the present invention
TABLE 4 comparison of root mean squared error between objective evaluation prediction value and subjective score of image quality for distorted stereoscopic images obtained without and with the method of the present invention

Claims (6)

1. A stereo image quality objective evaluation method based on feature fusion is characterized in that the processing process is as follows: firstly, obtaining a single eye diagram of an original undistorted stereo image according to even symmetric frequency response and odd symmetric frequency response of each pixel point in a left viewpoint image and a right viewpoint image of the original undistorted stereo image at different scales and directions and a parallax image between the left viewpoint image and the right viewpoint image of the original undistorted stereo image; obtaining a single eye diagram of the distorted stereo image to be evaluated according to even symmetric frequency response and odd symmetric frequency response of each pixel point in the left viewpoint image and the right viewpoint image of the distorted stereo image to be evaluated in different scales and directions and a parallax image between the left viewpoint image and the right viewpoint image of the original undistorted stereo image; secondly, obtaining an objective evaluation metric value of each pixel point in the single eye diagram of the distorted stereo image to be evaluated according to the mean value and the standard deviation of the pixel values of each pixel point in the two single eye diagrams; thirdly, obtaining a corresponding saliency map according to the amplitude and the phase of the single eye map of the original undistorted stereo image; obtaining a corresponding saliency map according to the amplitude and the phase of the single eye map of the distorted stereo image to be evaluated; then, according to the two saliency maps and the distortion map between the two single eye maps, fusing objective evaluation metric values of each pixel point in the single eye map of the distorted three-dimensional image to be evaluated to obtain an objective evaluation prediction value of the image quality of the distorted three-dimensional image to be evaluated; and finally, obtaining the image quality objective evaluation predicted value of the distorted three-dimensional images with different distortion types and different distortion degrees according to the processing process.
2. The method for objectively evaluating the quality of a stereoscopic image based on feature fusion according to claim 1, characterized in that it specifically includes the following steps:
① Let S_org be the original undistorted stereoscopic image and S_dis be the distorted stereoscopic image to be evaluated; denote the left viewpoint image of S_org as {L_org(x, y)}, the right viewpoint image of S_org as {R_org(x, y)}, the left viewpoint image of S_dis as {L_dis(x, y)} and the right viewpoint image of S_dis as {R_dis(x, y)}, where (x, y) denotes the coordinate position of a pixel point in the left and right viewpoint images, 1 ≤ x ≤ W, 1 ≤ y ≤ H, W denotes the width of the left and right viewpoint images, H denotes the height of the left and right viewpoint images, L_org(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {L_org(x, y)}, R_org(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {R_org(x, y)}, L_dis(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {L_dis(x, y)}, and R_dis(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {R_dis(x, y)};
② According to the even-symmetric and odd-symmetric frequency responses, at different scales and in different directions, of each pixel point in {L_org(x, y)}, {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)}, correspondingly obtain the amplitude of each pixel point in {L_org(x, y)}, {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)}; then, according to the amplitude of each pixel point in {L_org(x, y)} and {R_org(x, y)} and the pixel value of each pixel point in the parallax image between {L_org(x, y)} and {R_org(x, y)}, calculate the single eye diagram of S_org, denoted {CM_org(x, y)}, and according to the amplitude of each pixel point in {L_dis(x, y)} and {R_dis(x, y)} and the pixel value of each pixel point in the parallax image between {L_org(x, y)} and {R_org(x, y)}, calculate the single eye diagram of S_dis, denoted {CM_dis(x, y)}, where CM_org(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {CM_org(x, y)} and CM_dis(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {CM_dis(x, y)};
③ According to the mean and standard deviation of the pixel values of each pixel point in {CM_org(x, y)} and {CM_dis(x, y)}, calculate the objective evaluation metric value of each pixel point in {CM_dis(x, y)}, and record the objective evaluation metric value of the pixel point with coordinate position (x, y) in {CM_dis(x, y)} as Q_image(x, y);
④ According to the amplitude and phase of {CM_org(x, y)}, calculate the saliency map of {CM_org(x, y)}, denoted {SM_org(x, y)}; according to the amplitude and phase of {CM_dis(x, y)}, calculate the saliency map of {CM_dis(x, y)}, denoted {SM_dis(x, y)}, where SM_org(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {SM_org(x, y)} and SM_dis(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {SM_dis(x, y)};
⑤ Calculate the distortion map between {CM_org(x, y)} and {CM_dis(x, y)}, denoted {DM(x, y)}; the pixel value of the pixel point with coordinate position (x, y) in {DM(x, y)} is DM(x, y) = (CM_org(x, y) − CM_dis(x, y))²;
⑥ According to {SM_org(x, y)}, {SM_dis(x, y)} and {DM(x, y)}, fuse the objective evaluation metric values of all pixel points in {CM_dis(x, y)} to obtain the image quality objective evaluation predicted value of S_dis, denoted Q:

Q = \left[ \frac{\sum_{(x,y)\in\Omega} Q_{image}(x,y)\, SM(x,y)}{\sum_{(x,y)\in\Omega} SM(x,y)} \right]^{\gamma} \times \left[ \frac{\sum_{(x,y)\in\Omega} Q_{image}(x,y)\, DM(x,y)}{\sum_{(x,y)\in\Omega} DM(x,y)} \right]^{\beta},

where Ω denotes the pixel-domain range, SM(x, y) = max(SM_org(x, y), SM_dis(x, y)), max() is the maximum-value function, and γ and β are weight coefficients;
⑦ Using n original undistorted stereoscopic images, establish a set of distorted stereoscopic images of the undistorted stereoscopic images under different distortion types and different distortion degrees, the set containing a plurality of distorted stereoscopic images; with a subjective quality evaluation method, obtain the difference mean opinion score of each distorted stereoscopic image in the set, denoted DMOS, where DMOS = 100 − MOS, MOS denotes the mean opinion score, DMOS ∈ [0, 100], and n ≥ 1;
Following the operations of steps ① to ⑥ for calculating the image quality objective evaluation predicted value of S_dis, the image quality objective evaluation predicted value of each distorted stereoscopic image in the distorted stereoscopic image set is calculated in the same manner.
3. The objective evaluation method for stereo image quality based on feature fusion as claimed in claim 2, wherein the specific process of step ② is as follows:
②-1. Filter {L_org(x, y)} to obtain the even-symmetric and odd-symmetric frequency responses of each pixel point in {L_org(x, y)} at different scales and in different directions; record the even-symmetric frequency response of the pixel point with coordinate position (x, y) at different scales and directions as e_{α,θ}(x, y) and the odd-symmetric frequency response as o_{α,θ}(x, y), where α denotes the scale factor of the filter used for filtering, 1 ≤ α ≤ 4, θ denotes the direction factor of the filter used for filtering, and 1 ≤ θ ≤ 4;
②-2. According to the even-symmetric and odd-symmetric frequency responses of each pixel point in {L_org(x, y)} at different scales and in different directions, calculate the amplitude of each pixel point in {L_org(x, y)}, and record the amplitude of the pixel point with coordinate position (x, y) in {L_org(x, y)} accordingly;
②-3. Following the operations of steps ②-1 to ②-2 for obtaining the amplitude of each pixel point in {L_org(x, y)}, obtain in the same manner the amplitude of each pixel point in {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)}, recording the amplitude of the pixel point with coordinate position (x, y) in each of {R_org(x, y)}, {L_dis(x, y)} and {R_dis(x, y)} accordingly;
②-4. Using a block-matching method, calculate the parallax image between {L_org(x, y)} and {R_org(x, y)}, in which the pixel value at coordinate position (x, y) is the disparity of the pixel point at (x, y);
②-5. According to the amplitude of each pixel point in {L_org(x, y)} and {R_org(x, y)} and the pixel values of the parallax image obtained in step ②-4, calculate the single eye diagram of S_org, denoted {CM_org(x, y)}, and record the pixel value of the pixel point with coordinate position (x, y) in {CM_org(x, y)} as CM_org(x, y); here the amplitude and pixel value taken from {R_org(x, y)} are those of the pixel point at the disparity-shifted coordinate position corresponding to (x, y);
②-6. According to the amplitude of each pixel point in {L_dis(x, y)} and {R_dis(x, y)} and the pixel values of the parallax image obtained in step ②-4, calculate the single eye diagram of S_dis, denoted {CM_dis(x, y)}, and record the pixel value of the pixel point with coordinate position (x, y) in {CM_dis(x, y)} as CM_dis(x, y); here the amplitude and pixel value taken from {R_dis(x, y)} are those of the pixel point at the disparity-shifted coordinate position corresponding to (x, y).
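As an illustration of steps ②-1 to ②-6 of claim 3, the sketch below builds a log-Gabor amplitude map and an amplitude-weighted cyclopean image from grayscale float arrays. All filter constants, the summation of responses over scales and orientations, and the weighted-combination rule are assumptions for illustration only; the patent's exact expressions are given as equation images and are not reproduced here.

```python
import numpy as np

def log_gabor_amplitude(img, n_scales=4, n_orients=4, min_wavelength=6,
                        mult=2.0, sigma_f=0.55, sigma_theta=0.4):
    """Sum of log-Gabor response magnitudes over scales and orientations (steps ②-1/②-2)."""
    h, w = img.shape
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    radius = np.sqrt(fx ** 2 + fy ** 2)
    radius[0, 0] = 1.0                               # avoid log(0) at the DC term
    theta = np.arctan2(-fy, fx)                      # broadcast to (h, w)
    F = np.fft.fft2(img)
    amplitude = np.zeros((h, w), dtype=float)
    for s in range(n_scales):
        f0 = 1.0 / (min_wavelength * mult ** s)      # centre frequency of scale s
        radial = np.exp(-(np.log(radius / f0) ** 2) / (2 * np.log(sigma_f) ** 2))
        radial[0, 0] = 0.0
        for o in range(n_orients):
            angle = o * np.pi / n_orients
            dtheta = np.arctan2(np.sin(theta - angle), np.cos(theta - angle))
            angular = np.exp(-(dtheta ** 2) / (2 * sigma_theta ** 2))
            eo = np.fft.ifft2(F * radial * angular)  # even response = real, odd = imaginary
            amplitude += np.abs(eo)                  # magnitude accumulated over scale/orientation
    return amplitude

def cyclopean_image(left, right, disparity, ge_left, ge_right):
    """Amplitude-weighted fusion of the left view and the disparity-shifted right view
    (steps ②-5/②-6); the weighting rule is an assumed, common formulation."""
    h, w = left.shape
    ys, xs0 = np.indices((h, w))
    xs = np.clip(xs0 - disparity.astype(int), 0, w - 1)   # shift right view by disparity
    r_shift, ge_r_shift = right[ys, xs], ge_right[ys, xs]
    wsum = ge_left + ge_r_shift + 1e-12
    return (ge_left * left + ge_r_shift * r_shift) / wsum
```

A sketch of the block-matching disparity estimator of step ②-4 is omitted; any dense stereo matcher producing a per-pixel disparity map can stand in for it here.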
4. The method for objectively evaluating the quality of a stereoscopic image based on feature fusion according to claim 3, wherein in step ②-1 the filter used for filtering {L_org(x, y)} is a log-Gabor filter.
5. The objective evaluation method for stereo image quality based on feature fusion according to any one of claims 2 to 4, characterized in that the specific process of step ③ is as follows:
③-1. Calculate the mean and standard deviation of the pixel values of each pixel point in {CM_org(x, y)} and {CM_dis(x, y)}: record the mean and standard deviation of the pixel values of the pixel point with coordinate position (x1, y1) in {CM_org(x, y)} as μ_org(x1, y1) and σ_org(x1, y1), and record the mean and standard deviation of the pixel values of the pixel point with coordinate position (x1, y1) in {CM_dis(x, y)} as μ_dis(x1, y1) and σ_dis(x1, y1),
where 1 ≤ x1 ≤ W, 1 ≤ y1 ≤ H, N(x1, y1) denotes the 8 × 8 neighborhood window centred on the pixel point with coordinate position (x1, y1), M denotes the number of pixel points within N(x1, y1), CM_org(x1, y1) denotes the pixel value of the pixel point with coordinate position (x1, y1) in {CM_org(x, y)}, and CM_dis(x1, y1) denotes the pixel value of the pixel point with coordinate position (x1, y1) in {CM_dis(x, y)};
③-2. According to the mean and standard deviation of the pixel values of each pixel point in {CM_org(x, y)} and {CM_dis(x, y)}, calculate the objective evaluation metric value of each pixel point in {CM_dis(x, y)}, and record the objective evaluation metric value of the pixel point with coordinate position (x1, y1) in {CM_dis(x, y)} as Q_image(x1, y1), where C is a control parameter.
6. The objective evaluation method for the quality of the stereo image based on the feature fusion as claimed in claim 5, wherein the specific process of step ④ is as follows:
④-1. Perform a discrete Fourier transform on {CM_org(x, y)} to obtain the amplitude and phase of {CM_org(x, y)}, denoted {M_org(u, v)} and {A_org(u, v)} respectively, where (u, v) denotes a coordinate position in the transform domain, 1 ≤ u ≤ W, 1 ≤ v ≤ H, M_org(u, v) denotes the amplitude value at coordinate position (u, v) in {M_org(u, v)}, and A_org(u, v) denotes the phase value at coordinate position (u, v) in {A_org(u, v)};
④-2. Calculate the amplitude of the high-frequency component of {M_org(u, v)}, denoted {R_org(u, v)}; the high-frequency amplitude at coordinate position (u, v) in {R_org(u, v)} is R_org(u, v) = log(M_org(u, v)) − h_m(u, v) * log(M_org(u, v)), where log() is the natural logarithm (base e, e = 2.718281828), "*" is the convolution operator and h_m(u, v) denotes an m × m mean filter;
④-3. Take the inverse discrete Fourier transform according to {R_org(u, v)} and {A_org(u, v)}, and take the resulting inverse-transform image as the saliency map of {CM_org(x, y)}, denoted {SM_org(x, y)}, where SM_org(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {SM_org(x, y)};
④-4. Following the operations of steps ④-1 to ④-3 for obtaining the saliency map of {CM_org(x, y)}, obtain in the same manner the saliency map of {CM_dis(x, y)}, denoted {SM_dis(x, y)}, where SM_dis(x, y) denotes the pixel value of the pixel point with coordinate position (x, y) in {SM_dis(x, y)}.
CN201210357956.8A 2012-09-24 2012-09-24 Three-dimensional picture quality objective evaluation method based on feature fusion Expired - Fee Related CN102903107B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210357956.8A CN102903107B (en) 2012-09-24 2012-09-24 Three-dimensional picture quality objective evaluation method based on feature fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210357956.8A CN102903107B (en) 2012-09-24 2012-09-24 Three-dimensional picture quality objective evaluation method based on feature fusion

Publications (2)

Publication Number Publication Date
CN102903107A CN102903107A (en) 2013-01-30
CN102903107B true CN102903107B (en) 2015-07-08

Family

ID=47575320

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210357956.8A Expired - Fee Related CN102903107B (en) 2012-09-24 2012-09-24 Three-dimensional picture quality objective evaluation method based on feature fusion

Country Status (1)

Country Link
CN (1) CN102903107B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103200420B (en) * 2013-03-19 2015-03-25 宁波大学 Three-dimensional picture quality objective evaluation method based on three-dimensional visual attention
CN103281556B (en) * 2013-05-13 2015-05-13 宁波大学 Objective evaluation method for stereo image quality on the basis of image decomposition
CN103369348B (en) * 2013-06-27 2015-03-25 宁波大学 Three-dimensional image quality objective evaluation method based on regional importance classification
CN106960432B (en) * 2017-02-08 2019-10-25 宁波大学 A kind of no reference stereo image quality evaluation method
CN107945151B (en) * 2017-10-26 2020-01-21 宁波大学 Repositioning image quality evaluation method based on similarity transformation
CN108694705B (en) * 2018-07-05 2020-12-11 浙江大学 Multi-frame image registration and fusion denoising method
CN109903273B (en) * 2019-01-30 2023-03-17 武汉科技大学 Stereo image quality objective evaluation method based on DCT domain characteristics

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101378519A (en) * 2008-09-28 2009-03-04 宁波大学 Method for evaluating quality-lose referrence image quality base on Contourlet transformation
CN101610425A (en) * 2009-07-29 2009-12-23 清华大学 A kind of method and apparatus of evaluating stereo image quality
CN102170581A (en) * 2011-05-05 2011-08-31 天津大学 Human-visual-system (HVS)-based structural similarity (SSIM) and characteristic matching three-dimensional image quality evaluation method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4817246B2 (en) * 2006-07-31 2011-11-16 Kddi株式会社 Objective video quality evaluation system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101378519A (en) * 2008-09-28 2009-03-04 宁波大学 Method for evaluating quality-lose referrence image quality base on Contourlet transformation
CN101610425A (en) * 2009-07-29 2009-12-23 清华大学 A kind of method and apparatus of evaluating stereo image quality
CN102170581A (en) * 2011-05-05 2011-08-31 天津大学 Human-visual-system (HVS)-based structural similarity (SSIM) and characteristic matching three-dimensional image quality evaluation method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Quality Assessment of Stereoscopic Images; Alexandre Benoit, et al.; 2008-10-14; pp. 1-13 *
Asymmetric distortion based on wavelet image fusion; Zhou Wujie, et al.; Opto-Electronic Engineering (《光电工程》); Nov. 2011; Vol. 38, No. 11; pp. 100-105 *

Also Published As

Publication number Publication date
CN102903107A (en) 2013-01-30

Similar Documents

Publication Publication Date Title
CN102903107B (en) Three-dimensional picture quality objective evaluation method based on feature fusion
CN103581661B (en) Method for evaluating visual comfort degree of three-dimensional image
CN104036501B (en) A kind of objective evaluation method for quality of stereo images based on rarefaction representation
CN103413298B (en) A kind of objective evaluation method for quality of stereo images of view-based access control model characteristic
CN102843572B (en) Phase-based stereo image quality objective evaluation method
CN102708567B (en) Visual perception-based three-dimensional image quality objective evaluation method
CN103517065B (en) Method for objectively evaluating quality of degraded reference three-dimensional picture
CN103136748B (en) The objective evaluation method for quality of stereo images of a kind of feature based figure
CN105282543B (en) Total blindness three-dimensional image quality objective evaluation method based on three-dimensional visual perception
CN104658001A (en) Non-reference asymmetric distorted stereo image objective quality assessment method
CN105407349A (en) No-reference objective three-dimensional image quality evaluation method based on binocular visual perception
CN105357519B (en) Quality objective evaluation method for three-dimensional image without reference based on self-similarity characteristic
CN104581143A (en) Reference-free three-dimensional picture quality objective evaluation method based on machine learning
CN103200420B (en) Three-dimensional picture quality objective evaluation method based on three-dimensional visual attention
CN103354617B (en) Boundary strength compressing image quality objective evaluation method based on DCT domain
Geng et al. A stereoscopic image quality assessment model based on independent component analysis and binocular fusion property
CN104954778A (en) Objective stereo image quality assessment method based on perception feature set
CN104036502A (en) No-reference fuzzy distorted stereo image quality evaluation method
CN102999912B (en) A kind of objective evaluation method for quality of stereo images based on distortion map
CN103369348B (en) Three-dimensional image quality objective evaluation method based on regional importance classification
CN103841411B (en) A kind of stereo image quality evaluation method based on binocular information processing
CN102999911B (en) Three-dimensional image quality objective evaluation method based on energy diagrams
CN102708568B (en) Stereoscopic image objective quality evaluation method on basis of structural distortion
CN104361583A (en) Objective quality evaluation method of asymmetrically distorted stereo images
CN103745457A (en) Stereo image objective quality evaluation method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20191218

Address after: Room 1,020, Nanxun Science and Technology Pioneering Park, No. 666 Chaoyang Road, Nanxun District, Huzhou City, Zhejiang Province, 313000

Patentee after: Huzhou You Yan Intellectual Property Service Co.,Ltd.

Address before: 315211 Zhejiang Province, Ningbo Jiangbei District Fenghua Road No. 818

Patentee before: Ningbo University

TR01 Transfer of patent right

Effective date of registration: 20201229

Address after: 213001 3rd floor, Jinhu innovation center, No.8 Taihu Middle Road, Xinbei District, Changzhou City, Jiangsu Province

Patentee after: Jiangsu Qizhen Information Technology Service Co.,Ltd.

Address before: 313000 room 1020, science and Technology Pioneer Park, 666 Chaoyang Road, Nanxun Town, Nanxun District, Huzhou, Zhejiang.

Patentee before: Huzhou You Yan Intellectual Property Service Co.,Ltd.

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20150708