Disclosure of Invention
The invention aims to solve the technical problem of providing a method for extracting a three-dimensional image saliency map, which accords with the saliency semantic features and has stronger extraction stability and higher extraction accuracy.
The technical scheme adopted by the invention for solving the technical problems is as follows: a method for extracting a three-dimensional image saliency map is characterized by comprising a training stage and a testing stage, wherein the training stage comprises the following specific steps:
① -1, the selected N sets of different stereo images and the right parallax image of each stereo image form a set, which is marked as { Li,Ri,diI is more than or equal to 1 and less than or equal to N, wherein N is more than or equal to 1, and LiRepresents { Li,Ri,diL 1. ltoreq. i. ltoreq.N } left viewpoint image of ith stereoscopic image, RiRepresents { Li,Ri,diI is not less than 1 and not more than N, diRepresents { Li,Ri,diI is not less than 1 and not more than N, and right parallax images of the ith stereo image;
① -2, using superpixel splitting technique to divide { Li,Ri,diI is more than or equal to 1 and less than or equal to N, the right viewpoint image of each stereo image is divided into M non-overlapping areas, R isiThe h-th area in (1) is denoted as SPi,hWherein M is more than or equal to 1, h is more than or equal to 1 and less than or equal to M;
① -3, calculation of { Li,Ri,diI is not less than 1 and not more than N, R is the contrast characteristic vector of each region in the right viewpoint image of each stereoscopic imageiH-th area SP of (1)i,hIs recorded as a contrast feature vector Wherein,has a dimension of 36, here the symbol "[ 2 ]]"is a vector representing a symbol, di,hRepresents SPi,hFirst feature vector u ofi,hThe distance from the first feature vector of the neighboring area,fi,hhas a dimension of 20, fi,hRepresents SPi,hC mean value of the frequency response feature vectors of all the pixels in (1)i,hHas a dimension of 9, ci,hRepresents SPi,hThe mean value of the color feature vectors of all the pixel points in (d)i,hRepresents SPi,hIs measured in the mean value of the parallax amplitude of (c),represents SPi,hColor histograms of R, G and B components of all pixel points in RGB color space and SPi,hThe distances of all pixel points in the adjacent regions of the color histogram of the R component, the G component and the B component of the RGB color space,represents SPi,hThe color histograms of L component, a component and b component of all pixel points in CIELAB color space and SPi,hThe distances of the color histograms of the L component, the a component and the b component of the CIELAB color space of all the pixel points in the neighboring region of (a),represents SPi,hColor histogram and SP of H component of all pixel points in HVS color spacei,hAll pixel points in the neighboring region of (a) are at the distance of the color histogram of the H component of the HVS color space,represents SPi,hColor histogram of S component of all pixel points in HVS color space and SPi,hThe distance of all pixel points in the neighboring region of (a) in the color histogram of the S component of the HVS color space,represents SPi,hLBP feature statistical histogram and SP of all pixel points ini,hThe distance of the statistical histogram of the LBP features of all pixel points in the neighboring region,represents SPi,hThe parallax statistical histogram and SP of all the pixel points ini,hThe distance of the parallax statistical histogram of all the pixel points in the adjacent region, where the adjacent region is RiNeutral SPi,hAn adjacent region;
① -4, calculation of { Li,Ri,diI is not less than 1 and not more than N, R is the general feature vector of each region in the right viewpoint image of each stereoscopic imageiH-th area SP of (1)i,hIs given as Wherein,has a dimension of 33, here the symbol "[ 2 ]]"is a vector representing a symbol and,has a dimension of 20 a and has a high degree of,represents SPi,hThe frequency response feature vector of all the pixel points inThe difference is that the number of the first and second,has a dimension of 9 a and has a high degree of,represents SPi,hThe variance of the color feature vectors of all the pixel points in (a),represents SPi,hOf the parallax amplitude, xi,hHas a dimension of 2, xi,hRepresents SPi,hS of the center pixel pointi,hRepresents SPi,hThe area of (d);
① -5, calculating { Li,Ri,diI is more than or equal to 1 and less than or equal to N, and R is equal to RiH-th area SP of (1)i,hIs recorded as Wherein,has a dimension of 36, here the symbol "[ 2 ]]"is a vector representing a symbol, ei,hRepresents SPi,hFirst feature vector u ofi,hThe distance from the first feature vector of the background region,fi,hhas a dimension of 20, fi,hRepresents SPi,hC mean value of the frequency response feature vectors of all the pixels in (1)i,hHas a dimension of 9, ci,hRepresents SPi,hThe mean of the color feature vectors of all the pixel points in (a),represents SPi,hIs measured in the mean value of the parallax amplitude of (c),represents SPi,hThe color histogram of R component, G component and B component of all pixel points in RGB color space and RiThe distances of all the pixel points in the background region in the color histogram of the R component, G component and B component of the RGB color space,represents SPi,hThe color histograms of L component, a component and b component and R of all pixel points in CIELAB color spaceiThe distances of the color histograms of the L component, the a component and the b component of the CIELAB color space of all the pixel points in the background area in (1),represents SPi,hColor histogram and R of H component of all pixel points in HVS color spaceiThe distance of all the pixel points in the background region in the color histogram of the H component of the HVS color space,represents SPi,hColor histogram of S component of all pixel points in HVS color space and RiThe distance of all pixel points in the background region in the color histogram of the S component of the HVS color space,represents SPi,hLBP feature statistical histogram and R of all pixel points iniThe distance of the statistical histogram of the LBP features of all the pixels in the background region in (1),represents SPi,hThe parallax statistical histogram and R of all the pixel points iniThe distance of the parallax statistical histogram of all the pixel points in the background region is RiThe areas positioned at the leftmost side, the rightmost side, the uppermost side and the lowermost side;
① -6, will { Li,Ri,diI is more than or equal to 1 and less than or equal to N), the contrast characteristic vector, the general characteristic vector and the background prior characteristic vector of each region in the right viewpoint image of each stereo image are arranged in sequence to form { L ≦i,Ri,diI 1 ≦ i ≦ N } for each region in the right view image of each stereoscopic image, R is a feature vector for reflecting visual saliencyiH-th area SP of (1)i,hThe feature vector for reflecting the visual saliency is marked as Xi,h,Wherein, Xi,hHas a dimension of 105, here the symbol "[ 2 ]]"is a vector representation symbol;
① -7, using random forest regression, on { L }i,Ri,diI is not less than 1 and not more than N, training the feature vectors for reflecting the visual saliency of all the regions in the right viewpoint images of all the stereoscopic images, and minimizing the error between the regression function value obtained through training and the average eye movement value,obtaining an optimal random forest regression training model, and recording the optimal random forest regression training model as f (D)inp) Wherein f () is a functional representation form, DinpRepresenting an input vector of a random forest regression training model;
the specific steps of the test stage are as follows:
② -1, for any one test stereo image StestWill StestThe left viewpoint image, the right viewpoint image, and the right parallax image are expressed as Ltest、Rtest、dtest(ii) a Then adopting super pixel segmentation technique to divide RtestDividing into M non-overlapping regions, and dividing RtestThe h-th area in (1) is denoted as SPh'; wherein M is more than or equal to 1, h is more than or equal to 1 and less than or equal to M;
② -2, following the procedure of step ① -3 through step ① -6, R is obtained in the same manner of operationtestFor each region of (a) to reflect visual saliency, RtestH-th area SP of (1)h' the feature vector for reflecting the visual saliency is denoted as Ftest,h(ii) a Then, training a model f (D) according to the optimal random forest regression obtained in the training stageinp) Will Ftest,hObtaining R as an input vector of an optimal random forest regression training modeltestOf each region of (a), RtestH-th area SP of (1)h' the three-dimensional visual saliency value is denoted as S3D,h,S3D,h=f(Ftest,h) (ii) a Then R is puttestThe three-dimensional visual saliency value of each region in the R-image is taken as the saliency value of all pixel points in the corresponding region, thereby obtaining RtestIs marked as { S3D(x, y) }, wherein (x, y) here denotes StestX is more than or equal to 1 and less than or equal to W, y is more than or equal to 1 and less than or equal to H, and W and H correspondingly represent StestWidth and height of (S)3D(x, y) represents S3DAnd the coordinate position in the (x, y) is the pixel value of the pixel point of (x, y).
R in the step ① -3iH-th area SP of (1)i,hContrast characteristic ofVectorThe acquisition process comprises the following steps:
a1, calculating RiH-th area SP of (1)i,hThe mean value of the frequency response characteristic vectors of all the pixel points is recorded as fi,h,fi,hTo (1)The value of each element is equal to RiH-th area SP of (1)i,hIn the frequency response feature vector of all the pixel pointsMean of the frequency response amplitudes of the elements, where fi,hHas a dimension of 20 a and has a high degree of,
a2, calculating RiH-th area SP of (1)i,hThe mean value of the color feature vectors of all the pixel points in (1) is marked as ci,h, Wherein, ci,hHas a dimension of 9, here the symbol "[ 2 ]]"is a vector representing a symbol and,represents RiH-th area SP of (1)i,hThe mean value of the color values of the R components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the G components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the B components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average value of the color values of the L components of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average value of the color values of the a component of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hB component of all pixel points in CIELAB color spaceIs determined by the mean of the color values of (c),represents RiH-th area SP of (1)i,hThe average of the color values of the H component of the HVS color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the V components of all the pixel points in the HVS color space,represents RiH-th area SP of (1)i,hThe mean value of the color values of the S components of all the pixel points in the HVS color space;
a3, calculating RiH-th area SP of (1)i,hIs the mean of the parallax amplitude ofIs equal to diNeutral SPi,hThe mean value of the pixel values of all the pixel points in the corresponding region;
a4, mixing fi,h、ci,hAndarranged in sequence to form RiH-th area SP of (1)i,hFirst feature vector of (1), denoted as ui,h,Wherein u isi,hHas a dimension of 30, here the symbol "[ 2 ]]"is a vector representation symbol;
a5, calculating RiH-th area SP of (1)i,hFirst feature vector u ofi,hThe distance from the first feature vector of the neighboring region, denoted di,h,Wherein d isi,hHas a dimension of 30, p is more than or equal to 1 and less than or equal to M,represents RiH-th area SP of (1)i,hU of adjacent areas of the sequence numberi,pRepresents RiP-th area SP in (1)i,pThe symbol "|" is an absolute value symbol, and P represents RiH-th area SP of (1)i,hIs referred to as RiNeutral SPi,hAn adjacent region;
a6, calculating RiH-th area SP of (1)i,hThe color histograms of R, G and B components of all the pixel points in RGB color space are recorded asCalculation of RiH-th area SP of (1)i,hThe color histograms of the L component, the a component and the b component of all the pixel points in the CIELAB color space are recorded asCalculation of RiH-th area SP of (1)i,hThe color histogram of the H component of all the pixel points in the HVS color space is recorded asCalculation of RiH-th area SP of (1)i,hThe color histogram of S component of all pixel points in HVS color space is recorded asCalculation of RiH-th area SP of (1)i,hThe statistical histogram of the LBP characteristics of all the pixel points in (1) is recorded asCalculation of RiH of (1)Region SPi,hThe parallax statistical histogram of all the pixel points in (1) is recorded asWherein,has dimension of 163,Has dimension of 163,Has a dimension of 16 a and has a high degree of,has a dimension of 16 a and has a high degree of,has a dimension of 256, and has a high thermal conductivity,has a dimension of 16;
a7, calculatingAnd RiH-th area SP of (1)i,hThe distances of the color histograms of the R component, the G component and the B component of the RGB color space of all the pixel points in the adjacent areas are recorded as
ComputingAnd RiH-th area SP of (1)i,hAll pixel points in adjacent areas of (1) are in CIThe distance of the color histogram of the L, a and b components of the ELAB color space is noted
ComputingAnd RiH-th area SP of (1)i,hThe distance of the color histogram of the H component of the HVS color space of all the pixel points in the adjacent region is recorded as
ComputingAnd RiH-th area SP of (1)i,hThe distance of the color histogram of the S component of the HVS color space of all the pixel points in the adjacent region is recorded as
ComputingAnd RiH-th area SP of (1)i,hThe distance of the LBP feature statistical histogram of all the pixel points in the adjacent region is recorded as
ComputingAnd RiH-th area SP of (1)i,hThe distance of the parallax statistical histogram of all the pixel points in the adjacent region is recorded as
Wherein p is more than or equal to 1 and less than or equal to M,represents RiH-th area SP of (1)i,hP represents RiH-th area SP of (1)i,hIs the chi () function of the chi-squared distance,represents RiP-th area SP in (1)i,pThe color histograms of the R, G and B components of all the pixel points in the RGB color space,represents RiP-th area SP in (1)i,pThe color histograms of the L component, the a component and the b component of all the pixel points in the CIELAB color space,represents RiP-th area SP in (1)i,pThe color histogram of the H component of the HVS color space for all the pixel points in (a),represents RiP-th area SP in (1)i,pThe color histogram of the S component of the HVS color space for all the pixel points in (a),represents RiP-th area SP in (1)i,pThe statistical histogram of LBP features of all pixel points in (1),represents RiP-th area SP in (1)i,pThe parallax statistics histogram of all the pixel points in the image;
a8, di,h、Andarranged in sequence to form RiH-th area SP of (1)i,hIs recorded as the contrast feature vector of Wherein,has a dimension of 36, here the symbol "[ 2 ]]"is a vector representing a symbol.
R in the step ① -4iH-th area SP of (1)i,hIs a universal feature vectorThe acquisition process comprises the following steps:
b1, calculating RiH-th area SP of (1)i,hThe variance of the frequency response feature vectors of all the pixel points is recorded asTo (1)The value of each element is equal to RiH-th area SP of (1)i,hIn the frequency response feature vector of all the pixel pointsThe variance of the frequency response amplitude of the individual elements, wherein,has a dimension of 20 a and has a high degree of,
b2, calculating RiH-th area SP of (1)i,hThe variance of the color feature vectors of all the pixel points is recorded as Wherein,has a dimension of 9, here the symbol "[ 2 ]]"is a vector representing a symbol and,represents RiH-th area SP of (1)i,hThe variance of the color values of the R component of the RGB color space for all pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the G component of the RGB color space for all pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the B component of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the L component of the CIELAB color space for all pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the a component of the CIELAB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the b-component in CIELAB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the H component of the HVS color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the V component of the HVS color space for all pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the S component of all the pixel points in the HVS color space;
b3, calculating RiH-th area SP of (1)i,hThe variance of the parallax amplitude of (1), is recorded asIs equal to diNeutral SPi,hVariance of pixel values of all pixel points in the corresponding region;
b4, obtaining RiH-th area SP of (1)i,hOf the central pixelCoordinate position of point, noted as xi,hWherein x isi,hHas a dimension of 2;
b5, calculating RiH-th area SP of (1)i,hArea of (d), denoted as si,h;
b6, willxi,hAnd si,hArranged in sequence to form RiH-th area SP of (1)i,hIs a universal feature vector of Wherein,has a dimension of 33, here the symbol "[ 2 ]]"is a vector representing a symbol.
R in the step ① -5iH-th area SP of (1)i,hBackground prior feature vector ofThe acquisition process comprises the following steps:
c1, calculating RiH-th area SP of (1)i,hThe Chinese herbal medicineThe mean value of the frequency response feature vectors with pixel points is recorded as fi,h,fi,hTo (1)The value of each element is equal to RiH-th area SP of (1)i,hIn the frequency response feature vector of all the pixel pointsMean of the frequency response amplitudes of the elements, where fi,hHas a dimension of 20 a and has a high degree of,
c2, calculating RiH-th area SP of (1)i,hThe mean value of the color feature vectors of all the pixel points in (1) is marked as ci,h, Wherein, ci,hHas a dimension of 9, here the symbol "[ 2 ]]"is a vector representing a symbol and,represents RiH-th area SP of (1)i,hThe mean value of the color values of the R components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the G components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the B components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average value of the color values of the L components of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average value of the color values of the a component of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average value of the color values of the b components of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average of the color values of the H component of the HVS color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the V components of all the pixel points in the HVS color space,represents RiH-th area SP of (1)i,hThe mean value of the color values of the S components of all the pixel points in the HVS color space;
c3, calculating RiH-th area SP of (1)i,hIs the mean of the parallax amplitude ofIs equal to diNeutral SPi,hThe mean value of the pixel values of all the pixel points in the corresponding region;
c4, mixing fi,h、ci,hAndarranged in sequence to form RiH-th area SP of (1)i,hFirst feature vector of (1), denoted as ui,h,Wherein u isi,hHas a dimension of 30, here the symbol "[ 2 ]]"is a vector representation symbol;
c5, calculating RiH-th area SP of (1)i,hFirst feature vector u ofi,hThe distance from the first feature vector of the background region, denoted as ei,h,Wherein e isi,hHas a dimension of 30, q is more than or equal to 1 and less than or equal to M,represents RiAll backgrounds in (1)Set of sequence numbers of regions, ui,qRepresents RiThe q-th region SP in (1)i,qThe symbol "|" is an absolute value symbol, and Q represents RiThe total number of background regions in (1), where the background region is RiThe areas positioned at the leftmost side, the rightmost side, the uppermost side and the lowermost side;
c6, calculating RiH-th area SP of (1)i,hThe color histograms of R, G and B components of all the pixel points in RGB color space are recorded asCalculation of RiH-th area SP of (1)i,hThe color histograms of the L component, the a component and the b component of all the pixel points in the CIELAB color space are recorded asCalculation of RiH-th area SP of (1)i,hThe color histogram of the H component of all the pixel points in the HVS color space is recorded asCalculation of RiH-th area SP of (1)i,hThe color histogram of S component of all pixel points in HVS color space is recorded asCalculation of RiH-th area SP of (1)i,hThe statistical histogram of the LBP characteristics of all the pixel points in (1) is recorded asCalculation of RiH-th area SP of (1)i,hThe parallax statistical histogram of all the pixel points in (1) is recorded asWherein,has dimension of 163,Has dimension of 163,Has a dimension of 16 a and has a high degree of,has a dimension of 16 a and has a high degree of,has a dimension of 256, and has a high thermal conductivity,has a dimension of 16;
c7, calculationAnd RiThe distances of the color histograms of the R, G and B components in the RGB color space of all the pixel points in the background region are recorded as
ComputingAnd RiThe distances of the color histograms of the L component, the a component and the b component of all the pixel points in the background area in the CIELAB color space are recorded as the distances
ComputingAnd RiThe distance of the color histogram of the H component of the HVS color space of all the pixel points in the background area is recorded as
ComputingAnd RiThe distance of the color histogram of the S component of the HVS color space of all the pixel points in the background area is recorded as
ComputingAnd RiThe distance of the LBP feature statistical histogram of all the pixel points in the background area is recorded as
ComputingAnd RiThe distance of the parallax statistical histogram of all the pixel points in the background region is recorded as
Wherein q is more than or equal to 1 and less than or equal to M,represents RiQ represents RiThe total number of background regions, χ () is a chi-squared distance function,represents RiThe q-th region SP in (1)i,qThe color histograms of the R, G and B components of all the pixel points in the RGB color space,represents RiThe q-th region SP in (1)i,qThe color histograms of the L component, the a component and the b component of all the pixel points in the CIELAB color space,represents RiThe q-th region SP in (1)i,qThe color histogram of the H component of the HVS color space for all the pixel points in (a),represents RiThe q-th region SP in (1)i,qThe color histogram of the S component of the HVS color space for all the pixel points in (a),represents RiThe q-th region SP in (1)i,qThe statistical histogram of LBP features of all pixel points in (1),represents RiThe q-th region SP in (1)i,qAll the pixels in (1)A disparity statistical histogram of points;
c8, mixingi,h、 Arranged in sequence to form RiH-th area SP of (1)i,hIs recorded as the background prior feature vector of Wherein,has a dimension of 36, here the symbol "[ 2 ]]"is a vector representing a symbol.
Said RiThe obtaining process of the frequency response characteristic vector of each pixel point comprises the following steps:
1) -1, using a Gabor filter pair RiFiltering to obtain RiEach pixel point in (1) is at a different center frequencyFrequency and amplitude of frequency response at different directional factors, RiThe frequency response amplitude of the pixel point with the middle coordinate position (x, y) under the condition that the center frequency is omega and the direction factor is theta is marked as G (x, y; omega, theta), wherein (x, y) represents { L }i,Ri,diI is more than or equal to 1 and less than or equal to N, x is more than or equal to 1 and less than or equal to W, y is more than or equal to 1 and less than or equal to H, and W and H correspondingly represent { L ≦ N }i,Ri,diI 1 ≦ i ≦ N }, ω represents the center frequency of the Gabor filter, ω ∈ Φωθ represents the directional factor of the Gabor filter, θ ∈ Φθ,ΦωRepresenting the set of all centre frequencies of the Gabor filter, phiθRepresents the set of all directional factors of the Gabor filter;
1) -2, reacting RiThe frequency response amplitude of each pixel point in the R-array is arranged in sequence under different central frequencies and different direction factors to form RiR, the frequency response feature vector of each pixel point iniThe frequency response characteristic vector of the pixel point with the middle coordinate position (x, y) is marked as fi(x, y) wherein fiThe dimension of (x, y) is 20.
Said RiThe obtaining process of the color feature vector of each pixel point comprises the following steps:
2) -1, calculating RiThe color value of each pixel point in different color spaces is RiThe color values of the R component, the G component and the B component of the pixel point with the (x, y) middle coordinate position in the RGB color space are respectively recorded as R (x, y), G (x, y) and B (x, y), and R (x, y) is recordediThe color values of the L component, the a component and the b component of the pixel point with the middle coordinate position (x, y) in the CIELAB color space are respectively recorded as L (x, y), a (x, y) and b (x, y), and R is used for indicating the color values of the L component, the a component and the b component in the CIELAB color space as L (x, y), a (x, y) and b (iThe color values of the H component, the V component and the S component of the pixel point with the (x, y) middle coordinate position in the HVS color space are respectively recorded as H (x, y), V (x, y) and S (x, y), wherein (x, y) here represents { L }i,Ri,diI is more than or equal to 1 and less than or equal to N, x is more than or equal to 1 and less than or equal to W, y is more than or equal to 1 and less than or equal to H, and W and H correspondingly represent { L ≦ N }i,Ri,diThe width and height of the stereoscopic image in the condition that i is more than or equal to 1 and less than or equal to N;
2) -2, reacting RiThe color values of each pixel point in the different color spaces are arranged in sequence to form RiThe color feature vector of each pixel point in (1), RiThe color feature vector of the pixel point with the middle coordinate position (x, y) is marked as ci(x,y),ci(x,y)=[R(x,y),G(x,y),B(x,y),L(x,y),a(x,y),b(x,y),H(x,y),V(x,y),S(x,y)]Wherein c isiThe dimension of (x, y) is 9, where the symbol "[ 2 ],"]"is a vector representing a symbol.
Compared with the prior art, the invention has the advantages that:
1) the method simultaneously considers the contrast characteristic vector, the general characteristic vector and the background prior characteristic vector of each region in the right viewpoint image of the stereo image, and fuses to obtain the characteristic vector for reflecting the visual saliency of each region in the right viewpoint image of the stereo image, so the method has higher extraction accuracy and stronger stability, can better reflect the remarkable change condition of various factors, and conforms to the remarkable semantic characteristics.
2) The method of the invention constructs a random forest regression training model between the feature vector for reflecting the visual saliency and the average eye movement value through training, and then predicts the three-dimensional visual saliency value of each region in the right viewpoint image of the tested stereo image by using the random forest regression training model, thereby obtaining the three-dimensional saliency map of the tested stereo image and effectively improving the prediction accuracy of the visual saliency value.
Detailed Description
The invention is described in further detail below with reference to the accompanying examples.
The overall implementation block diagram of the method for extracting the saliency map of the stereo image provided by the invention is shown in fig. 1, and the method comprises two processes of a training stage and a testing stage, wherein the specific steps of the training stage are as follows:
① -1, the selected N sets of different stereo images and the right parallax image of each stereo image form a set, which is marked as { Li,Ri,diI is not less than 1 and not more than N, where N is not less than 1, and in this embodiment, N is 600, and L isiRepresents { Li,Ri,diL 1. ltoreq. i. ltoreq.N } left viewpoint image of ith stereoscopic image, RiRepresents { Li,Ri,diI is not less than 1 and not more than N, diRepresents { Li,Ri,diAnd i is not less than 1 and not more than N.
In this embodiment, a training stereo image set is constructed using a three-dimensional human eye tracking database (NUS 3D-sales database) provided by singapore national university, which contains 600 pairs of stereo images and corresponding right parallax images, and gives a real eye-diagram for each pair of stereo images.
① -2, using existing superpixel splitting techniquesi,Ri,diI is more than or equal to 1 and less than or equal to N, the right viewpoint image of each stereo image is divided into M non-overlapping areas, R isiThe h-th area in (1) is denoted as SPi,hCan be changed into { Li,Ri,diI is not less than 1 and not more than N), the right viewpoint image of each stereo image is represented as a set of M regions, R is represented as a set of M regionsiThe set of M regions re-represented is denoted as SPi,h}; wherein M is not less than 1, in this embodiment, M is 400, and h is not less than 1 and not more than M.
①-3Calculating { L }i,Ri,diI is not less than 1 and not more than N, R is the contrast characteristic vector of each region in the right viewpoint image of each stereoscopic imageiH-th area SP of (1)i,hIs recorded as a contrast feature vector Wherein,has a dimension of 36, here the symbol "[ 2 ]]"is a vector representing a symbol, di,hRepresents SPi,hFirst feature vector u ofi,hThe distance from the first feature vector of the neighboring area,fi,hhas a dimension of 20, fi,hRepresents SPi,hC mean value of the frequency response feature vectors of all the pixels in (1)i,hHas a dimension of 9, ci,hRepresents SPi,hThe mean of the color feature vectors of all the pixel points in (a),represents SPi,hIs measured in the mean value of the parallax amplitude of (c),represents SPi,hColor histograms of R, G and B components of all pixel points in RGB color space and SPi,hThe distances of all pixel points in the adjacent regions of the color histogram of the R component, the G component and the B component of the RGB color space,represents SPi,hThe color histograms of L component, a component and b component of all pixel points in CIELAB color space and SPi,hThe distances of the color histograms of the L component, the a component and the b component of the CIELAB color space of all the pixel points in the neighboring region of (a),represents SPi,hColor histogram and SP of H component of all pixel points in HVS color spacei,hAll pixel points in the neighboring region of (a) are at the distance of the color histogram of the H component of the HVS color space,represents SPi,hColor histogram of S component of all pixel points in HVS color space and SPi,hThe distance of all pixel points in the neighboring region of (a) in the color histogram of the S component of the HVS color space,represents SPi,hLBP feature statistical histogram and SP of all pixel points ini,hThe distance of the statistical histogram of the LBP features of all pixel points in the neighboring region,represents SPi,hThe parallax statistical histogram and SP of all the pixel points ini,hIn adjacent areas ofThe distance of the parallax statistical histogram of all the pixel points, wherein the adjacent region is RiNeutral SPi,hAdjacent regions.
In this embodiment, R in step ① -3iH-th area SP of (1)i,hContrast feature vector ofThe acquisition process comprises the following steps:
a1, calculating RiH-th area SP of (1)i,hThe mean value of the frequency response characteristic vectors of all the pixel points is recorded as fi,h,fi,hTo (1)The value of each element is equal to RiH-th area SP of (1)i,hIn the frequency response feature vector of all the pixel pointsMean of the frequency response amplitudes of the elements, where fi,hHas a dimension of 20 a and has a high degree of,
a2, calculating RiH-th area SP of (1)i,hThe mean value of the color feature vectors of all the pixel points in (1) is marked as ci,h, Wherein, ci,hHas a dimension of 9, here the symbol "[ 2 ]]"is a vector representing a symbol and,represents RiH-th area SP of (1)i,hThe mean value of the color values of the R components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the G components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the B components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average value of the color values of the L components of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hAll ofThe average value of the color values of the a component of the pixel point in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average value of the color values of the b components of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average of the color values of the H component of the HVS color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the V components of all the pixel points in the HVS color space,represents RiH-th area SP of (1)i,hThe mean value of the color values of the S component of all the pixel points in the HVS color space.
a3, calculating RiH-th area SP of (1)i,hIs the mean of the parallax amplitude ofIs equal to diNeutral SPi,hAnd the mean value of the pixel values of all the pixel points in the corresponding region.
a4, mixing fi,h、ci,hAndarranged in sequence to form RiH-th area SP of (1)i,hFirst feature vector of (1), denoted as ui,h,Wherein u isi,hHas a dimension of 30, here the symbol "[ 2 ]]"is a vector representing a symbol.
a5, calculating RiH-th area SP of (1)i,hFirst feature vector u ofi,hThe distance from the first feature vector of the neighboring region, denoted di,h,Wherein d isi,hHas a dimension of 30, p is more than or equal to 1 and less than or equal to M,represents RiH-th area SP of (1)i,hU of adjacent areas of the sequence numberi,pRepresents RiP-th area SP in (1)i,p(SPi,pIs SPi,hAdjacent region of (b) with the symbol "|" being an absolute value symbol, P represents RiH-th area SP of (1)i,hIn this embodiment, P is 20, and the adjacent area is RiNeutral SPi,hAdjacent regions.
a6, calculating RiH-th area SP of (1)i,hThe color histograms of R, G and B components of all the pixel points in RGB color space are recorded asCalculation of RiH-th area SP of (1)i,hThe color histograms of the L component, the a component and the b component of all the pixel points in the CIELAB color space are recorded asCalculation of RiH-th area SP of (1)i,hThe color histogram of the H component of all the pixel points in the HVS color space is recorded asCalculation of RiH-th area SP of (1)i,hThe color histogram of S component of all pixel points in HVS color space is recorded asCalculation of RiH-th area SP of (1)i,hThe statistical histogram of the LBP characteristics of all the pixel points in (1) is recorded asCalculation of RiH-th area SP of (1)i,hThe parallax statistical histogram of all the pixel points in (1) is recorded asWherein,has dimension of 163,Has dimension of 163,Has a dimension of 16 a and has a high degree of,has a dimension of 16 a and has a high degree of,has a dimension of 256, and has a high thermal conductivity,has dimension of 16.
a7, calculatingAnd RiH-th area SP of (1)i,hThe distances of the color histograms of the R component, the G component and the B component of the RGB color space of all the pixel points in the adjacent areas are recorded as
ComputingAnd RiH-th area SP of (1)i,hThe distances of the color histograms of the L component, the a component and the b component of all the pixel points in the adjacent areas in the CIELAB color space are recorded as
ComputingAnd RiH-th area SP of (1)i,hThe distance of the color histogram of the H component of the HVS color space of all the pixel points in the adjacent region is recorded as
ComputingAnd RiH-th area SP of (1)i,hThe distance of the color histogram of the S component of the HVS color space of all the pixel points in the adjacent region is recorded as
ComputingAnd RiH-th area SP of (1)i,hThe distance of the LBP feature statistical histogram of all the pixel points in the adjacent region is recorded as
ComputingAnd RiH-th area SP of (1)i,hThe distance of the parallax statistical histogram of all the pixel points in the adjacent region is recorded as
Wherein p is more than or equal to 1 and less than or equal to M,represents RiH-th area SP of (1)i,hP represents RiH-th area SP of (1)i,hP is 20, χ () is a Chi-distance measure function,represents RiP-th area SP in (1)i,pThe color histograms of the R, G and B components of all the pixel points in the RGB color space,represents RiP-th area SP in (1)i,pL of all pixel points in CIELAB color spaceColor histograms of the quantity, a-component and b-component,represents RiP-th area SP in (1)i,pThe color histogram of the H component of the HVS color space for all the pixel points in (a),represents RiP-th area SP in (1)i,pThe color histogram of the S component of the HVS color space for all the pixel points in (a),represents RiP-th area SP in (1)i,pThe statistical histogram of LBP features of all pixel points in (1),represents RiP-th area SP in (1)i,pThe disparity statistical histogram of all the pixel points in (1).
a8, di,h、Andarranged in sequence to form RiH-th area SP of (1)i,hIs recorded as the contrast feature vector of Wherein,has a dimension of 36, here the symbol "[ 2 ]]"is a vector representing a symbol.
① -4, calculation of { Li,Ri,diI is not less than 1 and not more than N, R is the general feature vector of each region in the right viewpoint image of each stereoscopic imageiH-th area SP of (1)i,hIs given as Wherein,has a dimension of 33, here the symbol "[ 2 ]]"is a vector representing a symbol and,has a dimension of 20 a and has a high degree of,represents SPi,hThe variance of the frequency response feature vectors of all the pixel points in (a),has a dimension of 9 a and has a high degree of,represents SPi,hThe variance of the color feature vectors of all the pixel points in (a),represents SPi,hOf the parallax amplitude, xi,hHas a dimension of 2, xi,hRepresents SPi,hS of the center pixel pointi,hRepresents SPi,hThe area of (a).
In this embodiment, R in step ① -4iH-th area SP of (1)i,hIs a universal feature vectorThe acquisition process comprises the following steps:
b1, calculating RiH-th area SP of (1)i,hThe variance of the frequency response feature vectors of all the pixel points is recorded asTo (1)The value of each element is equal to RiH-th area SP of (1)i,hIn the frequency response feature vector of all the pixel pointsThe variance of the frequency response amplitude of the individual elements, wherein,has a dimension of 20 a and has a high degree of,
b2, calculating RiH-th area SP of (1)i,hThe variance of the color feature vectors of all the pixel points is recorded as Wherein,has a dimension of 9, here the symbol "[ 2 ]]"is a vector representing a symbol and,represents RiH-th area SP of (1)i,hThe variance of the color values of the R component of the RGB color space for all pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the G component of the RGB color space for all pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the B component of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the L component of the CIELAB color space for all pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the a component of the CIELAB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the b-component in CIELAB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the H component of the HVS color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the V component of the HVS color space for all pixel points in (a),represents RiH-th area SP of (1)i,hThe variance of the color values of the S component of the HVS color space for all the pixel points in (1).
b3, calculating RiH-th area SP of (1)i,hThe variance of the parallax amplitude of (1), is recorded asIs equal to diNeutralization ofSPi,hThe variance of the pixel values of all the pixel points in the corresponding region.
b4, obtaining RiH-th area SP of (1)i,hThe coordinate position of the central pixel point is marked as xi,hWherein x isi,hHas a dimension of 2.
b5, calculating RiH-th area SP of (1)i,hArea of (d), denoted as si,h。
b6, willxi,hAnd si,hArranged in sequence to form RiH-th area SP of (1)i,hIs a universal feature vector of Wherein,has a dimension of 33, here the symbol "[ 2 ]]"is a vector representing a symbol.
① -5, calculating { Li,Ri,diI is more than or equal to 1 and less than or equal to N, and R is equal to RiH-th area SP of (1)i,hIs recorded as Wherein,has a dimension of 36, here the symbol "[ 2 ]]"is a vector representing a symbol, ei,hRepresents SPi,hFirst feature vector u ofi,hThe distance from the first feature vector of the background region,fi,hhas a dimension of 20, fi,hRepresents SPi,hC mean value of the frequency response feature vectors of all the pixels in (1)i,hHas a dimension of 9, ci,hRepresents SPi,hThe mean of the color feature vectors of all the pixel points in (a),represents SPi,hIs measured in the mean value of the parallax amplitude of (c),represents SPi,hThe color histogram of R component, G component and B component of all pixel points in RGB color space and RiThe distances of all the pixel points in the background region in the color histogram of the R component, G component and B component of the RGB color space,represents SPi,hThe color histograms of L component, a component and b component and R of all pixel points in CIELAB color spaceiThe distances of the color histograms of the L component, the a component and the b component of the CIELAB color space of all the pixel points in the background area in (1),represents SPi,hColor histogram and R of H component of all pixel points in HVS color spaceiThe distance of all the pixel points in the background region in the color histogram of the H component of the HVS color space,represents SPi,hColor histogram of S component of all pixel points in HVS color space and RiThe distance of all pixel points in the background region in the color histogram of the S component of the HVS color space,represents SPi,hLBP feature statistical histogram and R of all pixel points iniThe distance of the statistical histogram of the LBP features of all the pixels in the background region in (1),represents SPi,hThe parallax statistical histogram and R of all the pixel points iniThe distance of the parallax statistical histogram of all the pixel points in the background region is RiThe leftmost, rightmost, uppermost and lowermost regions of the group.
In this embodiment, R in step ① -5iH-th area SP of (1)i,hBackground prior feature vector ofThe acquisition process comprises the following steps:
c1, calculating RiH-th area SP of (1)i,hThe mean value of the frequency response characteristic vectors of all the pixel points is recorded as fi,h,fi,hTo (1)The value of each element is equal to RiH-th area SP of (1)i,hIn the frequency response feature vector of all the pixel pointsMean of the frequency response amplitudes of the elements, where fi,hHas a dimension of 20 a and has a high degree of,
c2, calculating RiH-th area SP of (1)i,hThe mean value of the color feature vectors of all the pixel points in (1) is marked as ci,h, Wherein, ci,hHas a dimension of 9, here the symbol "[ 2 ]]"is a vector representing a symbol and,represents RiH-th area SP of (1)i,hThe mean value of the color values of the R components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the G components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the B components of the RGB color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average value of the color values of the L components of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average value of the color values of the a component of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average value of the color values of the b components of all the pixel points in the CIELAB color space,represents RiH-th area SP of (1)i,hThe average of the color values of the H component of the HVS color space for all the pixel points in (a),represents RiH-th area SP of (1)i,hThe average of the color values of the V components of all the pixel points in the HVS color space,represents RiH-th area SP of (1)i,hThe mean value of the color values of the S component of all the pixel points in the HVS color space.
c3, calculating RiH-th area SP of (1)i,hIs the mean of the parallax amplitude ofIs equal to diNeutral SPi,hAnd the mean value of the pixel values of all the pixel points in the corresponding region.
c4, mixing fi,h、ci,hAndarranged in sequence to form RiH-th area SP of (1)i,hFirst feature vector of (1), denoted as ui,h,Wherein u isi,hHas a dimension of 30, here the symbol "[ 2 ]]"is a vector representation symbol;
c5, calculating RiH-th area SP of (1)i,hFirst feature of (1)Eigenvector ui,hThe distance from the first feature vector of the background region, denoted as ei,h,Wherein e isi,hHas a dimension of 30, q is more than or equal to 1 and less than or equal to M,represents RiSet of sequence numbers of all background regions in ui,qRepresents RiThe q-th region SP in (1)i,q(SPi,qIs RiBackground region in), the symbol "|" is an absolute value symbol, Q represents RiThe total number of background regions in (1), where the background region is RiThe areas positioned at the leftmost side, the rightmost side, the uppermost side and the lowermost side in the middle are about to fall on RiThe leftmost, rightmost, uppermost and lowermost regions of (a) are taken as background regions.
c6, calculating RiH-th area SP of (1)i,hThe color histograms of R, G and B components of all the pixel points in RGB color space are recorded asCalculation of RiH-th area SP of (1)i,hThe color histograms of the L component, the a component and the b component of all the pixel points in the CIELAB color space are recorded asCalculation of RiH-th area SP of (1)i,hThe color histogram of the H component of all the pixel points in the HVS color space is recorded asCalculation of RiH-th area SP of (1)i,hThe color histogram of S component of all pixel points in HVS color space is recorded asCalculation of RiH-th area SP of (1)i,hThe statistical histogram of the LBP characteristics of all the pixel points in (1) is recorded asCalculation of RiH-th area SP of (1)i,hThe parallax statistical histogram of all the pixel points in (1) is recorded asWherein,has dimension of 163,Has dimension of 163,Has a dimension of 16 a and has a high degree of,has a dimension of 16 a and has a high degree of,has a dimension of 256, and has a high thermal conductivity,has dimension of 16.
c7, calculationAnd RiThe distances of the color histograms of the R, G and B components in the RGB color space of all the pixel points in the background region are recorded as
ComputingAnd RiThe distances of the color histograms of the L component, the a component and the b component of all the pixel points in the background area in the CIELAB color space are recorded as the distances
ComputingAnd RiThe distance of the color histogram of the H component of the HVS color space of all the pixel points in the background area is recorded as
ComputingAnd RiThe distance of the color histogram of the S component of the HVS color space of all the pixel points in the background area is recorded as
ComputingAnd RiThe distance of the LBP feature statistical histogram of all the pixel points in the background area is recorded as
ComputingAnd RiThe distance of the parallax statistical histogram of all the pixel points in the background region is recorded as
Wherein q is more than or equal to 1 and less than or equal to M,represents RiQ represents Riχ () is a Chi-distance measure function,represents RiThe q-th region SP in (1)i,qThe color histograms of the R, G and B components of all the pixel points in the RGB color space,represents RiThe q-th region SP in (1)i,qThe color histograms of the L component, the a component and the b component of all the pixel points in the CIELAB color space,represents RiThe q-th region SP in (1)i,qThe color histogram of the H component of the HVS color space for all the pixel points in (a),represents RiThe q-th region SP in (1)i,qThe color histogram of the S component of the HVS color space for all the pixel points in (a),represents RiThe q-th region SP in (1)i,qThe statistical histogram of LBP features of all pixel points in (1),represents RiThe q-th region SP in (1)i,qThe disparity statistical histogram of all the pixel points in (1).
c8, mixingi,h、Andarranged in sequence to form RiH-th area SP of (1)i,hIs recorded as the background prior feature vector of Wherein,has a dimension of 36, here the symbol "[ 2 ]]"is a vector representing a symbol.
① -6, will { Li,Ri,diI is more than or equal to 1 and less than or equal to N), the contrast characteristic vector, the general characteristic vector and the background prior characteristic vector of each region in the right viewpoint image of each stereo image are arranged in sequence to form { L ≦i,Ri,diI 1 ≦ i ≦ N } for each region in the right view image of each stereoscopic image, R is a feature vector for reflecting visual saliencyiH-th area SP of (1)i,hThe feature vector for reflecting the visual saliency is marked as Xi,h,Wherein, Xi,hHas a dimension of 105, here the symbol "[ 2 ]]"is a vector representing a symbol.
① -7, using existing random forest regression, on { L }i,Ri,diI is more than or equal to 1 and less than or equal to N), training the feature vectors for reflecting the visual saliency in all the regions of the right viewpoint images of all the stereo images, minimizing the error between the regression function value obtained through training and the average eye movement value, obtaining the optimal random forest regression training model, and marking the optimal random forest regression training model as f (D)inp) Wherein f () is a functional representation form, DinpAnd representing the input vector of the random forest regression training model.
The specific steps of the test phase are as follows:
② -1, for any one test stereo image StestWill StestThe left viewpoint image, the right viewpoint image, and the right parallax image are expressed as Ltest、Rtest、dtest(ii) a Then adopting the existing super-pixel segmentation technology to divide RtestIs divided intoM non-overlapping regions, RtestThe h-th area in (1) is denoted as SPh', R may betestRe-expressed as a set of M regions, denoted as { SPh' }; wherein M is not less than 1, in this embodiment, M is 400, and h is not less than 1 and not more than M.
② -2, following the procedure of step ① -3 through step ① -6, R is obtained in the same manner of operationtestFor each region of (a) to reflect visual saliency, RtestH-th area SP of (1)h' the feature vector for reflecting the visual saliency is denoted as Ftest,h,Ftest,hThe acquisition process comprises the following steps: calculation of RtestH-th area SP of (1)h' the contrast feature vector is denoted Wherein,has a dimension of 36, dh' means SPh' first feature vector uh' distance from the first feature vector of the neighboring area,fhdimension of' 20, fh' means SPh' average of frequency response feature vectors, chDimension of' 9, ch' means SPhThe mean value of the color feature vector of',represents SPhThe average value of the magnitude of the parallax of',represents SPh' color histograms of R, G, and B components of all pixels in RGB color space and SPh' distances of color histograms of R, G and B components of RGB color space for all pixel points in neighboring regions of,represents SPh' color histograms of L, a and b components of all pixels in CIELAB color space and SPh' distances of color histograms of L component, a component and b component of CIELAB color space for all pixel points in neighboring regions,represents SPh' color histogram of H component of HVS color space and SP of all pixel points inh'the distance of all pixel points in the neighborhood of' in the color histogram of the H component of the HVS color space,represents SPh' the color histogram of S component of all pixel points in HVS color space and SPh'distance of all pixel points in the neighborhood of' in the color histogram of the S component of the HVS color space,represents SPhLBP of all the pixels in `Feature statistical histogram and SPh' the distance of the statistical histogram of the LBP features of all pixel points in the neighborhood of,represents SPh' the statistical histogram of parallax and SP of all the pixels inh' the distance between the disparity statistical histogram of all the pixels in the neighboring region of SPh' adjacent regions; calculation of RtestH-th area SP of (1)h' general feature vector is denoted as Wherein,has a dimension of 33 a and has a high degree of,has a dimension of 20 a and has a high degree of,represents SPhThe variance of the frequency response feature vector of',has a dimension of 9 a and has a high degree of,represents SPhThe variance of the color feature vector of',represents SPhVariance of parallax amplitude, x of `hThe dimension of' is 2, xh' means SPh' coordinate position of center pixel, sh' means SPhThe area of `; calculation of RtestH-th area SP of (1)h' the background prior feature vector is denoted as Wherein,has a dimension of 36, eh' means SPh' first feature vector uh' distance from the first feature vector of the background region,represents SPhAll the pixels inDistances of color histograms of R, G, and B components of the RGB color space from color histograms of R, G, and B components of the RGB color space of all pixel points in the background region,represents SPh' distances of the color histograms of the L component, the a component and the b component of all the pixel points in the CIELAB color space from the color histograms of the L component, the a component and the b component of all the pixel points in the background area in the CIELAB color space,represents SPh' distance between the color histogram of the H component of the HVS color space of all the pixel points in the background region and the color histogram of the H component of the HVS color space of all the pixel points in the background region,represents SPh' distance between the color histogram of S component of HVS color space of all pixel points in the background region and the color histogram of S component of HVS color space of all pixel points in the background region,represents SPh' distance between the statistical histogram of LBP features of all pixels in the background region and the statistical histogram of LBP features of all pixels in the background region,represents SPhThe distance between the parallax statistical histograms of all the pixel points in the background region and the parallax statistical histograms of all the pixel points in the background region, where the background region refers to the regions located at the leftmost side, the rightmost side, the uppermost side, and the lowermost side; r is to betestH-th area SP of (1)h' the contrast feature vector, the generic feature vector and the background prior feature vector are arranged in order to form RtestH-th area SP of (1)h' the feature vector for reflecting visual saliency,is marked as Ftest,h,Wherein, Ftest,hHas a dimension of 105; then, training a model f (D) according to the optimal random forest regression obtained in the training stageinp) Will Ftest,hObtaining R as an input vector of an optimal random forest regression training modeltestOf each region of (a), RtestH-th area SP of (1)h' the three-dimensional visual saliency value is denoted as S3D,h,S3D,h=f(Ftest,h) (ii) a Then R is puttestAs the saliency value of all pixel points in the corresponding region, e.g. for RtestH-th area SP of (1)h', will S3D,hAs SPhSignificant values of all pixels in `, thus obtaining RtestIs marked as { S3D(x, y) }, wherein (x, y) here denotes StestX is more than or equal to 1 and less than or equal to W, y is more than or equal to 1 and less than or equal to H, and W and H correspondingly represent StestWidth and height of (S)testIs the same as the width of the stereo image selected in the training stage, StestIs the same as the height of the stereo image selected in the training stage, S3D(x, y) represents S3DAnd the coordinate position in the (x, y) is the pixel value of the pixel point of (x, y).
In this embodiment, RiThe obtaining process of the frequency response characteristic vector of each pixel point comprises the following steps:
1) -1, using a Gabor filter pair RiFiltering to obtain RiUnder different central frequencies and different direction factors, each pixel point in the signal is subjected to frequency response amplitude RiThe frequency response amplitude of the pixel point with the middle coordinate position (x, y) under the condition that the center frequency is omega and the direction factor is theta is marked as G (x, y; omega, theta), wherein (x, y) represents { L }i,Ri,diI is more than or equal to 1 and less than or equal to N, x is more than or equal to 1 and less than or equal to W, y is more than or equal to 1 and less than or equal to H, and W and H correspondingly represent { L ≦ N }i,Ri,diI 1 ≦ i ≦ N }, ω represents the center frequency of the Gabor filter, ω ∈ Φωθ represents the directional factor of the Gabor filter, θ ∈ Φθ,ΦωRepresents the set of all center frequencies of the Gabor filter, phi in this embodimentω={1.74,2.47,3.49,4.93,6.98},ΦθRepresents the set of all the directional factors of the Gabor filter, phi in this embodimentθ={0°,90°,180°,270°}。
1) -2, reacting RiThe frequency response amplitude of each pixel point in the R-array is arranged in sequence under different central frequencies and different direction factors to form RiR, the frequency response feature vector of each pixel point iniThe frequency response characteristic vector of the pixel point with the middle coordinate position (x, y) is marked as fi(x,y),fi(x, y) is a vector consisting of G (x, y; 1.74,0 °), G (x, y; 2.47,0 °), G (x, y; 3.49,0 °), G (x, y; 4.93,0 °), G (x, y; 6.98,0 °), G (x, y; 1.74,90 °), G (x, y; 2.47,90 °), G (x, y; 3.49,90 °), G (x, y; 4.93,90 °), G (x, y; 6.98,90 °), G (x, y; 1.74,180 °), G (x, y; 2.47,180 °), G (x, y; 3.49,180 °), G (x, y; 4.93,180 °), G (x, y; 6.98,180 °), G (x, y °, 1.74,270 °), G (x, y; 2.47,270, G (x, y; 3.49,270; 4.93,270, y; 6.98,270, 26 °, 6.98,270:, wherein the vector is arranged in that orderiThe dimension of (x, y) is 20.
In this embodiment, RiThe obtaining process of the color feature vector of each pixel point comprises the following steps:
2) -1, calculating RiThe color value of each pixel point in different color spaces is RiThe color values of the R component, the G component and the B component of the pixel point with the (x, y) middle coordinate position in the RGB color space are respectively recorded as R (x, y), G (x, y) and B (x, y), and R (x, y) is recordediThe color values of the L component, the a component and the b component of the pixel point with the middle coordinate position (x, y) in the CIELAB color space are respectively recorded as L (x, y), a (x, y) and b (x, y), and R is used for indicating the color values of the L component, the a component and the b component in the CIELAB color space as L (x, y), a (x, y) and b (iH component and V component of pixel point with (x, y) as middle coordinate position in HVS color spaceThe color values of the quantity and S component are denoted as H (x, y), V (x, y), and S (x, y), respectively, where (x, y) here denotes { L }i,Ri,diI is more than or equal to 1 and less than or equal to N, x is more than or equal to 1 and less than or equal to W, y is more than or equal to 1 and less than or equal to H, and W and H correspondingly represent { L ≦ N }i,Ri,diI is not less than 1 and not more than N.
2) -2, reacting RiThe color values of each pixel point in the different color spaces are arranged in sequence to form RiThe color feature vector of each pixel point in (1), RiThe color feature vector of the pixel point with the middle coordinate position (x, y) is marked as ci(x,y),ci(x,y)=[R(x,y),G(x,y),B(x,y),L(x,y),a(x,y),b(x,y),H(x,y),V(x,y),S(x,y)]Wherein c isiThe dimension of (x, y) is 9, where the symbol "[ 2 ],"]"is a vector representing a symbol.
The method of the invention is used for extracting three-dimensional saliency maps of five stereo images, namely Image1, Image2, Image3, Image4 and Image5 in a three-dimensional human eye tracking database (3Deye-tracking database) provided by southern French university. FIG. 2a shows a right viewpoint Image of "Image 1", FIG. 2b shows a real eye diagram of a right viewpoint Image of "Image 1", and FIG. 2c shows a three-dimensional saliency map of "Image 1"; FIG. 3a shows a right viewpoint Image of "Image 2", FIG. 3b shows a real eye diagram of a right viewpoint Image of "Image 2", and FIG. 3c shows a three-dimensional saliency map of "Image 2"; FIG. 4a shows a right viewpoint Image of "Image 3", FIG. 4b shows a real eye diagram of a right viewpoint Image of "Image 3", and FIG. 4c shows a three-dimensional saliency map of "Image 3"; FIG. 5a shows a right viewpoint Image of "Image 4", FIG. 5b shows a real eye diagram of a right viewpoint Image of "Image 4", and FIG. 5c shows a three-dimensional saliency map of "Image 4"; FIG. 6a shows a right viewpoint Image of "Image 5", FIG. 6b shows a real eye diagram of a right viewpoint Image of "Image 5", and FIG. 6c shows a three-dimensional saliency map of "Image 5"; FIG. 7a shows a right viewpoint Image of "Image 6", FIG. 7b shows a real eye diagram of a right viewpoint Image of "Image 6", and FIG. 7c shows a three-dimensional saliency map of "Image 6"; fig. 8a shows a right viewpoint Image of "Image 7", fig. 8b shows a real eye diagram of a right viewpoint Image of "Image 7", and fig. 8c shows a three-dimensional saliency map of "Image 7". As can be seen from fig. 2a to 8c, the three-dimensional saliency map obtained by the method of the present invention can well conform to the features of the saliency semantics due to the consideration of the contrast feature, the general feature, and the background prior feature.