CN107657585A - High magnification super-resolution method based on double transform domains - Google Patents

High magnification super-resolution method based on double transform domains Download PDF

Info

Publication number
CN107657585A
CN107657585A CN201710764485.5A CN201710764485A CN107657585A CN 107657585 A CN107657585 A CN 107657585A CN 201710764485 A CN201710764485 A CN 201710764485A CN 107657585 A CN107657585 A CN 107657585A
Authority
CN
China
Prior art keywords
resolution
super
alpha
magnification
block
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710764485.5A
Other languages
Chinese (zh)
Other versions
CN107657585B (en
Inventor
姜中玉
杨敬钰
岳焕景
侯永宏
侯春萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201710764485.5A priority Critical patent/CN107657585B/en
Publication of CN107657585A publication Critical patent/CN107657585A/en
Application granted granted Critical
Publication of CN107657585B publication Critical patent/CN107657585B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/40Scaling of whole images or parts thereof, e.g. expanding or contracting
    • G06T3/4053Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Image Processing (AREA)

Abstract

The invention belongs to computer vision field, to improve the qualitatively and quantitatively result of depth map high magnification super-resolution on the basis of double transform domains.Therefore, the technical solution adopted by the present invention is, the depth map super-resolution method of the multilayer cascade based on double transform domains, step is:One powerful problem is become the cascade problem of multistage small multiplying power, for the small multiplying power super-resolution problem of every one-level, be modeled using formula (1): arg min x F ( α ) = arg min x E D ( x , y ) + λ 1 E T R ( x , z ) + λ 2 E S R ( x , z ) = arg min x | | y - H x | | 2 2 + λ 1 Σ i | | α i - Σ q ∈ Ω i ζ i , q α q | | 1 + λ 2 Σ ( m , l ) ∈ C | | P m , l ( x - S 1 l S 2 m x ) | | 1 P m , l = 1 Θ P m , l Q P m , l C P m , l D P m , l T - - - ( 1 ) Solution formula (1) obtains the result of the super-resolution of final small multiplying power.Present invention is mainly applied to computer vision to handle occasion.

Description

High-magnification super-resolution method based on double transformation domains
Technical Field
The invention belongs to the field of computer vision. The invention makes innovative improvement to the problem that the result of the latest level can be obtained for the super-resolution with small magnification based on the regularization terms of the frequency domain and the spatial domain, but the satisfactory result can not be obtained for the super-resolution with high magnification.
Background
Depth information has been widely used in the fields of virtual reality, augmented reality, and 3D reconstruction. Despite the rapid development of depth cameras in recent years, the resolution of depth maps obtained by current mainstream depth sensors (such as Kinect and TOF) is still much lower than that of color maps. Therefore, the effective depth map super-resolution method has practical significance. At present, a plurality of excellent methods obtain better results on the problem of small-magnification super-resolution of the depth map, but the problem of high-magnification super-resolution is still troublesome.
The super-resolution problem is usually changed into a proper problem by introducing a regular term, and the existing super-resolution methods can be divided into three categories according to the difference of the regular term: a space domain regular term method, a transform domain regular term method and a double transform domain regular term method.
The central idea behind the transform domain class approach is to design a dictionary that can provide efficient representation while being suitable for a particular application. The main advantage of this type of approach is that the extracted depth features can be compactly represented by a small number of basis functions or dictionary atoms. High-order associations between chunks may be efficiently utilized. A disadvantage of this type of approach is the lack of pixel-level adaptability.
The spatial domain method can make up the problem of the lack of adaptability of the transform domain method, and the method mainly utilizes the strong connection of color depth image pairs. Among them, total variation (total variation) reconstruction models are widely used because they can describe sparse gradients well.
The dual transform domain class approach combines the advantages of both the transform domain class and the spatial domain class approaches. Much recent work has shown that the use of the dual transform domain approach is a trend. However, although the dual transform domain method can obtain excellent results on the small-magnification super-resolution problem, the high-magnification reconstruction result is still unsatisfactory. The invention provides a method for improving the super-resolution performance of high magnification by utilizing a multi-level cascade method on the basis of double transform domains.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention aims to improve the quantitative and qualitative results of the high-magnification super-resolution of the depth map on the basis of the double transform domains. Therefore, the technical scheme adopted by the invention is that the double transform domain-based multi-layer cascaded depth map super-resolution method comprises the following steps:
1) Changing a high-magnification problem into a multi-level small-magnification cascade problem, and modeling the small-magnification super-resolution problem of each level by using an equation (1):
wherein, let F (alpha) = E D (x,y)+λ 1 E TR (x,z)+λ 2 E SR (x,z),x∈R n ,y∈R m ,z∈R n Vectorized recovered depth map, vectorized low resolution depth map and vectorized high resolution color map, E, respectively D (x, y) is a data item, E TR (x, z) is a transform domain regularization term, E SR (x, z) is a spatial regularization term, λ 1 And λ 2 Is the weight parameter for balancing each item, min represents the minimum value,represents the value of the variable x when the following equation is minimized; h denotes the synthesis operator of the blur and sample, α i Representing a block x i On-site dictionaryThe coefficients of (a) and (b) are,representing a block x i Dictionary, x, corresponding to the class of block in which it is located i Is an image block with the center located at i in the depth map x; alpha is alpha q Is a block x i Q-th similar block x of (1) i,q Corresponding sparse coefficient, q belongs to a set omega composed of similar blocks i ,ζ i,q Are the respective weighted weights; nucleus P m,l The four contained terms are spatial terms respectivelyColour itemDepth termAnd the structure tensor term Andrepresenting shifting the image x by l and m pixels in the horizontal and vertical directions, respectively, C being a selected set of (m, l), in different (m, l) combinationsThe total variation operator representing different directions, e, represents belonging. I | · | live through 2 Is a 2 Norm, | · | luminance 1 Is a 1 Norm, Σ, represents a summation operator; Θ is the normalization factor; item of structure tensorComprises the following steps:
r m,l (u)=ψ{|cos(v m,l ,v 1 (u))|},(m,l)∈C (2)
where max (-) is a function of the maximum value,is an exponential function with the base of a constant e, ψ {. Is a sorting function in descending order, and returns a sorted numerical result r m,l (u), cos (. Cndot.) is a cosine function, v m,l Is the total variation operatorDirection of (v) 1 (u) the first bit of the structure tensor at pixel uThe eigenvector, | · | represents an absolute value. a, b and c are constants;
and solving the formula (1) to obtain a final small-magnification super-resolution result.
The specific construction method of the super-resolution method of multi-layer cascade, which changes the problem of high magnification into the problem of cascade of multi-level and small magnification, is as follows: constructing an L-layer cascade network, wherein the sampling multiplying power of each layer is 2, namely, each layer only solves the problem of 2 times of up-sampling, and the output of the k-1 layer is set asFirstly, 2 times of upsampling is carried out on the bicubicThis up-sampled depth map is used as input for the k-th layer, i.e.For super-resolution magnifications that cannot be expressed as powers of 2, up-sampling is performed at each layer using fractional magnifications.
The concrete solving steps for the formula (1) are as follows:
31 The equation (1) is transformed using the weighted least squares algorithm IRLS:
whereinW m,l Is a diagonal element ofA diagonal matrix of (a);
matrix ofWhere I is the unit array. Matrix ofWhere 0 is a vector with all zero elements, denotes all alpha i Is connected in series with the dictionaryA multiplication and addition operation of the series phi is specificallyR i Representing the extraction of a block x from a graph x i The linear operator of (a) is determined,representing a block x i Dictionary corresponding to the class of block in which it is located, α i Representing a block x i In a dictionaryA coefficient of M representing the number of blocks extracted from the depth map, (. C) -1 Is an inversion operation, (.) T Is a transposition operation in which the first and second operations are performed,is an open root number operation;
32 Solving the optimization equation:
the main idea of the PG algorithm is to make one function Q (α, ρ) iteratively approach the original problem F (α) near the point ρ, and then minimize the original problem F (α) is replaced by minimize Q (α, ρ), defining functions h (α) and g (α) as follows:
g(α)=∑ i ||α ii || 1 (4)
then the function Q (α, ρ) is defined according to the PG algorithm as follows:
whereinDenotes the gradient of h (-), L f Is a constant greater than the spectral norm of the hessian matrix of h (alpha),<·,·&gt represents an inner product operator, and the coefficient alpha is obtained by the (l + 1) th iteration (l+1) The updating is as follows:
wherein
ρ (l) Is ρ, Φ at the l-th iteration T Denotes the transposition of phi, B T Which represents the transpose of B and,a multiply-add operation after the representation is concatenated is finally obtained
Where soft (-) is a soft threshold function,d 0 1, beta is all beta i A series connection of (l) Is alpha at the time of the l iteration, and updated alpha is obtained (l+1) Then, the first +1 iteration is obtainedUpdated depth map x (l+1)
The invention has the technical characteristics and effects that:
the method is a troublesome problem aiming at the high-magnification super-resolution of the depth camera, and recovers the depth map after high-quality high-magnification super-resolution by introducing a weight item capable of reflecting a space geometric structure and applying a multi-layer cascade structure. The invention has the following characteristics:
1. and a structure tensor is introduced into a space domain regular term, so that the gradient characteristic of rapid change in the depth map is better described.
2. A multi-layer cascade method is provided to solve the problem of high-magnification super-resolution.
3. The sub-problems are solved by using algorithms such as PG, IRLS and the like, and the advantages of the existing algorithms are integrated.
Drawings
Fig. 1 is a structural block diagram corresponding to equation (6), namely a structural block diagram corresponding to handling a small-magnification super-resolution (or per-level super-resolution) problem.
FIG. 2 is the inputs and outputs of the present invention: the input is (a) a low resolution depth map, (b) a high resolution color map, and the output is (c) a high resolution depth map.
Fig. 3 is a schematic diagram of a super-resolution method of multi-layer cascade.
Fig. 4 is a comparison of the results of the respective methods for 16 times super resolution. (first, second row is the correlation results for reinneer, left to right graphs are (a) bicubic interpolation, RMSE:8.78 (b) GF, RMSE:6.40 (c) SDF, RMSE:5.55 (d) AE, RMSE:6.42 (e) TGV, RMSE:8.99 (f) NCSR, RMSE:5.20 (g) JID, RMSE:5.13 (h) our results without cascade, RMSE:3.81 (i) our results with cascade, RMSE:3.12 (j) GT.
Detailed Description
1) Changing a high-magnification problem into a multi-level small-magnification cascade problem, and modeling the small-magnification super-resolution problem of each level by using an equation (1):
wherein let F (alpha) = E D (x,y)+λ 1 E TR (x,z)+λ 2 E SR (x,z),x∈R n ,y∈R m ,z∈R n Respectively vectorized recovered depth map, vectorized low resolution depth map and vectorized high resolution color map, E D (x, y) is a data item, E TR (x, z) is a transform domain regularization term, E SR (x, z) is a spatial regularization term, λ 1 And λ 2 Is the weight parameter for balancing each item, min represents the minimum value,represents the value of the variable x when the following equation is minimized; h denotes the synthesis operator of the blur and sample, α i Representing a block x i In a dictionaryThe coefficients of (a) and (b) are,representing a block x i Dictionary, x, corresponding to the class of block in which it is located i Is an image block with the center located at i in the depth map x; alpha is alpha q Is a block x i Q-th similar block x of i,q Corresponding sparse coefficient, q belongs to a set omega composed of similar blocks i ,ζ i,q Are the corresponding weighted weights; nucleus P m,l The four contained terms are spatial terms respectivelyColour itemDepth termAnd the structure tensor termWherein the structural tensor terms are the weight terms proposed by the invention;andrepresenting shifting the image x by l and m pixels in the horizontal and vertical directions, respectively, C being a selected set of (m, l), in different (m, l) combinationsThe total variation operator representing different directions, e, represents belonging. I | · | purple wind 2 Is a 1 2 Norm, | · | luminance 1 Is a 1 Norm, sigma, represents the sum operator; Θ is the normalization factor;
since the ordinary spatial distance point by point does not reflect the image structure. A better method for reflecting the spatial structure is to distinguish the total variation operators in different directions, namely, different weights are given to different operators according to the distances between the tangential direction of the current pixel and the direction of the variation operator. Therefore, the present invention is based on spatial weighting P m,l Further introduces a structural tension termItem of structure tensorComprises the following steps:
r m,l (u)=ψ{|cos(v m,l ,v 1 (u))|},(m,l)∈C (2)
where max (-) is a function of the maximum value,is an exponential function with the base of a constant e, ψ {. Is a sorting function in descending order, and returns a sorted numerical result r m,l (u), cos (. Cndot.) is a cosine function, v m,l Is the total variation operatorDirection of (v) 1 (u) represents the first eigenvector of the structure tensor at pixel u, | · | represents the absolute value. a, b and c are constants;
and solving the formula (1) to obtain a final small-magnification super-resolution result.
2) Step 1) of the present invention describes the method for solving the small-magnification super-resolution under the double transform domain, and although the method has excellent performance on the small-magnification problem, the reconstruction result of the high-magnification is still not satisfactory. Since for high-power super-resolution, e.g. 8,16 times, it is very difficult to get details of the high-resolution map directly from the low-resolution depth map, the main problem is that the original high-resolution map from the low-resolution map is too smooth and contains little detail information. In order to solve the problem, the invention provides a multi-layer cascaded super-resolution method, which is qualitatively understood that the problem of high magnification is changed into the problem of multi-stage small magnification cascade, and the small magnification super-resolution problem of each stage can be modeled by using an equation (1).
The specific construction method of the super-resolution method of multi-layer cascade, which changes the problem of high magnification into the problem of cascade of multi-level and small magnification, is as follows: constructing a cascade network of L layers, wherein the sampling multiplying power of each layer is 2, namely each layer only solvesTo solve the problem of 2 times up-sampling, the output of the k-1 layer is set asFirstly, the conventional bicubic method is adopted to carry out 2 times of upsampling on the bicubicThis up-sampled depth map is used as input for the k-th layer, i.e.For super-resolution magnifications that cannot be expressed as powers of 2, up-sampling can be performed at each layer using fractional magnifications.
3) We solve equation (1) using the near-end gradient (PG) algorithm. For the super-resolution problem with small magnification, the result obtained by directly solving the formula (1) is the super-resolution result, and for the super-resolution problem with high magnification, the method of the step 2) is adopted to convert the problem into a multi-stage super-resolution problem with small magnification, and each stage needs to solve the formula (1).
The concrete solving steps for the formula (1) are as follows:
31 The original problem (equation (1)) is transformed using the weighted least squares algorithm IRLS:
whereinW m,l Is a diagonal element ofThe diagonal matrix of (a).
Matrix arrayWhere I is the unit array. Matrix arrayWhere 0 is a vector with all zero elements, denotes all alpha i Is connected in series with the dictionaryA multiplication and addition operation of the series phi is specificallyR i Representing the extraction of a block x from a graph x i The linear operator of (a) is determined,representing a block x i Dictionary corresponding to the class of block in which it is located, α i Representing a block x i On-site dictionaryA coefficient, M represents the number of blocks extracted from the depth map, (-) 1 Is an inversion operation (·) T Is a transposition operation in which the phase of the input signal is inverted,is an open root number operation;
32 Solving the optimization equation:
the main idea of the PG algorithm is to make one function Q (α, ρ) iteratively approach the original problem F (α) near the point ρ, and then minimizing the original problem F (α) can be replaced by minimizing Q (α, ρ), defining functions h (α) and g (α) as follows:
g(α)=∑ i ||α ii || 1 (4)
then the function Q (α, ρ) can be defined according to the PG algorithm as follows:
whereinDenotes the gradient of h (-), L f Is a constant greater than the spectral norm of the hessian matrix of h (alpha),<·,·&gt, representing inner product operator, coefficient alpha obtained by the (l + 1) th iteration (l+1) The updating is as follows:
wherein
ρ (l) Is ρ, Φ at the l-th iteration T Denotes the transposition of phi, B T The transpose of B is represented by,a multiply-add operation after the representation is concatenated is finally obtained
Where soft (-) is a soft threshold function,d 0 =1, β is all β i A series connection of (l) Is alpha at the time of the l iteration, and updated alpha is obtained (l+1) Then, the l +1 th iteration update can be further obtainedDepth map x of (l+1)
The invention is described in detail below with reference to the figures and examples.
1) A high-magnification problem is changed into a multi-level small-magnification cascade problem, modeling is carried out on the small-magnification super-resolution problem of each level by using an equation (1), and a corresponding structural block diagram is shown in figure 1:
wherein let F (alpha) = E D (x,y)+λ 1 E TR (x,z)+λ 2 E SR (x,z),x∈R n ,y∈R m ,z∈R n The vectorised restored depth map (as in fig. 2 (c)), the vectorised low resolution depth map (fig. 2 (a)) and the vectorised high resolution color map (fig. 2 (b)), E respectively D (x, y) are data items, E TR (x, z) is a transform domain regularization term, E SR (x, z) is a spatial regularization term, λ 1 And λ 2 Is the weight parameter for balancing each item, min represents the minimum value,represents the value of the variable x when the following equation is minimized; h denotes the synthesis operator of the blur and sample, alpha i Representing a block x i In a dictionaryThe coefficients of (a) and (b) are,representing a block x i Dictionary, x, corresponding to the class of block in which it is located i Is an image block centered at i in the depth map x; alpha is alpha q Is a block x i Q-th similar block x of (1) i,q Corresponding sparse coefficient, q belongs to a set omega composed of similar blocks i ,ζ i,q Are the corresponding weighted weights; nucleus P m,l The four contained terms are spatial terms respectivelyColour itemDepth termAnd the structure tensor termWherein, the structural tensor term is a weight term provided by the invention;andrepresenting the movement of the image x by l and m pixels in the horizontal and vertical directions, respectively, C being a selected set of (m, l), different (m, l) combinationsRepresents the total variation operator in different directions, and epsilon represents belonging. I | · | purple wind 2 Is a 2 Norm, | | · | non-conducting phosphor 1 Is a 1 Norm, Σ, represents a summation operator; Θ is the normalization factor;
since the ordinary spatial distance point by point does not reflect the image structure. A better method for reflecting the spatial domain structure is to distinguish the total variation operators to be treated in different directions, namely, different weights are given to different operators according to the distance between the tangential direction of the current pixel and the direction of the variation operator. Therefore, the present invention is based on spatial weighting P m,l Further introduces a structural tension termItem of structure tensorComprises the following steps:
r m,l (u)=ψ{|cos(v m,l ,v 1 (u))|},(m,l)∈C (2)
where max (-) is a function of the maximum value,is an exponential function with the base of a constant e, ψ {. Is a sorting function in descending order, and returns a sorted numerical result r m,l (u), cos (. Cndot.) is a cosine function, v m,l Is total variation operatorDirection of (v) 1 (u) represents the first eigenvector of the structure tensor at pixel u, | · | indicates the absolute value. a, b and c are constants;
and solving the formula (1) to obtain a final small-magnification super-resolution result.
2) Step 1) of the present invention describes the method for solving the small-magnification super-resolution under the double-transform domain, and although this method has an excellent performance on the small-magnification problem, the reconstruction result of the high-magnification still cannot be satisfied. Since for high-power super-resolution, e.g. 8,16 times, it is very difficult to get details of the high-resolution map directly from the low-resolution depth map, the main problem is that the original high-resolution map from the low-resolution map is too smooth and contains little detail information. In order to solve the problem, the invention provides a multi-layer cascade super-resolution method (fig. 3), which qualitatively understands that a high-magnification problem is changed into a multi-stage small-magnification cascade problem, and the small-magnification super-resolution problem of each stage can be modeled by using an equation (1).
The specific construction method of the super-resolution method of multi-layer cascade, which changes the problem of high magnification into the problem of cascade of multi-level and small magnification, is as follows: constructing a cascade network of L layers, wherein the multiplying power of the up-sampling of each layer is 2, namely, each layer only solves the problem of up-sampling of 2 times, and the output of the k-1 layer is set asFirstly, the conventional bicubic method is adopted to carry out 2 times of upsampling on the bicubicThis up-sampled depth map is used as input for the k-th layer, i.e.For super-resolution magnification that cannot be expressed as a power of 2, up-sampling can be performed using fractional magnification at each layer.
Because each layer of the multilayer cascade method only needs to process a 2-time small-magnification up-sampling problem, each layer can obtain a relatively ideal reconstruction result. As shown in fig. 4 and corresponding RMSE values, it is shown that the performance of high-magnification super-resolution can be effectively improved by using the multilayer cascade method.
3) We solve equation (1) using the near-end gradient (PG) algorithm. For the super-resolution problem with small magnification, the result obtained by directly solving the formula (1) is the result of the super-resolution, and for the super-resolution problem with high magnification, the method in the step 2) is adopted to convert the problem into a multi-stage super-resolution problem with small magnification, and the formula (1) needs to be solved for each stage.
The concrete solving steps for the formula (1) are as follows:
31 The original problem (equation (1)) is transformed using the reweighted least squares algorithm IRLS algorithm:
whereinW m,l Is a diagonal element ofThe diagonal matrix of (a).
Matrix ofWhere I is the unit array. Matrix arrayWhere 0 is a vector with all zero elements, denotes all alpha i Is connected in series with the dictionaryA multiplication and addition operation of the series phi is specificallyR i Representing the extraction of a block x from a graph x i The linear operator of (a) is determined,representing a block x i Dictionary corresponding to the class of block in which it is located, α i Representing a block x i On-site dictionaryThe coefficient of the following, M represents the number of blocks extracted from the depth map, (. Cndot.) -1 Is an inversion operation (·) T Is a transposition operation in which the phase of the input signal is inverted,is an open root number operation;
32 Solving the optimization equation:
the main idea of the PG algorithm is to make one function Q (α, ρ) iteratively approach the original problem F (α) near the point ρ, and then minimizing the original problem F (α) can be replaced by minimizing Q (α, ρ), defining functions h (α) and g (α) as follows:
g(α)=∑ i ||α ii || 1 (4)
then the function Q (α, ρ) can be defined according to the PG algorithm as follows:
whereinDenotes the gradient of h (-), L f Is a constant greater than the spectral norm of the hessian matrix of h (alpha),<·,·&gt, representing inner product operator, coefficient alpha obtained by the (l + 1) th iteration (l+1) The updating is as follows:
wherein
ρ (l) Is ρ, Φ at the l-th iteration T Denotes the transposition of phi, B T The transpose of B is represented by,representing stringsA subsequent multiply-add operation is performed to obtain the final result
Where soft (-) is a soft threshold function,d 0 =1, β is all β i A series connection of (l) Is alpha at the time of the l iteration, and updated alpha is obtained (l+1) Then, the depth map x updated by the (l + 1) th iteration can be obtained (l+1)

Claims (3)

1. A multilayer cascade depth map super-resolution method based on double transform domains is characterized by comprising the following steps: changing a high-magnification problem into a multi-level small-magnification cascade problem, and modeling the small-magnification super-resolution problem of each level by using an equation (1):
wherein, let F (alpha) = E D (x,y)+λ 1 E TR (x,z)+λ 2 E SR (x,z),x∈R n ,y∈R m ,z∈R n Vectorized recovered depth map, vectorized low resolution depth map and vectorized high resolution color map, E, respectively D (x, y) are data items, E TR (x, z) is a transform domain regularization term, E SR (x, z) is a spatial regularization term,λ 1 And λ 2 Is the weight parameter for balancing each item, min represents the minimum value,represents the value of the variable x when the following equation is minimized; h denotes the synthesis operator of the blur and sample, alpha i Representing a block x i In a dictionaryThe coefficients of (a) and (b) are,representing a block x i Dictionary, x, corresponding to the class of block in which it is located i Is an image block centered at i in the depth map x; alpha is alpha q Is a block x i Q-th similar block x of i,q Corresponding sparse coefficient, q belongs to a set omega composed of similar blocks i ,ζ i,q Are the corresponding weighted weights; nucleus P m,l The four contained terms are respectively spatial domain termsColour itemDepth termAnd the structure tensor term Andrepresenting shifting the image x by l and m pixels in the horizontal and vertical directions, respectively, C being a selected set of (m, l) s, under different (m, l) combinationsIs/are as followsRepresents the total variation operator in different directions, and epsilon represents belonging. I | · | purple wind 2 Is a 1 2 Norm, | · | luminance 1 Is a 1 Norm, Σ, represents a summation operator; Θ is the normalization factor; item of structure tensorComprises the following steps:
r m,l (u)=ψ{|cos(v m,l ,v 1 (u))|},(m,l)∈C (2)
where max (-) is a function of the maximum value,is an exponential function based on a constant e, ψ {. Is a sorting function sorted in descending order, and returns a sorted numerical result r m,l (u), cos (. Cndot.) is a cosine function, v m,l Is total variation operatorDirection of (v) 1 (u) represents the first eigenvector of the structure tensor at pixel u, |, represents the absolute value. a, b and c are constants;
and solving the formula (1) to obtain a final small-magnification super-resolution result.
2. The method for super-resolution of depth maps based on multi-layer cascade of double transform domains as claimed in claim 1, wherein a high-magnification problem is changed into a multi-level small-magnification cascade problem, namely a super-resolution method of multi-layer cascade, and the specific construction method is as follows: constructing an L-layer cascade network, wherein the sampling multiplying power of each layer is 2, namely, each layer only solves the problem of 2 times of upsampling, and the first layer is providedThe output of the k-1 layer isFirstly, 2 times of upsampling is carried out on the bicubicThis up-sampled depth map is used as input for the k-th layer, i.e.For super-resolution magnification that cannot be expressed as a power of 2, upsampling is performed using fractional magnification at each layer.
3. The method for super-resolution of the depth map based on multi-layer cascade of the double transform domains as claimed in claim 1, wherein the specific solving step for equation (6) is: the concrete solving steps of the formula (1) are as follows:
31 ) the formula (1) is transformed using the weighted least squares algorithm IRLS:
whereinW m,l Is a diagonal element ofA diagonal matrix of (a);
matrix arrayWhere I is the unit array. Matrix arrayWhere 0 is a vector with all zero elements, denotes all alpha i Is connected in series with the dictionaryA multiplication and addition operation of the series phi is specificallyR i Representing the extraction of a block x from a graph x i The linear operator of (a) is used,representing a block x i Dictionary corresponding to the class of block in which it is located, alpha i Representing a block x i In a dictionaryA coefficient of M representing the number of blocks extracted from the depth map, (. C) -1 Is an inversion operation, (.) T Is a transposition operation in which the phase of the input signal is inverted,is an open root number operation;
32 Solving an optimization equation: the main idea of the PG algorithm is to make one function Q (α, ρ) iteratively approach the original problem F (α) near the point ρ, and then the minimized original problem F (α) is replaced by the minimized Q (α, ρ), defining functions h (α) and g (α) as follows:
g(α)=∑ i ||α ii || 1 (4)
then the function Q (α, ρ) is defined according to the PG algorithm as follows:
whereinDenotes the gradient of h (-), L f Is a constant greater than the spectral norm of the hessian matrix of h (alpha),<·,·&gt, representing inner product operator, coefficient alpha obtained by the (l + 1) th iteration (l+1) The updating is as follows:
wherein:
ρ (l) is ρ, Φ at the l-th iteration T Denotes the transposition of phi, B T Which represents the transpose of B and,a multiply-add operation after the representation is concatenated is finally obtained
Where soft (-) is a soft threshold function,d 0 =1, β is all β i Series connection of (a) (l) Is alpha at the time of the l iteration, and updated alpha is obtained (l+1) Then, a depth map x updated by the (l + 1) th iteration is further obtained (l+1)
CN201710764485.5A 2017-08-30 2017-08-30 High-magnification super-resolution method based on double transformation domains Expired - Fee Related CN107657585B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710764485.5A CN107657585B (en) 2017-08-30 2017-08-30 High-magnification super-resolution method based on double transformation domains

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710764485.5A CN107657585B (en) 2017-08-30 2017-08-30 High-magnification super-resolution method based on double transformation domains

Publications (2)

Publication Number Publication Date
CN107657585A true CN107657585A (en) 2018-02-02
CN107657585B CN107657585B (en) 2021-02-05

Family

ID=61128810

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710764485.5A Expired - Fee Related CN107657585B (en) 2017-08-30 2017-08-30 High-magnification super-resolution method based on double transformation domains

Country Status (1)

Country Link
CN (1) CN107657585B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111932464A (en) * 2020-09-18 2020-11-13 北京百度网讯科技有限公司 Super-resolution model using and training method, device, equipment and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102902966A (en) * 2012-10-12 2013-01-30 大连理工大学 Super-resolution face recognition method based on deep belief networks
US20160267349A1 (en) * 2015-03-11 2016-09-15 Microsoft Technology Licensing, Llc Methods and systems for generating enhanced images using multi-frame processing
CN106204449A (en) * 2016-07-06 2016-12-07 安徽工业大学 A kind of single image super resolution ratio reconstruction method based on symmetrical degree of depth network
WO2016193393A1 (en) * 2015-06-05 2016-12-08 Université Du Luxembourg Real-time temporal filtering and super-resolution of depth image sequences
CN106251329A (en) * 2016-07-14 2016-12-21 天津大学 Depth map super-resolution methods based on double transform domains

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102902966A (en) * 2012-10-12 2013-01-30 大连理工大学 Super-resolution face recognition method based on deep belief networks
US20160267349A1 (en) * 2015-03-11 2016-09-15 Microsoft Technology Licensing, Llc Methods and systems for generating enhanced images using multi-frame processing
WO2016193393A1 (en) * 2015-06-05 2016-12-08 Université Du Luxembourg Real-time temporal filtering and super-resolution of depth image sequences
CN106204449A (en) * 2016-07-06 2016-12-07 安徽工业大学 A kind of single image super resolution ratio reconstruction method based on symmetrical degree of depth network
CN106251329A (en) * 2016-07-14 2016-12-21 天津大学 Depth map super-resolution methods based on double transform domains

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
GHOLAMREZA ANBARJAFARI等: "Image Super Resolution Based on Interpolation of Wavelet Domain High FrequencySubbands and the Spatial Domain Input Image", 《ETRI JOURNAL》 *
周圆等: "基于非局部稀疏表示的立体图像的超分辨率重建", 《天津大学学报(自然科学与工程技术版)》 *
谭勇: "遥感目标图像空间分辨率增强技术研究", 《中国博士学位论文全文数据库信息科技辑》 *
黎媛: "基于超分辨率的多视角混合分辨率视频描述的关键技术研究", 《中国优秀硕士论文全文数据库信息科技辑》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111932464A (en) * 2020-09-18 2020-11-13 北京百度网讯科技有限公司 Super-resolution model using and training method, device, equipment and medium
CN111932464B (en) * 2020-09-18 2024-03-29 北京百度网讯科技有限公司 Super-resolution model using and training method, device, equipment and medium

Also Published As

Publication number Publication date
CN107657585B (en) 2021-02-05

Similar Documents

Publication Publication Date Title
Shi et al. Super-resolution reconstruction of MR image with a novel residual learning network algorithm
CN111242999B (en) Parallax estimation optimization method based on up-sampling and accurate re-matching
CN104657962A (en) Image super-resolution reconstruction method based on cascading linear regression
Yan Ptychographic phase retrieval by proximal algorithms
Aghamaleki et al. Image fusion using dual tree discrete wavelet transform and weights optimization
Zhang et al. Multi-scale image super-resolution via a single extendable deep network
CN112906756A (en) High-image classification method and system for cross-channel quantity transfer learning
Young et al. Gaussian lifting for fast bilateral and nonlocal means filtering
Altinel et al. Deep structured energy-based image inpainting
CN113962262B (en) Continuous learning-based intelligent radar signal sorting method
Li et al. Single image super-resolution reconstruction based on fusion of internal and external features
Yang et al. NasmamSR: a fast image super-resolution network based on neural architecture search and multiple attention mechanism
CN107657585B (en) High-magnification super-resolution method based on double transformation domains
Ammanouil et al. A parallel and automatically tuned algorithm for multispectral image deconvolution
Tang et al. Lightweight network with one-shot aggregation for image super-resolution
Raut et al. Image compression using convolutional autoencoder
Shen et al. ASDN: A deep convolutional network for arbitrary scale image super-resolution
Liu et al. Hyperspectral image super-resolution employing nonlocal block and hybrid multiscale three-dimensional convolution
Zhang et al. A new strategy for estimating photometric redshifts of quasars
Zandavi Post-trained convolution networks for single image super-resolution
Valli Bhasha et al. Image super resolution model enabled by wavelet lifting with optimized deep convolutional neural network
Yang et al. A hybrid sampling and gradient attention network for compressed image sensing
Wang et al. AN RDN-based image super-resolution method using Meta-learning
Li et al. Multi-scale cross-fusion for arbitrary scale image super resolution
Yang et al. Bi-path network coupling for single image super-resolution

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20210205