CN111833252A - Image Super-Resolution Method Based on SAE Dictionary Learning and Neighborhood Regression - Google Patents
- Publication number
- CN111833252A (application CN202010670836.8A)
- Authority
- CN
- China
- Prior art keywords
- dictionary
- image
- sae
- input data
- dictionary learning
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/4053—Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2135—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on approximation criteria, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/28—Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Image Analysis (AREA)
Abstract
The present invention relates to an image super-resolution method based on SAE dictionary learning and neighborhood regression. First, input data are prepared for the dictionary-learning model SAE, and the dictionary is constructed and trained; next, a projection matrix is solved by combining neighborhood-regression theory with the dictionary; finally, the image is reconstructed based on the projection matrix to obtain a high-resolution image. On the one hand, the present invention improves the feature-expression ability of the dictionary and reduces the dependence of the reconstruction result on the dictionary; on the other hand, it incorporates neighborhood-regression theory to increase reconstruction speed.
Description
Technical Field
The present invention relates to the field of image super-resolution method design, and in particular to an image super-resolution method based on SAE dictionary learning and neighborhood regression.
Background
In practice, factors such as the limitations of image-acquisition equipment, scene changes, and lighting often prevent high-quality images from being obtained, and when an image's resolution is low it cannot meet the requirements of practical applications. Image super-resolution (SR) methods use image-signal-processing techniques to reconstruct one or more existing low-resolution (LR) images into a high-resolution (HR) image; the key is to add certain additional information during reconstruction to compensate for the detail lost in the image-degradation process. Because SR reconstruction can break through the inherent resolution limit of the imaging device and raise the image resolution, it has important application value in fields such as remote sensing, medicine, and video surveillance.
At present, SR methods fall mainly into three types: interpolation-based, reconstruction-based, and learning-based. Learning-based SR has been a hot research direction in recent years, and dictionary-learning-based SR, first proposed by Yang et al., is the most popular learning-based approach. That algorithm is based on compressed-sensing theory and uses joint dictionary learning to learn an HR/LR dictionary pair: the LR image patches and the LR dictionary are first used to obtain the LR sparse coefficients, and then, under the assumption that HR and LR image patches share the same sparse-representation coefficients, the LR sparse coefficients and the HR dictionary are used to reconstruct the HR image patches. The algorithm can acquire sufficient prior knowledge and achieves good subjective visual quality, but its blocking artifacts are noticeable, its reconstruction result depends heavily on the learned dictionary, and reconstruction is time-consuming.
Summary of the Invention
In view of this, the purpose of the present invention is to propose an image super-resolution method based on SAE dictionary learning and neighborhood regression, which on the one hand improves the feature-expression ability of the dictionary and reduces the dependence of the reconstruction result on the dictionary, and on the other hand incorporates neighborhood-regression theory to increase reconstruction speed.
The present invention is realized by the following scheme: an image super-resolution method based on SAE dictionary learning and neighborhood regression, comprising the following steps:
preparing input data for the dictionary-learning model SAE, and constructing and training the dictionary;
solving a projection matrix by combining neighborhood-regression theory with the dictionary;
performing image reconstruction based on the projection matrix to obtain a high-resolution image.
Further, preparing the input data for the dictionary-learning model SAE specifically comprises:
downsampling the HR image sample Ih to obtain the LR image Il, and upsampling the LR image to obtain the intermediate image Im;
HR input-data preparation: subtracting the intermediate image Im from the HR image Ih to obtain the difference image Id, then partitioning the difference image Id into blocks and normalizing them as the HR input data, denoted Sh, where m is the number of HR input samples;
LR input-data preparation: filtering the intermediate image Im to obtain a filtered image, then normalizing and partitioning the filtered image into blocks and reducing the dimensionality of the filtered image blocks, denoted Sl, where n is the number of LR input samples;
expressing the input data of the dictionary-learning model SAE as S = [Sh, Sl].
Further, during LR input-data preparation, principal component analysis (PCA) is used to reduce the dimensionality of the filtered image blocks.
Further, constructing and training the dictionary specifically comprises:
in light of the needs of dictionary learning, replacing the mean squared error in the SAE cost function with the mean absolute error, yielding an improved SAE dictionary-learning model;
inputting S = [Sh, Sl], learning with the improved SAE dictionary-learning model to obtain the weights W1 between the input layer and the hidden layer, and converting the weights into the HR/LR dictionary pair {Dh, Dl}.
Further, replacing the mean squared error with the mean absolute error in the SAE cost function to obtain the improved SAE dictionary-learning model specifically comprises:
Let si ∈ S be the input data and oi ∈ O the output data. The improved SAE dictionary model is

J(θ) = J_MAE(θ) + J_weight(θ) + J_sparse(θ) = (1/(m+n)) Σ_{i=1}^{m+n} |si − oi| + (λ/2) Σ_l Σ_{i=1}^{Nl} Σ_{j=1}^{Nl+1} (wji^(l))² + γ Σ_{j=1}^{N2} KL(ρ‖ρ̂j)   (1)

where the first term J_MAE(θ) is the reconstruction-error term, here expressed as the mean absolute error, and m and n are the numbers of HR and LR input samples respectively; the second term J_weight(θ) is the weight-decay term, used to reduce the magnitude of the weights and prevent overfitting, λ is its regulating parameter, wji^(l) denotes the connection weight between node i of layer l−1 and node j of layer l, l is the layer index, Nl is the number of nodes in layer l, and Nl+1 is the number of nodes in layer l+1; the third term J_sparse(θ) is the hidden-layer sparsity regularizer, ρ̂j is the average activation of hidden-layer neuron j, ρ is the preset expected activation, γ is a regulating parameter, and N2 is the number of nodes in layer 2. The divergence KL(ρ‖ρ̂j) is expressed by equation (2):

KL(ρ‖ρ̂j) = ρ log(ρ/ρ̂j) + (1−ρ) log((1−ρ)/(1−ρ̂j))   (2)
Further, inputting S = [Sh, Sl], learning with the improved SAE dictionary-learning model to obtain the weights W1 between the input layer and the hidden layer, and converting the weights into the HR/LR dictionary pair {Dh, Dl} specifically comprises:
During the training of the SAE dictionary-learning model, the parameters are updated by gradient descent, finally yielding the input-to-hidden connection weights W1, where W1 = {wi}, i = 1, 2, ..., m+n. According to the relationship between network weights and the dictionary, the dictionary D is equivalent to the input-to-hidden link weights W1, specifically the HR dictionary Dh = {w1, w2, ..., wm} and the LR dictionary Dl = {wm+1, wm+2, ..., wm+n}, with the dictionary pair written D = (Dh, Dl), where wi ∈ W1 and wi = {w1,i, w2,i, ..., wk,i}, k is the dimensionality of the dictionary, and wk,i is the weight of the i-th dictionary atom in the k-th dimension.
Further, solving the projection matrix by combining neighborhood-regression theory with the dictionary specifically comprises: first, using the nearest-neighbor method to compute, for each atom of the HR/LR dictionary pair {Dh, Dl}, the nearest-neighbor mapping {Nh, Nl} in the sample space S = [Sh, Sl]; then, based on the mapping {Nh, Nl}, solving the projection matrix P by ridge regression.
Further, computing the nearest-neighbor mapping {Nh, Nl} of each atom of the HR/LR dictionary pair {Dh, Dl} in the sample space S = [Sh, Sl] specifically comprises:
Let S = [Sh, Sl] be the training samples and {Dh, Dl} the dictionary pair. The Euclidean distance is used to compute, for each atom dl^q, the set Nl,q of its K nearest-neighbor blocks in the LR dictionary's sample space:

Nl,q = {sl^p ∈ Sl : sl^p ranks among the K smallest values of ‖dl^q − sl^p‖₂²}   (3)

where sl^p denotes the p-th LR training sample in Sl and dl^q denotes the q-th atom of the LR dictionary.
From equation (3), the K LR neighbor image blocks Nl,q corresponding to each LR dictionary atom can be computed; according to the positions of these K neighbor blocks, the corresponding K HR neighbor image blocks Nh,q are obtained from the HR training samples. After traversing all atoms of the LR dictionary Dl, the mapping Nl formed by the nearest-neighbor LR blocks of all atoms, and the corresponding mapping Nh formed by their nearest-neighbor HR blocks, are obtained, finally yielding {Nh, Nl}.
Further, solving the projection matrix P by ridge regression based on the mapping {Nh, Nl} specifically comprises:
Replacing the dictionary Dl with the LR nearest-neighbor mapping Nl, the reconstruction coefficients β are obtained from

β = arg min_β ‖Y − Nl β‖₂² + η ‖β‖₂²   (4)

where β is the coefficient matrix, Y is the low-resolution image to be reconstructed, and η is a weighting coefficient used to alleviate the singularity problem and guarantee the stability of the coefficient decomposition.
Solving equation (4) by ridge regression gives the coefficients β̂ as

β̂ = (Nlᵀ Nl + η I)⁻¹ Nlᵀ Y   (5)

The reconstructed HR image X is then obtained from the mapping Nh and the coefficients β̂:

X = Nh β̂ = Nh (Nlᵀ Nl + η I)⁻¹ Nlᵀ Y   (6)

The factor Nh (Nlᵀ Nl + η I)⁻¹ Nlᵀ in equation (6) is denoted the projection matrix P.
Further, performing image reconstruction based on the projection matrix to obtain the high-resolution image specifically comprises:
First, the image Y to be reconstructed is preprocessed to obtain the LR test feature image Yt = {y1, y2, ..., yi, ..., yn}; then the Euclidean distance is used to find, in the LR dictionary Dl, the nearest-neighbor dictionary atom dk of each LR test feature block yi, i.e. dk = arg min_{d ∈ Dl} ‖yi − d‖₂. The projection matrix Pt corresponding to atom dk is then retrieved, and the HR image block xi corresponding to yi is obtained from the expression xi = Pt yi. All LR test feature blocks are reconstructed in this way and assembled into the HR image X.
Compared with the prior art, the present invention has the following beneficial effects. First, an improved sparse autoencoder is adopted for dictionary learning, making full use of its outstanding feature-learning ability to enhance the feature-expression capability of the dictionary and improve the reconstruction quality of the image. Second, neighborhood-regression theory is incorporated into the dictionary-learning-based super-resolution framework, avoiding the sparse-coding step of the original framework, reducing the amount of computation, and increasing the reconstruction speed.
Brief Description of the Drawings
Fig. 1 shows the SAE input-data preprocessing flow of an embodiment of the present invention.
Fig. 2 is a schematic flowchart of the method of an embodiment of the present invention.
Detailed Description of Embodiments
The present invention is further described below with reference to the accompanying drawings and embodiments.
It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the present application. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
It should also be noted that the terminology used herein is for describing specific embodiments only and is not intended to limit the exemplary embodiments according to the present application. As used herein, unless the context clearly indicates otherwise, singular forms are intended to include plural forms as well; furthermore, it should be understood that when the terms "comprising" and/or "including" are used in this specification, they indicate the presence of features, steps, operations, devices, components, and/or combinations thereof.
As shown in Fig. 2, this embodiment provides an image super-resolution method based on SAE dictionary learning and neighborhood regression, comprising the following steps:
Step S1: prepare input data for the dictionary-learning model SAE, and construct and train the dictionary;
Step S2: solve the projection matrix by combining neighborhood-regression theory with the dictionary;
Step S3: perform image reconstruction based on the projection matrix to obtain a high-resolution image.
In this embodiment, as shown in Fig. 1, preparing the input data for the dictionary-learning model SAE in step S1 specifically comprises:
Step S11: downsample the HR image sample Ih to obtain the LR image Il, and upsample the LR image to obtain the intermediate image Im;
Step S12: HR input-data preparation: subtract the intermediate image Im from the HR image Ih to obtain the difference image Id, then partition the difference image Id into blocks and normalize them as the HR input data, denoted Sh, where m is the number of HR input samples;
Step S13: LR input-data preparation: filter the intermediate image Im to obtain a filtered image, then normalize and partition the filtered image into blocks and reduce the dimensionality of the filtered image blocks, denoted Sl, where n is the number of LR input samples;
Step S14: express the input data of the dictionary-learning model SAE as S = [Sh, Sl].
Here, during LR input-data preparation, principal component analysis (PCA) is used to reduce the dimensionality of the filtered image blocks.
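Purely as an illustration, the preprocessing of steps S11 to S14 can be sketched as follows in Python/NumPy. The patch size, scale factor, PCA dimension, block-average downsampling, pixel-replication upsampling, and horizontal-gradient filter are all assumptions of this sketch; the embodiment does not fix these operators.

```python
import numpy as np

def extract_patches(img, size=6, step=6):
    """Slide a window over img and return flattened patches, one per row."""
    h, w = img.shape
    return np.asarray([img[r:r + size, c:c + size].ravel()
                       for r in range(0, h - size + 1, step)
                       for c in range(0, w - size + 1, step)])

def prepare_sae_input(I_h, scale=3, patch=6, pca_dim=30):
    # S11: downsample Ih by block averaging (a stand-in for the unspecified
    # downsampling operator), then upsample by pixel replication to get Im
    h, w = I_h.shape
    I_l = I_h.reshape(h // scale, scale, w // scale, scale).mean(axis=(1, 3))
    I_m = np.repeat(np.repeat(I_l, scale, axis=0), scale, axis=1)
    # S12: HR input Sh = normalized patches of the difference image Id = Ih - Im
    S_h = extract_patches(I_h - I_m, patch)
    S_h /= np.linalg.norm(S_h, axis=1, keepdims=True) + 1e-8
    # S13: LR input Sl = patches of a filtered Im (horizontal gradient assumed),
    # normalized and reduced with PCA via the SVD of the centered patch matrix
    F = np.gradient(I_m, axis=1)
    S_l = extract_patches(F, patch)
    S_l /= np.linalg.norm(S_l, axis=1, keepdims=True) + 1e-8
    S_l -= S_l.mean(axis=0)
    _, _, Vt = np.linalg.svd(S_l, full_matrices=False)
    S_l = S_l @ Vt[:pca_dim].T        # keep the leading principal components
    # S14: the SAE input is the pair S = [Sh, Sl]
    return S_h, S_l
```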
In this embodiment, constructing and training the dictionary in step S1 specifically comprises:
Step S15: in light of the needs of dictionary learning, replace the mean squared error in the SAE cost function with the mean absolute error, obtaining the improved SAE dictionary-learning model.
Let si ∈ S be the input data and oi ∈ O the output data. The improved SAE dictionary model is

J(θ) = J_MAE(θ) + J_weight(θ) + J_sparse(θ) = (1/(m+n)) Σ_{i=1}^{m+n} |si − oi| + (λ/2) Σ_l Σ_{i=1}^{Nl} Σ_{j=1}^{Nl+1} (wji^(l))² + γ Σ_{j=1}^{N2} KL(ρ‖ρ̂j)   (1)

where the first term J_MAE(θ) is the reconstruction-error term, here expressed as the mean absolute error (MAE), and m and n are the numbers of HR and LR input samples respectively; the second term J_weight(θ) is the weight-decay term, used to reduce the magnitude of the weights and prevent overfitting, λ is its regulating parameter, wji^(l) denotes the connection weight between node i of layer l−1 and node j of layer l, l is the layer index, Nl is the number of nodes in layer l, and Nl+1 is the number of nodes in layer l+1; the third term J_sparse(θ) is the hidden-layer sparsity regularizer, ρ̂j is the average activation of hidden-layer neuron j, ρ is the preset expected activation, whose value is close to 0, γ is the regulating parameter of this term, and N2 is the number of nodes in layer 2. Cases in which ρ̂j deviates significantly from ρ are penalized with the relative entropy, as shown in equation (2):

KL(ρ‖ρ̂j) = ρ log(ρ/ρ̂j) + (1−ρ) log((1−ρ)/(1−ρ̂j))   (2)

Step S16: input S = [Sh, Sl], learn with the improved SAE dictionary-learning model, obtain the weights W1 between the input layer and the hidden layer, and convert the weights into the HR/LR dictionary pair {Dh, Dl}.
During the training of the SAE dictionary-learning model, the parameters are updated by gradient descent, finally yielding the input-to-hidden connection weights W1, where W1 = {wi}, i = 1, 2, ..., m+n. In dictionary learning, the input data can be represented by a dictionary matrix and sparse codes, while in an SAE the input data can be represented by the hidden-layer representation and the learned weights; from the connection between the two, the dictionary D is equivalent to the input-to-hidden link weights W1, specifically the HR dictionary Dh = {w1, w2, ..., wm} and the LR dictionary Dl = {wm+1, wm+2, ..., wm+n}, with the dictionary pair written D = (Dh, Dl), where wi ∈ W1 and wi = {w1,i, w2,i, ..., wk,i}, k is the dimensionality of the dictionary, and wk,i is the weight of the i-th dictionary atom in the k-th dimension.
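A minimal NumPy sketch of the improved SAE cost (the MAE reconstruction term, the weight-decay term, and the KL sparsity term of equation (2)) and of reading the trained weights back as a dictionary pair. The sigmoid activation, the parameter values, and the column-wise split of W1 into Dh and Dl are assumptions of this sketch; the gradient-descent training loop itself is omitted.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def improved_sae_cost(S, W1, b1, W2, b2, lam=1e-4, gamma=0.1, rho=0.05):
    """Improved SAE cost: MAE reconstruction error (replacing the usual MSE),
    L2 weight decay, and the KL sparsity penalty of eq. (2)."""
    A = sigmoid(S @ W1 + b1)                 # hidden-layer activations
    O = sigmoid(A @ W2 + b2)                 # reconstruction of the input
    j_mae = np.mean(np.abs(S - O))           # J_MAE: mean absolute error
    j_weight = 0.5 * lam * (np.sum(W1 ** 2) + np.sum(W2 ** 2))   # J_weight
    rho_hat = A.mean(axis=0)                 # average activation per hidden unit
    kl = (rho * np.log(rho / rho_hat)
          + (1 - rho) * np.log((1 - rho) / (1 - rho_hat)))
    j_sparse = gamma * np.sum(kl)            # J_sparse over the hidden nodes
    return j_mae + j_weight + j_sparse

def weights_to_dictionaries(W1, m):
    """Read the trained input-to-hidden weights as dictionary atoms and split
    them into the HR part (first m atoms) and the LR part (the rest)."""
    return W1[:, :m], W1[:, m:]
```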
In this embodiment, step S2 specifically comprises:
Step S21: use the nearest-neighbor method to compute, for each atom of the HR/LR dictionary pair {Dh, Dl}, the nearest-neighbor mapping {Nh, Nl} in the sample space S = [Sh, Sl]; that is,
Let S = [Sh, Sl] be the training samples and {Dh, Dl} the dictionary pair. The Euclidean distance is used to compute, for each atom dl^q, the set Nl,q of its K nearest-neighbor blocks in the LR dictionary's sample space:

Nl,q = {sl^p ∈ Sl : sl^p ranks among the K smallest values of ‖dl^q − sl^p‖₂²}   (3)

where sl^p denotes the p-th LR training sample in Sl and dl^q denotes the q-th atom of the LR dictionary.
From equation (3), the K LR neighbor image blocks Nl,q corresponding to each LR dictionary atom can be computed; according to the positions of these K neighbor blocks, the corresponding K HR neighbor image blocks Nh,q are obtained from the HR training samples. After traversing all atoms of the LR dictionary Dl, the mapping Nl formed by the nearest-neighbor LR blocks of all atoms, and the corresponding mapping Nh formed by their nearest-neighbor HR blocks, are obtained, finally yielding {Nh, Nl}.
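Step S21 can be sketched as follows, assuming row-stacked training samples, one atom per column of Dl, and an illustrative K:

```python
import numpy as np

def neighbor_mappings(S_h, S_l, D_l, K=5):
    """For each LR dictionary atom, find its K nearest LR training samples by
    Euclidean distance (eq. 3) and take the HR samples at the same positions,
    giving the paired neighborhoods {Nh, Nl}."""
    N_h, N_l = [], []
    for atom in D_l.T:                          # one atom per column of Dl
        dist = np.linalg.norm(S_l - atom, axis=1)
        idx = np.argsort(dist)[:K]              # indices of the K nearest samples
        N_l.append(S_l[idx])                    # LR neighborhood  N_{l,q}
        N_h.append(S_h[idx])                    # matching HR neighborhood N_{h,q}
    return N_h, N_l
```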
Step S22: based on the mapping {Nh, Nl}, solve the projection matrix P by ridge regression; that is,
Replacing the dictionary Dl with the LR nearest-neighbor mapping Nl, the reconstruction coefficients β are obtained from

β = arg min_β ‖Y − Nl β‖₂² + η ‖β‖₂²   (4)

where β is the coefficient matrix, Y is the low-resolution image to be reconstructed, and η is a weighting coefficient used to alleviate the singularity problem and guarantee the stability of the coefficient decomposition.
Solving equation (4) by ridge regression gives the coefficients β̂ as

β̂ = (Nlᵀ Nl + η I)⁻¹ Nlᵀ Y   (5)

The reconstructed HR image X is then obtained from the mapping Nh and the coefficients β̂:

X = Nh β̂ = Nh (Nlᵀ Nl + η I)⁻¹ Nlᵀ Y   (6)

The factor Nh (Nlᵀ Nl + η I)⁻¹ Nlᵀ in equation (6) is denoted the projection matrix P.
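The closed form of equations (4) to (6) is a single regularized least-squares step per atom. A sketch, with each neighborhood stored one patch per column:

```python
import numpy as np

def projection_matrix(N_h, N_l, eta=0.1):
    """P = Nh (Nl^T Nl + eta I)^(-1) Nl^T  (eq. 6); applying P to an LR
    feature y gives the ridge-regression HR estimate x = P y."""
    K = N_l.shape[1]                      # number of neighbors (columns)
    return N_h @ np.linalg.inv(N_l.T @ N_l + eta * np.eye(K)) @ N_l.T
```

Because the inverse is taken of a small K-by-K matrix, each projection matrix is cheap to precompute offline, which is the source of the speed-up over sparse coding at test time.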
In this embodiment, step S3 specifically comprises:
First, the image Y to be reconstructed is preprocessed to obtain the LR test feature image Yt = {y1, y2, ..., yi, ..., yn}; then the Euclidean distance is used to find, in the LR dictionary Dl, the nearest-neighbor dictionary atom dk of each LR test feature block yi, i.e. dk = arg min_{d ∈ Dl} ‖yi − d‖₂. The projection matrix Pt corresponding to atom dk is then retrieved, and the HR image block xi corresponding to yi is obtained from the expression xi = Pt yi. All LR test feature blocks are reconstructed in this way and assembled into the HR image X.
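The reconstruction loop of step S3, sketched under the same illustrative conventions (rows of Yt are LR test features, Dl holds one atom per column, and P_list[k] is the projection matrix precomputed for atom k):

```python
import numpy as np

def reconstruct_features(Y_t, D_l, P_list):
    """For each LR test feature yi, select the nearest dictionary atom dk by
    Euclidean distance and apply its precomputed projection: xi = P_k yi."""
    X = []
    for y in Y_t:
        k = int(np.argmin(np.linalg.norm(D_l.T - y, axis=1)))
        X.append(P_list[k] @ y)
    return np.asarray(X)
```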
Next, this embodiment is further illustrated with the following simulation experiments.
The simulation tool used in this embodiment is MATLAB, and the evaluation indices are the peak signal-to-noise ratio (PSNR) and the structural similarity (SSIM): the larger the PSNR and the closer the SSIM is to 1, the better the super-resolution result.
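For reference, PSNR (the first evaluation index) has the standard closed form below; the peak value of 255 assumes 8-bit images:

```python
import numpy as np

def psnr(reference, reconstructed, peak=255.0):
    """Peak signal-to-noise ratio in dB: 10*log10(peak^2 / MSE).
    Larger values indicate a reconstruction closer to the HR reference."""
    mse = np.mean((np.asarray(reference, dtype=np.float64)
                   - np.asarray(reconstructed, dtype=np.float64)) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(peak ** 2 / mse)
```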
The simulation experiments are set up as follows.
Data preparation: to guarantee the objectivity of the experiments, dictionary learning uses 91 common standard HR training images, and the test images come from the standard test sets Set5 and Set14. To evaluate the quality of the reconstructed images quantitatively, these test images serve as HR reference images, and the LR images to be processed are obtained by downsampling.
Comparison algorithms: five SR algorithms are compared: Bicubic, L1SR (Super Resolution with L1 Regression), SISR (Single Image Super Resolution), ANR (Anchored Neighborhood Regression), and SRCNN (Super Resolution using Convolution Neural Network).
Key parameter settings: the sampling factor is 3, and the number of nodes in the SAE hidden layer is 1024.
The simulation experiments are divided into two groups, as follows.
第1组实验:与不同SR方法比较。Group 1 experiments: Comparison with different SR methods.
表1列出了不同SR算法得到的重建图像对应的PSNR和SSIM,其中最后一栏标出的数值表示在相应的评价指标下本实施例对应算法的性能最优,10幅图像分别来自Set5和Set14。由表1可见,本实施例方法得到的PSNR和SSIM值大体上都是最优的,表明其重建效果更优。Table 1 lists the PSNR and SSIM corresponding to the reconstructed images obtained by different SR algorithms. The values marked in the last column indicate that the performance of the corresponding algorithm in this embodiment is the best under the corresponding evaluation indicators. The 10 images are from Set5 and Set14. It can be seen from Table 1 that the PSNR and SSIM values obtained by the method in this embodiment are generally optimal, indicating that the reconstruction effect is better.
Table 1 Comparison of PSNR (dB) and SSIM values of different SR methods
Group 2: comparison of reconstruction speed.
In this group of experiments, the different algorithms are run on the same hardware and in the same environment to verify that incorporating the neighborhood-regression idea into a dictionary-learning-based SR algorithm improves reconstruction speed. Table 2 lists the average reconstruction time over the Set5 and Set14 test sets under the different SR algorithms. It can be seen that the present invention is significantly faster than the other SR algorithms.
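The per-image timings in Table 2 amount to running each method over a test set on identical hardware and averaging the wall-clock time. A minimal harness (function and variable names are hypothetical, for illustration only) might look like:

```python
import time

def average_reconstruction_time(sr_method, lr_images):
    """Average wall-clock reconstruction time per image, as reported in Table 2."""
    start = time.perf_counter()
    for lr in lr_images:
        sr_method(lr)  # reconstruct one HR image from one LR input
    return (time.perf_counter() - start) / len(lr_images)
```

Running every method through the same harness on the same image list is what makes the per-algorithm averages in Table 2 directly comparable.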
Table 2 Comparison of the average reconstruction time (s) of different SR methods
As will be appreciated by those skilled in the art, the embodiments of the present application may be provided as a method, a system, or a computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, and optical storage) containing computer-usable program code.
The present application is described with reference to flowcharts and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the present application. It will be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks therein, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, special-purpose computer, embedded processor, or other programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device create means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising instruction means that implement the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, such that the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
The above are only preferred embodiments of the present invention and are not intended to limit the present invention in other forms; any person skilled in the art may use the technical content disclosed above to make changes or modifications resulting in equivalent embodiments. However, any simple modification, equivalent change, or adaptation made to the above embodiments according to the technical essence of the present invention, without departing from the content of the technical solution of the present invention, still falls within the protection scope of the technical solution of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010670836.8A CN111833252B (en) | 2020-07-13 | 2020-07-13 | Image super-resolution method based on SAE dictionary learning and neighborhood regression |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111833252A true CN111833252A (en) | 2020-10-27 |
CN111833252B CN111833252B (en) | 2022-06-14 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109146785A (en) * | 2018-08-02 | 2019-01-04 | 华侨大学 | A kind of image super-resolution method based on the sparse autocoder of improvement |
CN109712074A (en) * | 2018-12-20 | 2019-05-03 | 黑龙江大学 | Remote sensing image super-resolution reconstruction method based on two-parameter beta process joint dictionary |
CN110097503A (en) * | 2019-04-12 | 2019-08-06 | 浙江师范大学 | The super-resolution method returned based on neighborhood |
EP3591608A1 (en) * | 2018-07-06 | 2020-01-08 | Tata Consultancy Services Limited | Method and system for image reconstruction using deep dictionary learning (ddl) |
Non-Patent Citations (2)
Title |
---|
Zhan Shu et al., "Image super-resolution reconstruction based on improved dictionary learning in a coupled feature space", Acta Electronica Sinica * |
Huang Detian et al., "Application of regularization techniques and low-rank matrices in sparse-representation super-resolution algorithms", Journal of Computer-Aided Design & Computer Graphics * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||