CN114139618A - Signal dependent noise parameter estimation method based on improved density peak clustering

Info

Publication number: CN114139618A
Application number: CN202111406097.2A
Authority: CN
Inventors: 吴俣倩; 张钰
Original assignee: Hangzhou Dianzi University
Current assignee: Hangzhou Dianzi University
Priority date: 2021-11-24
Filing date: 2021-11-24
Publication date: 2022-03-04
Anticipated expiration: 2041-11-24
Also published as: CN114139618B

Abstract

The invention discloses a signal-dependent noise parameter estimation method based on improved density peak clustering. Samples are extracted from a noisy image with a sliding window, and the mean, entropy and gradient of each sample are calculated as feature data. The feature data are then fed into a clustering algorithm that separates weak texture samples from strong texture samples. During clustering, the concept of relative density is introduced: a comparison range is delimited for each data point from its distances to the surrounding data, the relative density of the data point within that range is calculated, and the data points with high relative density are selected as cluster centers. This solves the problem that the traditional DPC algorithm tends to overlook the centers of sparse clusters when clustering data sets of non-uniform density, which degrades clustering accuracy. Based on the clustering result, pixel level estimation and noise estimation are performed on the samples labeled as weak texture, and finally the pixel value-noise variance estimation pairs are fitted by the least squares method to obtain the noise parameter estimates of the original image, completing the preparation for denoising.

Description

Signal dependent noise parameter estimation method based on improved density peak clustering

Technical Field

The invention belongs to the technical field of image noise signal processing, and particularly relates to a signal dependent noise parameter estimation method based on improved density peak clustering.

Background

Images are an important carrier for recording and transmitting information in modern society, but they are inevitably affected by noise during acquisition, storage and transmission, which reduces image quality. The noise level is an important parameter for image processing algorithms such as image denoising, image compression and image stitching, so accurately estimating the noise level is of great significance for image denoising. With the development of CMOS image sensors, some manufacturers embed a noise reduction module directly in the image sensor chip to reduce the influence of noise. Such a module can effectively suppress signal-independent noise, so the signal-dependent noise component is becoming the main noise source of CMOS image sensors. At present, most methods for estimating signal-dependent noise are based on weak texture image blocks: weak texture image blocks are selected, a pair consisting of a pixel value and a noise variance is estimated for each block, and finally the noise parameters are fitted. The difficulty lies in selecting the weak texture image blocks, which may be obtained using a clustering algorithm.

Clustering is the process of grouping objects by similarity, with highly similar objects placed in the same group. Cluster analysis, a main topic of unsupervised learning, has long been an important component of machine learning, pattern recognition, data mining and related fields. Many clustering algorithms now exist; they can be roughly divided into partition-based, hierarchical, density-based, model-based and grid-based methods.

In 2014, Rodriguez and Laio proposed in Science a density peak clustering (DPC) algorithm based on density and distance, which quickly finds density peaks, determines the cluster centers from them, and assigns the remaining points to the corresponding clusters. The DPC algorithm requires few parameters and is non-iterative, and it can detect clusters of arbitrary shape in large data sets at low computational cost. However, the existing density peak clustering algorithm tends to select cluster centers in dense regions and often ignores clusters in sparse regions. For a data set with large density differences, the DPC algorithm often treats the points of a sparse region as outliers or wrongly assigns them to adjacent dense clusters, so the centers of sparse clusters are overlooked and the clustering result becomes inaccurate. Some existing DPC-based improvements introduce K nearest neighbors (KNN) to improve clustering, but the parameter k is set subjectively from past experience, and its value often strongly affects the clustering result and hence the clustering accuracy.

Disclosure of Invention

To address the defects of the prior art, the invention provides a signal-dependent noise parameter estimation method based on improved density peak clustering. A definition of relative density is introduced and the cluster centers are determined from the distribution of the image features, so that no parameter values need to be set manually, the clustering accuracy is improved, and the noise parameters are estimated accurately.

The signal dependent noise parameter estimation method based on the improved density peak clustering specifically comprises the following steps:

Step one, extracting a plurality of samples of the same size from an original image containing noise by using a sliding window. Calculating the mean, entropy and gradient of each sample as the feature data of the sample. Forming a feature data set from the feature data of all the samples, and then normalizing it.
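Step one can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `extract_features` and its parameters `win`, `stride` and `levels` are hypothetical, the entropy is taken as the Shannon entropy (base 2) of the gray-level histogram, the gradient as the mean gradient magnitude from `np.gradient`, and the normalization as per-feature min-max scaling. The exact gradient and normalization formulas are not reproduced in this text, so those choices are assumptions.

```python
import numpy as np

def extract_features(image, win=16, stride=1, levels=256):
    """Return an (n, 3) array of normalized [mean, entropy, gradient] features,
    one row per sliding-window sample (all names here are illustrative)."""
    img = np.asarray(image, dtype=np.float64)
    rows, cols = img.shape
    feats = []
    for top in range(0, rows - win + 1, stride):
        for left in range(0, cols - win + 1, stride):
            patch = img[top:top + win, left:left + win]
            # Shannon entropy of the gray-level histogram of the sample.
            gray = np.clip(np.rint(patch), 0, levels - 1).astype(np.int64)
            p = np.bincount(gray.ravel(), minlength=levels) / gray.size
            p = p[p > 0]
            entropy = -np.sum(p * np.log2(p))
            # ASSUMPTION: gradient feature = mean gradient magnitude.
            gy, gx = np.gradient(patch)
            gradient = np.mean(np.hypot(gx, gy))
            feats.append((patch.mean(), entropy, gradient))
    feats = np.array(feats)
    # ASSUMPTION: min-max normalization of each feature column to [0, 1].
    span = feats.max(axis=0) - feats.min(axis=0)
    span[span == 0] = 1.0  # avoid division by zero for constant features
    return (feats - feats.min(axis=0)) / span
```

With a stride of one pixel the number of samples grows with the image area, so a larger `stride` may be convenient when experimenting.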

Step two, calculating the following parameters for the feature data set X normalized in step one:

① Euclidean distance d_ij between feature data x_i and other feature data x_j:

d_ij = ||x_i - x_j||

② Local density ρ_i of feature data x_i:

③ Distance δ_i from feature data x_i to the nearest feature data of higher density:

δ_i = min_{j: ρ_j > ρ_i} d_ij

④ Distance θ_i from feature data x_i to the nearest feature data of lower density:

θ_i = min_{j: ρ_j < ρ_i} d_ij

Wherein x_i, x_j ∈ X, and d_c is the cutoff distance, taken as the value at the 1% to 2% position after the distances between all feature data are arranged in ascending order.
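The quantities of step two can be sketched as below. The function name `dpc_quantities` and the parameter `percent` are hypothetical. Two points are assumptions rather than statements of the source: the local density uses a Gaussian kernel with cutoff d_c (the original DPC paper also offers a hard cutoff count, and the formula is not reproduced in this text), and the point of globally highest density receives the largest distance in its row as δ_i, following the usual DPC convention. The rule θ_i = δ_i for the point of lowest density follows the detailed description.

```python
import numpy as np

def dpc_quantities(X, percent=2.0):
    """Return (d, d_c, rho, delta, theta) for a feature matrix X of shape (n, k)."""
    X = np.asarray(X, dtype=np.float64)
    n = len(X)
    # Pairwise Euclidean distances d_ij = ||x_i - x_j||.
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    # Cutoff distance: value at the `percent` % position of the sorted distances.
    pair = np.sort(d[np.triu_indices(n, k=1)])
    idx = min(len(pair) - 1, int(round(len(pair) * percent / 100.0)))
    d_c = pair[idx]
    # ASSUMPTION: Gaussian-kernel local density (self term removed).
    rho = np.exp(-(d / d_c) ** 2).sum(axis=1) - 1.0
    delta = np.empty(n)
    theta = np.empty(n)
    for i in range(n):
        higher = d[i, rho > rho[i]]
        lower = d[i, rho < rho[i]]
        # ASSUMPTION: the densest point takes the largest distance in its row.
        delta[i] = higher.min() if higher.size else d[i].max()
        # Lowest-density point: theta_i = delta_i, as in the detailed description.
        theta[i] = lower.min() if lower.size else delta[i]
    return d, d_c, rho, delta, theta
```

The sketch stores the full n×n distance matrix and assumes the feature vectors are distinct, so that d_c is nonzero; it is meant for small experiments only.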

Step three, determining the relative density comparison range η_i of feature data x_i according to the results of step two, and calculating its relative density ρ_i':

η_i = max(θ_i, δ_i)

Wherein N_i is the number of feature data within the relative density comparison range η_i.

Each feature data x_j within the comparison range is weighted: the closer x_j is to feature data x_i, the greater its contribution to the relative density of x_i and the larger its weight. The relative density of feature data x_i is defined as the ratio of its local density to the cumulative average of the local densities of the surrounding feature data.
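A rough sketch of step three follows. The relative density formula and the weight definition are not reproduced in this text, so the weighting below is purely an assumption chosen to match the verbal description: each neighbor x_j within distance η_i of x_i gets a weight exp(-d_ij / η_i), the weights are normalized to sum to one, and ρ_i' is the local density of x_i divided by that weighted average. The function name `relative_density` is hypothetical, and the inputs are the arrays d, ρ, δ and θ from step two.

```python
import numpy as np

def relative_density(d, rho, delta, theta):
    """Return (eta, rho_rel) from pairwise distances d and the step-two arrays."""
    d = np.asarray(d, dtype=np.float64)
    rho = np.asarray(rho, dtype=np.float64)
    n = len(rho)
    # Comparison range eta_i = max(theta_i, delta_i), as given in the text.
    eta = np.maximum(theta, delta)
    rho_rel = np.empty(n)
    for i in range(n):
        # Neighbors inside the comparison range, excluding x_i itself.
        nbr = np.flatnonzero((d[i] <= eta[i]) & (np.arange(n) != i))
        # ASSUMED weight: closer neighbors count more; normalized to sum to 1.
        w = np.exp(-d[i, nbr] / eta[i])
        w = w / w.sum()
        avg = np.sum(w * rho[nbr])
        # Ratio of own density to the weighted average of surrounding densities.
        rho_rel[i] = rho[i] / avg if avg > 0 else rho[i]
    return eta, rho_rel
```

Because η_i is always the distance to some other point, the neighbor set is non-empty whenever the data set has at least two distinct points.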

Step four, replacing the local density ρ_i with the relative density ρ_i' of feature data x_i obtained in step three, and recalculating the distance δ_i from feature data x_i to the nearest feature data of higher density. Then calculating the cluster value γ_i of feature data x_i:

γ_i = ρ_i'·δ_i

Sorting the calculated cluster values γ_i in descending order, selecting the two feature data with the highest cluster values as cluster centers, assigning each remaining feature data to the cluster whose center is nearer, and outputting the cluster label corresponding to each feature data to complete the clustering process.
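Step four might look like the following in code. The function name `cluster_two_centers` is hypothetical, and the relative densities are passed in as an array. As in the usual DPC convention (an assumption here), the point of highest relative density takes the largest distance in its row as δ_i. Assignment to the nearer of the two centers follows the wording of the step.

```python
import numpy as np

def cluster_two_centers(X, rho_rel):
    """Pick the two points with the largest gamma = rho' * delta as centers
    and label every point by its nearer center. Returns (centers, labels, gamma)."""
    X = np.asarray(X, dtype=np.float64)
    rho_rel = np.asarray(rho_rel, dtype=np.float64)
    n = len(X)
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    # Recompute delta using the relative density instead of the local density.
    delta = np.empty(n)
    for i in range(n):
        higher = d[i, rho_rel > rho_rel[i]]
        # ASSUMPTION: the densest point takes the largest distance in its row.
        delta[i] = higher.min() if higher.size else d[i].max()
    gamma = rho_rel * delta
    # Two largest cluster values become the cluster centers.
    centers = np.argsort(-gamma)[:2]
    # Each point joins the cluster whose center is nearer.
    labels = np.argmin(d[:, centers], axis=1)
    return centers, labels, gamma
```

The labels 0 and 1 only identify the two clusters; deciding which one holds the weak texture samples (for example, the cluster with the lower average gradient) is left to the caller and is not specified by this sketch.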

Step five, calculating the estimated pixel level and the noise variance of each weak texture sample according to the clustering result of step four, for wk = 1～B_weak_n:

Where N is the side length of the sample, wk denotes the index of the weak texture sample with wk ∈ [1, B_weak_n], B_weak_n is the number of weak texture samples, and I_wk(m, n) represents the pixel value in the mth row and nth column of weak texture sample I_wk.

The minimum-variance direction vector of the weak texture samples is related to the minimum eigenvalue of the covariance matrix C_P.
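One way to picture step five is the sketch below. The formulas for the pixel level and the noise variance are not reproduced in this text, so everything here is an assumption: the pixel level is taken as the mean of the N×N sample, and the noise variance as the smallest eigenvalue of the covariance matrix of small overlapping sub-patches, a common PCA-based noise estimate. The function name `estimate_pair` and the sub-patch size `sub` are hypothetical.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def estimate_pair(sample, sub=4):
    """Return (pixel_level, noise_variance) for one weak texture sample."""
    s = np.asarray(sample, dtype=np.float64)
    # ASSUMPTION: pixel level = average of all N x N pixel values I_wk(m, n).
    level = float(s.mean())
    # Vectorize every overlapping sub x sub patch of the sample.
    patches = sliding_window_view(s, (sub, sub)).reshape(-1, sub * sub)
    # Covariance matrix of the patch vectors (one variable per patch pixel).
    cov = np.cov(patches, rowvar=False)
    # ASSUMPTION: noise variance = minimum eigenvalue of the covariance matrix.
    var = float(np.linalg.eigvalsh(cov)[0])  # eigvalsh sorts ascending
    return level, max(var, 0.0)
```

With only a few hundred sub-patches per 16×16 sample, the smallest eigenvalue tends to underestimate the true variance, so this sketch illustrates the idea rather than a calibrated estimator.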

Step six, modeling the noise in the original image with a Poisson-Gaussian noise model, under which the total noise variance σ²(p, q) at position (p, q) in the original image is:

σ²(p,q) = ax(p,q) + b

where x(p, q) represents the noise-free pixel value at position (p, q) in the original image, and a and b are the noise parameters. Fitting the pixel value-noise variance estimation pairs of the weak texture samples I_wk by the least squares method to obtain the estimates of the noise parameters a and b of the original image.
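The least squares fit of step six can be illustrated with an ordinary linear regression of the estimated variances on the estimated pixel levels. The function name `fit_noise_parameters` is hypothetical, and the sketch assumes an unweighted fit, since the text does not mention weights.

```python
import numpy as np

def fit_noise_parameters(levels, variances):
    """Fit variance = a * level + b by ordinary least squares; return (a, b)."""
    x = np.asarray(levels, dtype=np.float64)
    y = np.asarray(variances, dtype=np.float64)
    # Design matrix with one column for the slope a and one for the offset b.
    A = np.column_stack([x, np.ones_like(x)])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return float(coef[0]), float(coef[1])

# Synthetic pairs generated from a = 0.5, b = 4 (illustrative values only).
a_hat, b_hat = fit_noise_parameters([10, 50, 100, 200], [9.0, 29.0, 54.0, 104.0])
```

In practice the pairs would come from the weak texture samples, one (pixel level, noise variance) pair per sample.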

The invention has the following beneficial effects:

1. The feature data of the signal-dependent noise of a CMOS image sensor form a data set with non-uniform density. In clustering the samples, the method defines a relative density that is calculated from the distribution of the data set, so that the center point of a sparse cluster can also obtain a large density value. This prevents the density values of a sparse cluster from always being lower than those of a dense cluster in a data set with large density differences, and prevents the sparse cluster from being treated as outliers or wrongly assigned to other high-density clusters.

2. The comparison ranges are determined from the distribution of the data set without introducing subjectively chosen parameter values, so different data sets adaptively obtain different relative density comparison ranges, and the influence of human judgment on the experimental results is avoided.

3. Introducing the relative density improves the accuracy of the clustering result, so weak texture regions are selected more accurately; the noise of the whole image is then estimated with higher accuracy, which benefits subsequent image denoising.

Detailed Description

The invention is further explained below with reference to the accompanying drawings. The original image used in this embodiment is an image captured by a CMOS image sensor, and the noise in the original image is the signal-dependent noise component that the CMOS image sensor, with its noise reduction module, produces during capture.

Step one, using a sliding window of size 16×16 that moves one pixel at a time, from top to bottom and from left to right, to extract n samples of size 16×16 from the original image containing noise. Calculating the mean, entropy and gradient of each sample as the feature data of the sample:

where gray represents a gray value and p(gray) represents the probability that the gray value gray occurs; W and H are the width and height of the sample, and p_w,h is the pixel value at position (w, h) in the sample.

That is, the feature data x_i' of each sample has three attributes, namely mean, entropy and gradient: x_i' = {x_i1, x_i2, x_i3}. The feature data of all samples form a feature data set, which is then normalized.

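Step two, calculating the following parameters for the normalized feature data set X:

① Euclidean distance d_ij between feature data x_i and other feature data x_j:

d_ij = ||x_i - x_j||

② Local density ρ_i of feature data x_i:

③ Distance δ_i from feature data x_i to the nearest feature data of higher density:

δ_i = min_{j: ρ_j > ρ_i} d_ij

④ Distance θ_i from feature data x_i to the nearest feature data of lower density:

θ_i = min_{j: ρ_j < ρ_i} d_ij

Wherein x_i, x_j ∈ X, and d_c is the cutoff distance, taken as the value at the 1% to 2% position after the distances between all feature data are arranged in ascending order. When the density of feature data x_i is the global lowest, θ_i = δ_i.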

Step three, determining the relative density comparison range η_i of feature data x_i according to the results of step two, and calculating its relative density ρ_i':

η_i = max(θ_i, δ_i)

When η_i = δ_i, feature data x_i is surrounded by feature data of lower density than itself and is more likely to become a center point.

When η_i = θ_i, feature data x_i is surrounded by feature data of higher density than itself and is less likely to become a center point.

Each feature data x_j within the comparison range is weighted: the closer x_j is to feature data x_i, the greater its contribution to the relative density of x_i and the larger its weight. The relative density of feature data x_i is defined as the ratio of its local density to the cumulative average of the local densities of the surrounding feature data. The density value calculated for each point is therefore a relative value, measured against the densities of the surrounding points, rather than an absolute value. This takes the local structure of each cluster into account, so that even the center of a sparse cluster can obtain a large density value and the sparse cluster is identified more reliably.

Step four, replacing the local density ρ_i with the relative density ρ_i' of feature data x_i obtained in step three, and recalculating the distance δ_i from feature data x_i to the nearest feature data of higher density. The traditional DPC algorithm requires drawing a decision graph from ρ and δ and then manually selecting points with large ρ and δ as cluster centers. This selection is highly subjective, and in some cases the cluster centers cannot easily be picked out by eye; a poor choice propagates through the subsequent assignment step like a domino effect and degrades the final clustering result. Therefore the product of the calculated relative density and distance is used as the cluster value γ_i of feature data x_i:

γ_i = ρ_i'·δ_i

Sorting the calculated cluster values γ_i in descending order, selecting the two feature data with the highest cluster values as cluster centers, assigning each remaining feature data to the cluster whose center is nearer, and outputting the cluster label corresponding to each feature data. The labels divide the samples into weak texture samples and strong texture samples; once the clustering process is complete, the weak texture samples are identified.

Step five, calculating the estimated pixel level and the noise variance of each weak texture sample according to the clustering result of step four, for wk = 1～B_weak_n:

Where N is the side length of each sample (N = 16 in this embodiment), wk denotes the index of the weak texture sample with wk ∈ [1, B_weak_n], B_weak_n is the number of weak texture samples, and I_wk(m, n) represents the pixel value in the mth row and nth column of weak texture sample I_wk.

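Step six, modeling the noise in the original image with a Poisson-Gaussian noise model, under which the total noise variance σ²(p, q) at position (p, q) in the original image is:

σ²(p,q) = ax(p,q) + b

Fitting the pixel value-noise variance estimation pairs of the weak texture samples by the least squares method to obtain the estimates of the noise parameters a and b of the original image.

After the noise estimate of the image is determined, the magnitude of the noise component in the image is known and can be used for the subsequent removal of the signal-dependent noise of the CMOS image sensor.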

Claims

1. A signal-dependent noise parameter estimation method based on improved density peak clustering, characterized in that the method specifically comprises the following steps:

step one, extracting a plurality of samples of the same size from an original image containing noise by using a sliding window; calculating the mean, entropy and gradient of each sample as the feature data of the sample; normalizing the feature data of all samples to form a feature data set X;

step two, calculating the following parameters for the feature data set X:

① Euclidean distance d_ij between feature data x_i and other feature data x_j:

d_ij = ||x_i - x_j|| (1)

② Local density ρ_i of feature data x_i:

③ Distance δ_i from feature data x_i to the nearest feature data of higher density:

δ_i = min_{j: ρ_j > ρ_i} d_ij (3)

④ Distance θ_i from feature data x_i to the nearest feature data of lower density:

θ_i = min_{j: ρ_j < ρ_i} d_ij (4)

Wherein x_i, x_j ∈ X, and d_c is the cutoff distance, taken as the value at the 1% to 2% position after the distances between all feature data are arranged in ascending order;

step three, determining the relative density comparison range η_i of feature data x_i according to the results of step two, and calculating its relative density ρ_i':

η_i = max(θ_i, δ_i) (5)

Wherein N_i is the number of feature data within the relative density comparison range η_i, and each feature data x_j within the comparison range is assigned a weight;

step four, replacing the local density ρ_i with the relative density ρ_i' of feature data x_i obtained in step three, and recalculating, according to equation (3), the distance δ_i from feature data x_i to the nearest feature data of higher density; then calculating the cluster value γ_i of feature data x_i:

γ_i = ρ_i'·δ_i (7)

sorting the calculated cluster values γ_i in descending order, selecting the two feature data with the highest cluster values as cluster centers for clustering, and then outputting the cluster label corresponding to each feature data to complete the clustering process;

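step five, calculating the estimated pixel level and the noise variance of each weak texture sample according to the clustering result of step four:

Where N is the side length of the sample, wk denotes the index of the weak texture sample, B_weak_n is the number of weak texture samples, and I_wk(m, n) represents the pixel value in the mth row and nth column of weak texture sample I_wk;

step six, modeling the noise in the original image with a Poisson-Gaussian noise model, under which the total noise variance σ²(p, q) at position (p, q) in the original image is:

σ²(p,q) = ax(p,q) + b (11)

wherein x(p, q) represents the noise-free pixel value at position (p, q) in the original image, and a and b are the noise parameters; fitting the pixel value-noise variance estimation pairs of the weak texture samples I_wk by the least squares method to obtain the estimates of the noise parameters a and b of the original image.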

2. The signal-dependent noise parameter estimation method based on improved density peak clustering according to claim 1, characterized in that: in step one, a sliding window of size 16×16 is moved one pixel at a time, from top to bottom and from left to right, to extract a plurality of samples of size 16×16 from the original image containing noise.

3. The signal-dependent noise parameter estimation method based on improved density peak clustering according to claim 1 or 2, characterized in that the mean, entropy and gradient of each sample are calculated as the feature data of the sample:

wherein gray represents a gray value and p(gray) represents the probability that the gray value gray occurs; W and H are the width and height of the sample, and p_w,h is the pixel value at position (w, h) in the sample.