Multiscale image denoising using goodness-of-fit test based on EDF statistics

Khuram Naveed; Bisma Shaukat; Shoaib Ehsan; Klaus D. Mcdonald-Maier; Naveed ur Rehman

doi:10.1371/journal.pone.0216197

Abstract

Two novel image denoising algorithms are proposed which employ goodness-of-fit (GoF) tests at multiple image scales. The proposed methods operate by applying the GoF tests locally to the wavelet coefficients of a noisy image obtained via the discrete wavelet transform (DWT) and the dual tree complex wavelet transform (DT-CWT), respectively. We formulate image denoising as a binary hypothesis testing problem, with the null hypothesis indicating the presence of noise and the alternate hypothesis representing the presence of the desired signal only. The decision whether a given wavelet coefficient corresponds to the null or the alternate hypothesis is made by GoF testing based on the empirical distribution function (EDF), applied locally to the noisy wavelet coefficients. The performance of the proposed methods is validated by comparing them against state-of-the-art image denoising methods.

Citation: Naveed K, Shaukat B, Ehsan S, Mcdonald-Maier KD, ur Rehman N (2019) Multiscale image denoising using goodness-of-fit test based on EDF statistics. PLoS ONE 14(5): e0216197. https://doi.org/10.1371/journal.pone.0216197

Editor: Ahmadreza Baghaie, New York Institute of Technology, UNITED STATES

Received: November 27, 2017; Accepted: April 16, 2019; Published: May 10, 2019

Copyright: © 2019 Naveed et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Matlab code along with input images used in this study are available at https://www.mathworks.com/matlabcentral/fileexchange/64531-gofshrink.

Funding: This work was supported by the UK Engineering and Physical Sciences Research Council (EPSRC) under Grants EP/R02572X/1 and EP/P017487/1.

Competing interests: The authors have declared that no competing interests exist.

1 Introduction

Image acquisition and transmission typically corrupt an image with additive noise. Image denoising algorithms are therefore used to suppress noise while preserving the desired image features. Let x_p,q denote a pixel of a noisy N × N image X at location (p, q), acquired from an acquisition device, a transmission medium or a reconstruction process as $x_{p,q} = s_{p,q} + \eta_{p,q}$ (1) where s_p,q denotes the pixels of the true image S while η_p,q denotes noise at pixel location (p, q). In matrix form, the above equation can be written as $X = S + \eta$ (2) The goal of denoising is to estimate the true signal S from its noisy observation X. Here, η is considered to be independent Gaussian noise with zero mean and arbitrary variance σ².

Earlier, denoising was achieved by linear methods such as Wiener filtering in the Fourier domain [1]. However, the scope of such techniques is limited to stationary data because the Fourier transform is incapable of handling non-linear or non-stationary data. This led to multi-scale denoising methods employing non-linear operations such as thresholding in the transform domain [2]. For that purpose, the discrete wavelet transform (DWT) was employed, which decomposes a dataset into multiple scales and gives a sparse representation of the signal in the transform domain [3]. The DWT based denoising algorithms exploit the sparsity of the wavelet coefficients [4–6] through simple yet powerful nonlinear thresholding operations [7, 8] to obtain the denoised image. A similar principle is adopted when denoising with variants of the DWT such as the double density discrete wavelet transform (DDDWT), the complex wavelet transform (CWT) and the dual tree complex wavelet transform (DT-CWT).

Among the wavelet based denoising methods, VisuShrink [9] is one of the simplest techniques; it employs a universal threshold for all the scales depending largely on image size and noise level. The disadvantage of this method is that it tends to over smooth large sized images. This is due to the dependence of the estimated threshold on the input image size. Therefore, comparatively better performance is shown by the adaptive data driven techniques which estimate the threshold separately for each scale [10–18]. An example of such a method is the SureShrink [10], which exploits the Stein’s unbiased risk estimator (SURE) to get an unbiased estimate of the threshold to perform signal/image denoising. An extension of the SureShrink is the Surelet [12], which employs the principle of SURE along with the linear expansion techniques (LET) to cast the denoising problem as the one with linear system of equations. The BayesShrink [13], on the other hand, operates within the Bayesian framework with prior application of Generalized Gaussian Distribution (GGD) on wavelet coefficients. An empirical Bayes approach of denoising based on the Jeffrey’s non-informative prior [14] exploits the sparsity and de-correlation properties of DWT for denoising purposes. Recently, empirical Bayes approach of denoising has been extended to 2D scale-mixing complex valued wavelet transform, namely cSM-EB [15].

Sparsity based signal recovery methods have also been explored as an avenue for image denoising. To that end, a compressive sensing based image denoising algorithm is proposed in [19] where L₁-minimization has been used to recover the true signal. In [20], sparse and redundant signal representation over learned dictionaries is used for denoising images. Clustering based locally learned dictionaries are employed for image denoising in [21] whereby clusters of local patches are obtained based on likewise geometrical structures. Similarly, clustering based sparse representation (CSR) method for image denoising combines the dictionary learning with structured clustering to exploit enhanced sparsity in [22]. A hybrid image denoising algorithm is proposed in [23] based on wavelet transform in combination with the learned and redundant dictionaries. In this method, the wavelet transform is used to obtain multiscale feature and sparse prior for wavelet coefficients which leads to the sparse representation in wavelet domain. Subsequently, the K-SVD algorithm is used to build sparse over-complete dictionaries of wavelet coefficients resulting in a state of the art image denoising algorithm. Patch based noisy image specific orthogonal dictionaries are learned using PCA in [24] to threshold the patch coefficients for image denoising, namely PaPCA.

A collaborative hard thresholding based filtering technique is used within BM3D [25] to exploit enhanced sparsity in the transform domain. Here, a complex multistage process is adopted, starting with the grouping of similar fragments of 2D transformed coefficients which are then arranged into 3D data arrays. Subsequently, attenuation of noise is achieved via spatial collaborative hard-thresholding followed by collaborative Wiener filtering on the 3D arrays of the transformed coefficients. Despite its efficacy, the computational complexity of BM3D is considerably high owing to its complicated multi-step procedure [25].

Sparsity driven iterative algorithms are also used to solve total variation (TV) minimization for image denoising. For instance, several iterative algorithms have been designed for TV denoising, including the iterative soft thresholding algorithm (ISTA), fast ISTA (FISTA) and a monotone version of FISTA [26]. In addition, the split Bregman algorithm has been used for efficient isotropic and anisotropic TV image denoising in [27]. Similarly, Beltrami regularization is considered in [28] for image denoising and has been shown to outperform TV based methods.

Spatial domain filtering techniques such as mean and median filtering are commonly used but are known to produce sub-optimal denoising. However, an efficient spatial domain non-local means (NLM) filtering technique for image denoising is proposed in [29], which is regarded as a gold standard denoising method owing to its effective denoising performance. In this technique, image pixels having the smallest Euclidean distance from each other are grouped together, and a weighted mean of these pixels is used for noise smoothing. Hence, for each pixel, similar pixels are searched, grouped and averaged, leading to very high computational complexity. Although this technique yields visually pleasing denoising results, it is known to over-smooth the details of an image.

Mostly, classical thresholding strategies exploit sparsity in transform domain by considering that coefficients corresponding to the signal have higher amplitudes compared to the noisy coefficients. Contrarily, Cai and Silverman [16] observed that wavelet coefficients corresponding to signal are distributed in the locality of each other while coefficients corresponding to noise are distributed uniformly. They used this fact to introduce neighbourhood based thresholding strategies for 1D signals [16] in which a coefficient is classified as signal if it is surrounded by likewise coefficients and vice versa. NeighShrink [17] introduces neighbourhood based thresholding to image denoising which operates by classifying a wavelet coefficient surrounded by higher amplitude coefficients as desired signal while a coefficient surrounded by the lower amplitude coefficients is classified as noise. Similarly, NeighSure [18] refines neighbourhood based thresholding via the SURE to achieve image denoising. A simple yet effective image denoising method exploiting the statistical neighbourhood dependencies of wavelet coefficients is proposed in [30]. A statistical model for neighbourhoods of oriented pyramid coefficients is developed in [31], which is based on Gaussian scale mixtures of empirical wavelet coefficients. The intra-scale dependencies within the wavelet coefficients have been modeled using fuzzy features in Fuzzy-Shrink [32], where a fuzzy feature distinguishes between the image discontinuities and noise.

Recently, statistical methods have emerged as a strong tool in the wavelet based image denoising. These methods exploit statistical dependencies within the wavelet coefficients for estimating the thresholds for denoising. BiShrink [33] models inter-scale dependencies in wavelet coefficients (obtained via the DWT as well as the DT-CWT) based on a new non-Gaussian bivariate distribution for threshold estimation. The method also includes a nonlinear bivariate shrinkage function driven through a maximum a posteriori (MAP) estimator. The ProbShrink [32] estimates a threshold based on the probability that a given coefficient contains significant information (signal of interest) by assuming a generalized Laplacian prior for noise free data.

A major issue in the conventional DWT is the lack of translation invariance in the traditional wavelet basis functions, which results in artifacts after denoising. These artifacts can be explained by the Gibbs phenomenon in the neighbourhood of discontinuities. The stationary DWT, which is translation invariant, can render partial translation invariance to the denoising results and can be implemented via the cycle spinning approach [34]. In cycle spinning, noisy data is first shifted left or right, denoised via a wavelet based method and subsequently un-shifted. This process is repeated several times and all the results are averaged to produce a denoised signal/image with fewer artifacts. It has been shown in [34] that denoising results can be improved considerably by making the DWT partially translation invariant through cycle spinning.
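The shift, denoise, un-shift and average loop of cycle spinning can be sketched in a few lines. The following is a minimal 1D illustration in plain Python, not the authors' implementation: `pair_average` is a hypothetical, deliberately shift-variant stand-in for a wavelet denoiser, and circular shifts are assumed at the boundaries.

```python
def circular_shift(values, k):
    """Circularly shift a list to the right by k positions (left if k < 0)."""
    n = len(values)
    k %= n
    return list(values[n - k:]) + list(values[:n - k])


def pair_average(values):
    """Hypothetical shift-variant 'denoiser': average disjoint adjacent pairs."""
    out = list(values)
    for i in range(0, len(values) - 1, 2):
        m = (values[i] + values[i + 1]) / 2.0
        out[i] = out[i + 1] = m
    return out


def cycle_spin(values, denoise, shifts):
    """Average the un-shifted outputs of `denoise` over the given shifts."""
    acc = [0.0] * len(values)
    for k in shifts:
        shifted = circular_shift(values, k)              # shift
        restored = circular_shift(denoise(shifted), -k)  # denoise, then un-shift
        for i, v in enumerate(restored):
            acc[i] += v
    return [v / len(shifts) for v in acc]


step = [0.0, 0.0, 4.0, 4.0]
smoothed = cycle_spin(step, pair_average, shifts=[0, 1])
```

With only the zero shift the pairwise denoiser leaves the step edge untouched, while the shifted pass straddles the edge; averaging the two gives a result that no longer depends on how the pairs happen to align with the data.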

In contrast to DWT, the DT-CWT enjoys near translation invariance and directional selectivity at the cost of a higher degree of redundancy [35]. The redundancy in DT-CWT is due to the fact that real and imaginary parts of the complex wavelet coefficients are dealt as independent wavelet coefficients which makes it twice redundant. However, in order to incorporate directional selectivity in the two dimensional DT-CWT, the complex wavelet coefficients are obtained at six directions compared to the three directions of the DWT (i.e. horizontal, vertical and diagonal), which further increases the redundancy by two. Hence, the two dimensional DT-CWT is 4:1 redundant as compared to the DWT [35]. In the two dimensional DT-CWT, dual tree of filters oriented at 6 directions are employed, yielding six bands of real parts and six bands of imaginary parts of the complex wavelet coefficients at each scale.

The directional selectivity in DT-CWT preserves orientation of the edges or discontinuities having a line or a curve shape, unlike DWT which only preserves the point discontinuities. In addition, the directional selectivity in DT-CWT helps avoid the checker-board artifacts during denoising process by differentiating between the edges oriented at 45° and −45° [35].

The redundancy, in combination with the filter banks designed to achieve complex number representation, makes DT-CWT approximately translation invariant. The maximal decimation in DWT causes aliasing in the decomposed wavelet coefficients. In order to cancel the effect of aliasing and achieve perfect reconstruction, the synthesis filters for inverse DWT operation are designed to fulfill the aliasing-free condition. However, the aliasing can only be avoided if the wavelet coefficients are not perturbed, which is not the case in wavelet based denoising. Contrarily, in DT-CWT, the inherent redundancy (4:1) suppresses aliasing to a large extent, yielding better denoising results.

Several denoising methods have been reported in literature which utilize the above desirable properties of the DT-CWT: In [30], dependencies among three scales of DT-CWT coefficients are exploited. NeighSure [18] employs Stein’s unbiased risk estimator (SURE) on complex wavelet coefficients of the DT-CWT to find an optimum threshold and a window size. Furthermore, image denoising methods reported in [36–41] are some of the recent methods which exploit near translation invariance and directional selectivity of the DT-CWT for improved denoising performance.

In this paper, two image denoising methods are proposed which employ statistical goodness of fit (GoF) tests on multi-scale wavelet coefficients obtained via DWT and DT-CWT. The decision process regarding the presence of noise at multiple scales is based on the statistical GoF tests, wherein Anderson Darling (AD) statistic is used as a measure of similarity between the local wavelet coefficients and reference Gaussian noise distribution. A coefficient is detected as corresponding to noise if its associated AD measure is less than a threshold, which is a function of probability of false alarm. Those coefficients are then eliminated (set to zero) while the remaining coefficients are retained. We demonstrate the effectiveness of the proposed methods by comparing them against the state-of-the-art in wavelet based image denoising on both natural and medical input images.

In our previous work [42–45], we employed the GoF test on multiple 1D signal scales, obtained via the 1D DWT, for signal denoising. Poisson denoising in the context of CMOS/CCD images has also been proposed in [46]. In this work, we employ the GoF test on multiple image scales for image denoising. To this end, a novel framework is developed for GoF testing on multiple scales of the DWT as well as the DT-CWT, which offers better translation invariance and directional selectivity. The proposed methodology is significantly different from classic wavelet thresholding techniques in which the wavelet coefficients are directly compared against a threshold. In the proposed thresholding method, the decision regarding the noisy image coefficients is made based on the statistical distance between the distribution of the local wavelet coefficients and the reference noise distribution.

This paper is organized as follows: Section 2 gives the background of wavelet based image denoising along with an insight into GoF testing and its operation. A detailed discussion of the proposed algorithms is presented in Section 3. Section 4 analyses the computational complexity, Section 5 presents the experimental results and discussion, and the final section concludes the paper and highlights possible avenues for future work.

2 Theoretical background

2.1 Wavelet transform based image denoising

Let $\mathcal{W}$ denote the wavelet transform applied to a noisy image X to decompose it into wavelet coefficients at multiple scales as $W = \mathcal{W}(X)$ (3) where W denotes the matrix composed of wavelet coefficients $w_i^j$, with j denoting the scale of decomposition and i the location of a coefficient within that scale. The operator $\mathcal{W}$ may refer to the DWT or the DT-CWT operation: when $\mathcal{W}$ refers to the DWT, W is a two dimensional matrix of wavelet coefficients whose formation is depicted in Fig 1 (left), where each scale of decomposition contains three bands of wavelet coefficients, each associated with a direction, namely horizontal, vertical and diagonal. The location index i first lists the horizontal coefficients (column wise), followed by the vertical and diagonal wavelet coefficients.

Fig 1. Difference in the formation of wavelet coefficient matrix W in case of the DWT and the DT-CWT operation; (left) arrangement of the empirical wavelet coefficients in a 2D matrix W in case of the DWT operation; (right) arrangement of the complex wavelet coefficients in a 3D matrix W in case of the DT-CWT operation, where first two layers contain the real parts and the last two layers contain the imaginary parts of the complex wavelet coefficients.

https://doi.org/10.1371/journal.pone.0216197.g001

On the other hand, when the operator $\mathcal{W}$ denotes the DT-CWT operation, W is a three dimensional matrix of wavelet coefficients as shown in Fig 1 (right), where each scale of decomposition contains twelve bands of wavelet coefficients. To achieve this representation, the redundant wavelet coefficients yielded by the DT-CWT are placed in four different two dimensional matrices in accordance with the formation shown in Fig 1 (left), and those four matrices are then stacked to form the four layers of a three dimensional matrix as shown in Fig 1 (right). Note that, for each scale, the first two layers contain the real parts of the complex wavelet coefficients and the last two layers contain the imaginary parts.

A threshold value T is next estimated to classify the coefficients as belonging to signal or noise. For example, the popular universal threshold [9] is based on the image size N × N and the noise standard deviation σ, which is estimated as $\hat{\sigma} = \mathrm{median}(|w_i^1|) / 0.6745$ (4) where i indexes only the diagonal wavelet coefficients at the scale j = 1. A thresholding operator ϒ is next applied individually to each wavelet coefficient as given below $\hat{w}_i^j = \Upsilon(w_i^j, T)$ (5) where $\hat{w}_i^j$ are the thresholded empirical wavelet coefficients. ϒ could be the soft or the hard thresholding rule, both of which exhibit near optimal properties in the minimax sense and better convergence rates for approximating functions in Besov spaces [7, 8]. In the soft thresholding operation, the coefficients whose magnitudes are less than the threshold T are set to zero and the magnitudes of the remaining coefficients are reduced (shrunk) by T. The hard thresholding operation keeps the coefficients whose magnitudes are greater than T and sets the remaining coefficients to zero.
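The classical pipeline described above (robust noise estimate, universal threshold, then hard or soft thresholding) can be sketched as follows. This is an illustrative plain-Python sketch rather than the paper's code; the universal threshold is written in its standard form, σ·sqrt(2 ln n), under the assumption that n = N² is the number of pixels, and all function names are hypothetical.

```python
import math
import statistics


def mad_sigma(diagonal_coeffs):
    """Noise std estimate: median absolute value of the finest diagonal band / 0.6745."""
    return statistics.median(abs(w) for w in diagonal_coeffs) / 0.6745


def universal_threshold(sigma, n_side):
    """Standard universal threshold for an n_side x n_side image (assumed n = N^2)."""
    return sigma * math.sqrt(2.0 * math.log(n_side * n_side))


def hard_threshold(w, t):
    """Keep w if its magnitude exceeds t, otherwise set it to zero."""
    return w if abs(w) > t else 0.0


def soft_threshold(w, t):
    """Zero small coefficients and shrink the magnitude of the rest by t."""
    return math.copysign(max(abs(w) - t, 0.0), w)


sigma_hat = mad_sigma([0.6745, -0.6745, 0.6745])
T = universal_threshold(sigma_hat, n_side=512)
```

Because the threshold grows with ln(N²), larger images receive a larger T for the same noise level, which is the size dependence that makes the universal threshold prone to over-smoothing.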

After performing the thresholding operation, the inverse wavelet transform [3] is applied to the noise suppressed wavelet coefficients to get an estimate $\hat{S}$ of the true image S in the spatial domain $\hat{S} = \mathcal{W}^{-1}(\hat{W})$ (6) where $\hat{W}$ is the matrix of thresholded empirical wavelet coefficients (see Fig 1).

2.2 Statistical goodness-of-fit testing

The goodness-of-fit (GoF) test indicates how well a specified model or distribution fits a given set of observations. The GoF test performs hypothesis testing whereby the case in which the observations fit the specified model/distribution is termed the null hypothesis $\mathcal{H}_0$, and the case in which the observations reject the specified model/distribution is termed the alternate hypothesis $\mathcal{H}_1$. In order to quantify the difference between the observed values and the values expected under the specified distribution, different statistics/measures of GoF have been defined [47, 48]. Several measures of GoF are employed in practice [49–52], each having unique properties of its own, but only the Anderson-Darling (AD) statistic [51] will be discussed here because of its relevance to our work. A detailed discussion on the topic is presented in [53].

Let $F(t)$ denote the empirical cumulative distribution function (ECDF) of input samples z with support t, and let $F_r(t)$ represent the hypothesized cumulative distribution function (reference CDF) corresponding to a probability density function p(z). The AD statistic τ is given as follows $\tau = L \int_{-\infty}^{\infty} \left(F(t) - F_r(t)\right)^2 \psi(F_r(t)) \, dF_r(t)$ (7) where $\psi(F_r(t))$ is the weighting function, responsible for giving more weight to the tails of the distribution, and is given as $\psi(F_r(t)) = \left[F_r(t)\left(1 - F_r(t)\right)\right]^{-1}$ (8) In order to compute τ, the numeric expression for the AD statistic relation in (7) is as follows $\tau = -L - H$ (9) where L denotes the number of given observations x_t (sorted in ascending order), or the size of the window in case of local operation of the GoF test, and H is defined as $H = \sum_{t=1}^{L} \frac{2t-1}{L} \left[\ln F_r(x_t) + \ln\left(1 - F_r(x_{L+1-t})\right)\right]$ (10) The probability distribution of the distance τ is specified asymptotically as the window length L → ∞.
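As a concrete illustration of the numeric form of the AD statistic, the sketch below evaluates τ = −L − H for a window of samples against a zero-mean Gaussian reference CDF. It is a minimal plain-Python sketch under stated assumptions, not the authors' Matlab code: the reference is taken to be a standard normal, CDF values are clipped by a small `eps` to keep the logarithms finite, and the two test windows are synthetic.

```python
import math
from statistics import NormalDist


def gaussian_cdf(x, mu=0.0, sigma=1.0):
    """Reference CDF F_r(t) of a Gaussian with mean mu and std sigma."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def anderson_darling(samples, cdf=gaussian_cdf, eps=1e-12):
    """AD statistic tau = -L - H, with H summed over the sorted samples."""
    x = sorted(samples)
    L = len(x)
    # Clip CDF values away from 0 and 1 so that both logarithms stay finite.
    F = [min(max(cdf(v), eps), 1.0 - eps) for v in x]
    H = sum(
        (2 * t - 1) / L * (math.log(F[t - 1]) + math.log(1.0 - F[L - t]))
        for t in range(1, L + 1)
    )
    return -L - H


# Synthetic windows: evenly spaced Gaussian quantiles mimic a noise-only window,
# and the same values offset by 3 mimic a window dominated by signal.
noise_like = [NormalDist().inv_cdf((i + 0.5) / 25) for i in range(25)]
signal_like = [v + 3.0 for v in noise_like]
tau_noise = anderson_darling(noise_like)
tau_signal = anderson_darling(signal_like)
```

The noise-like window yields a value of τ close to zero, whereas the offset window yields a much larger one, which is the separation the hypothesis test relies on.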

Within the framework of the GoF test, a threshold T is computed for a given probability that observations drawn from the reference distribution falsely reject it. In spectrum sensing related literature [54–56], the probability of falsely rejecting a candidate distribution is termed the probability of false alarm P_fa, defined as follows, $P_{fa} = \mathrm{Prob}\left(\tau > \lambda \mid \mathcal{H}_0\right)$ (11) where λ is a candidate threshold and the range {z s.t. τ > λ} contains the values yielding a false alarm. P_fa is generally kept very low to estimate an appropriate threshold T [57].

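Next, the hypothesis test defined in (12) is performed to either accept the null hypothesis $\mathcal{H}_0$ or reject it in favour of the alternate hypothesis $\mathcal{H}_1$. $\tau < T \Rightarrow \mathcal{H}_0, \qquad \tau \geq T \Rightarrow \mathcal{H}_1$ (12)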

3 GoF based multiscale image denoising

Two novel image denoising methods are proposed which employ the GoF test on the wavelet coefficients of the noisy image obtained by using the DWT and the DT-CWT respectively. The DT-CWT exhibits approximate translation invariance and directional selectivity, which helps it to suppress the artifacts otherwise present in the DWT based denoising results. We denote the proposed denoising methods as the GoFShrink based on the DWT and the DT-CWT.

Conventionally, GoF tests have been applied to detection problems where they operate directly on input data to test the binary hypothesis of the noise only and signal plus noise cases, e.g. spectrum sensing [54–56], as follows $\mathcal{H}_0: X = \eta, \qquad \mathcal{H}_1: X = S + \eta$ (13)

By contrast, in the denoising problem, the alternate hypothesis must correspond to the detection of the signal only case. To achieve that, we propose to employ multiscale wavelet transforms on the input noisy data before applying the GoF test. The DWT and DT-CWT distribute the signal coefficients sparsely as compared to noise coefficients, which are distributed uniformly across the scales, thus segregating signal and noise into separate coefficients at multiple scales. The modified binary hypotheses using the GoF test at multiple scales are given below $\tilde{\mathcal{H}}_0: w_i^j \text{ is a noise coefficient}, \qquad \tilde{\mathcal{H}}_1: w_i^j \text{ is a signal coefficient}$ (14) where $\tilde{\mathcal{H}}_0$ and $\tilde{\mathcal{H}}_1$ denote the modified null and alternate hypotheses at multiple scales respectively and $w_i^j$ denotes the multiscale wavelet coefficients obtained through the DWT or the DT-CWT operation as specified in (3).

Given a scale dependent threshold T_j, the proposed framework first computes a test statistic τ_i for a sub-image centered around the coefficient $w_i^j$ at scale j and then compares it with the threshold T_j. The decision regarding the null hypothesis $\tilde{\mathcal{H}}_0$ or the alternate hypothesis $\tilde{\mathcal{H}}_1$, as defined in (14), is taken as follows $\tau_i < T_j \Rightarrow \tilde{\mathcal{H}}_0, \qquad \tau_i \geq T_j \Rightarrow \tilde{\mathcal{H}}_1$ (15) Finally, the coefficients identified as noise (i.e. $\tilde{\mathcal{H}}_0$) are rejected at each scale, while the remaining coefficients are retained as part of the desired signal (i.e. $\tilde{\mathcal{H}}_1$). The steps of the proposed algorithm are listed in Algorithm 1 and are graphically depicted in Fig 2.

Fig 2. Block diagram of the GoFShrink based on DWT.

https://doi.org/10.1371/journal.pone.0216197.g002

Remark 1: For the GoF testing, the reference CDF (i.e. CDF describing noise in the signal) must be known a-priori. In our case, the reference distribution is white Gaussian noise which means specifying mean and variance completely specifies F_r(t).

Remark 2: τ could be computed using any GoF statistic based on the empirical distribution function (EDF), e.g. the Anderson-Darling (AD), Cramér-von Mises (CVM) and Kolmogorov-Smirnov (KS) statistics. AD and CVM have been found to be relatively robust as compared to other EDF statistics. An insight into how these statistics ensure detection of the signal only and noise only cases is shown in Fig 3.

Fig 3. Test for Gaussianity via GoF tests, where case (a) shows noise detection as τ is expected to be small, and case (b) shows signal detection as τ is expected to be large.

https://doi.org/10.1371/journal.pone.0216197.g003

Let an input noisy image X be decomposed into wavelet coefficients W at multiple scales j = 1, …, J through the DWT operation in (3). We next estimate the standard deviation σ of the noise in the input image via (4) and subsequently normalize the wavelet coefficients by the estimate $\hat{\sigma}$ to make the noise unit variance at multiple scales, as follows, $\tilde{w}_i^j = w_i^j / \hat{\sigma}$ (16) where $\tilde{w}_i^j$ denotes the normalized DWT coefficients.

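Next, the level dependent threshold T_j must be computed for a probability of false alarm P_fa, which requires the estimation of $F_r^j(t)$, the reference noise distribution at scale j. In this work, the reference distribution at multiple scales corresponds to zero mean white Gaussian noise, i.e. $\mathcal{N}(0, \sigma^2)$, since the DWT and the DT-CWT retain the Gaussianity of input noise at multiple scales, and it can be computed as follows, $F_r^j(t) = \frac{1}{\sqrt{2\pi}\,\sigma} \int_{-\infty}^{t} e^{-z^2 / (2\sigma^2)} \, dz$ (17) where z is a zero mean Gaussian random variable with arbitrary variance σ² which can be estimated using (4). The EDF of the local wavelet coefficients around the coefficient $\tilde{w}_i^j$ at scale j is computed as $F_i^j(t) = \frac{1}{l^2} \sum_{\tilde{w} \in \Omega_i^j} \mathbb{1}(\tilde{w} \leq t)$ (18) where $\Omega_i^j$ denotes the set of coefficients inside the window centered at $\tilde{w}_i^j$ and l × l denotes the window size.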

For empirically estimating T_j at scale j, a large sized WGN η is decomposed using the DWT and the resulting multiscale WGN coefficients W_η are divided into small windows of size l × l. Let $N_w^j$ be the total number of such windows at scale j. For each window centered at i, let τ_i be the value of the AD statistic computed via (7) by employing the $F_r^j(t)$ and $F_i^j(t)$ defined in (17) and (18) respectively. If T_j is a chosen threshold and $N_{fa}^j$ is the number of false alarms, i.e. windows where τ_i ≥ T_j, then $P_{fa} = N_{fa}^j / N_w^j$. This way, the P_fa versus threshold curve is estimated for a range of values of the threshold T_j as shown in Fig 4.

Fig 4. Threshold versus P_fa graph generated empirically for the first five scales of wavelet decomposed Gaussian noise along with its curve fitted version.

https://doi.org/10.1371/journal.pone.0216197.g004

Remark 3: Owing to the orthogonal and linear nature of the DWT, the T_j versus P_fa curves were found to be similar for all the scales as expected. The following mathematical model for threshold selection based on P_fa was obtained using polynomial curve-fitting as shown in Fig 4. (19)

Remark 4: Probability of false alarm (P_fa), in this case, denotes the probability that a noise coefficient is detected as a signal. That probability should be very small and is specified in the range of P_fa = 10⁻³ → 10⁻⁵.
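The empirical threshold selection described above can be sketched with a small Monte Carlo experiment. The following plain-Python sketch makes a simplifying assumption that is not in the paper: instead of decomposing a large WGN image with the DWT, it draws independent unit-variance Gaussian windows directly. The window length, number of windows, seed and function names are all illustrative choices.

```python
import math
import random


def gaussian_cdf(x):
    """CDF of the zero-mean, unit-variance reference noise."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def ad_statistic(samples, eps=1e-12):
    """AD statistic tau = -L - H against the standard normal reference."""
    x = sorted(samples)
    L = len(x)
    F = [min(max(gaussian_cdf(v), eps), 1.0 - eps) for v in x]
    H = sum(
        (2 * t - 1) / L * (math.log(F[t - 1]) + math.log(1.0 - F[L - t]))
        for t in range(1, L + 1)
    )
    return -L - H


def simulate_noise_taus(window_len=25, n_windows=2000, seed=0):
    """Sorted AD statistics of noise-only windows (assumed i.i.d. Gaussian)."""
    rng = random.Random(seed)
    return sorted(
        ad_statistic([rng.gauss(0.0, 1.0) for _ in range(window_len)])
        for _ in range(n_windows)
    )


def threshold_for_pfa(sorted_taus, p_fa):
    """Threshold T such that a fraction p_fa of noise windows has tau >= T."""
    n = len(sorted_taus)
    n_fa = round(p_fa * n)  # number of false alarms allowed
    if not 1 <= n_fa <= n:
        raise ValueError("p_fa is too small or too large for this sample size")
    return sorted_taus[n - n_fa]


def false_alarm_rate(taus, threshold):
    """Empirical P_fa: fraction of noise windows with tau >= threshold."""
    return sum(t >= threshold for t in taus) / len(taus)


taus = simulate_noise_taus()
t_05 = threshold_for_pfa(taus, 0.05)
t_01 = threshold_for_pfa(taus, 0.01)
```

Sweeping `p_fa` over a range of values traces out a threshold versus P_fa curve of the kind shown in Fig 4; resolving very small P_fa values reliably requires many more noise windows than the 2000 used here.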

Let $\tilde{w}_i^j$ be the wavelet coefficients which are part of $\tilde{W}$. The GoF test is applied to each $\tilde{w}_i^j$ by taking a window of size l × l around $\tilde{w}_i^j$ and then computing the EDF $F_i^j(t)$ of the coefficients in the window using (18). Subsequently, the AD distance τ_i between $F_i^j(t)$ and the reference CDF $F_r^j(t)$ at scale j is estimated via (7). For a given P_fa, a threshold T_j is selected and the following GoF based thresholding function is employed, $\hat{w}_i^j = \begin{cases} \tilde{w}_i^j, & \tau_i \geq T_j \\ 0, & \tau_i < T_j \end{cases}$ (20) Fig 5 reports an experimental estimation of a suitable choice of P_fa for selecting the threshold T_j.

Fig 5. Empirical selection of P_fa: Mean squared error (MSE) versus the P_fa relation obtained empirically for several test images.

Notice that the P_fa values closer to zero yield better results.

https://doi.org/10.1371/journal.pone.0216197.g005

Remark 5: The thresholding function (20) performs hard thresholding on the wavelet coefficients. This is in-line with the neighbourhood based thresholding rules reported in [16–18, 30, 31], whereby the central coefficient of a neighbourhood or a window is either retained as desired signal or removed as noise based on statistical or deterministic dependencies between the local wavelet coefficients.
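The local GoF thresholding rule can be sketched for a single band of normalized coefficients as follows. This plain-Python sketch is illustrative only: the window is truncated at the band borders (the paper does not state its boundary handling), the threshold value 4.0 is an arbitrary illustrative choice rather than one derived from (19), and the function names are hypothetical.

```python
import math
import random


def ad_statistic(samples, eps=1e-12):
    """AD statistic of the samples against a standard normal reference CDF."""
    x = sorted(samples)
    L = len(x)
    F = [0.5 * (1.0 + math.erf(v / math.sqrt(2.0))) for v in x]
    F = [min(max(f, eps), 1.0 - eps) for f in F]
    H = sum(
        (2 * t - 1) / L * (math.log(F[t - 1]) + math.log(1.0 - F[L - t]))
        for t in range(1, L + 1)
    )
    return -L - H


def gof_threshold_band(band, threshold, half=2):
    """Keep a coefficient only if the AD statistic of its local window
    (up to (2*half+1) x (2*half+1), truncated at borders) reaches the threshold."""
    rows, cols = len(band), len(band[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = [
                band[rr][cc]
                for rr in range(max(0, r - half), min(rows, r + half + 1))
                for cc in range(max(0, c - half), min(cols, c + half + 1))
            ]
            if ad_statistic(window) >= threshold:
                out[r][c] = band[r][c]  # window rejects the noise model: signal
            # otherwise the coefficient stays zero (classified as noise)
    return out


rng = random.Random(1)
noise_band = [[rng.gauss(0.0, 1.0) for _ in range(16)] for _ in range(16)]
signal_band = [[6.0] * 8 for _ in range(8)]

cleaned = gof_threshold_band(noise_band, threshold=4.0)
kept = gof_threshold_band(signal_band, threshold=4.0)
zero_fraction = sum(v == 0.0 for row in cleaned for v in row) / 256
```

Note that the decision for each coefficient depends on the distribution of its whole neighbourhood rather than on its own magnitude, which is what distinguishes this rule from conventional hard thresholding.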

Finally, the denoised empirical wavelet coefficients are reconstructed by the inverse DWT operation to yield the estimate of the true image S. However, before the reconstruction, the normalization in (16) is reversed by multiplying all the retained signal coefficients by the estimated standard deviation of the noise. $\hat{S} = \mathcal{W}^{-1}(\hat{\sigma}\,\hat{W})$ (21) Subsequently, the cycle spinning operation defined in [34] is performed to obtain the denoised image. We shall denote the proposed algorithm by GoFShrink-TI in the remainder of this paper.

The above method can be extended to the DT-CWT by applying the GoF test to the complex wavelet coefficients obtained by applying the DT-CWT to the noisy image. The DT-CWT exhibits near translation invariance and directional selectivity, which enables it to suppress various artifacts otherwise present in the DWT based denoising results [58].

The DT-CWT yields complex wavelet coefficients by separately calculating their real and imaginary parts. We propose to apply the GoF based denoising operation, namely GoFShrink, separately to both sets of real and imaginary parts. The steps include: (i) calculation of the scale dependent thresholds for the real and imaginary trees of noisy wavelet coefficients (a graphical depiction of this process is shown in Fig 6 (middle)); (ii) computation of the complex wavelet coefficients W of the noisy image by employing (3), where $\mathcal{W}$ denotes the DT-CWT operation; (iii) normalization of the DT-CWT coefficients of the noisy signal by employing (16); (iv) GoF based thresholding performed in parallel, whereby the AD statistic is computed independently and locally on the real and imaginary DT-CWT coefficients, followed by the use of the thresholding function in (20) for detecting and annihilating coefficients belonging to noise while the remaining coefficients are retained as desired signal (the shaded region in Fig 6 shows this process for imaginary parts while the unshaded region shows the same for real parts); (v) application of the inverse DT-CWT, after the reverse normalization operation, to yield the denoised signal. For the rest of the paper, we will denote this method by GoFShrink-DT. Matlab code for both of the proposed methods is available online at https://www.mathworks.com/matlabcentral/fileexchange/64531-gofshrink.

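Algorithm 1 GoFShrink based on DWT

1: i, j ← 0 ⊳ 2D Wavelet coefficient indexes

2: $W \leftarrow \mathcal{W}(X)$ ⊳ DWT operation on input X

3: P_fa ← 0.005 ⊳ P_fa selection based on the experiment given in Fig 5

4: $T_k \leftarrow$ threshold for the chosen P_fa ⊳ Operation implemented via the procedure given at Fig 2 (left)

5: $\hat{\sigma} \leftarrow \mathrm{median}(|w_i^1|) / 0.6745$ ⊳ Noise standard deviation estimation via (4)

6: $\tilde{W} \leftarrow W / \hat{\sigma}$ ⊳ Normalisation of the wavelet coefficients

7: for k = 1 to K do

8: for l = 1 to 3 do

9: for each coefficient $\tilde{w}_i^k$ in band l do

10: $\tau_i \leftarrow$ AD statistic of the window around $\tilde{w}_i^k$ via (9) ⊳ AD statistic

11: if $\tau_i < T_k$ then

12: $\hat{w}_i^k \leftarrow 0$ ⊳ Noise detection during GoF test

13: else

14: $\hat{w}_i^k \leftarrow \tilde{w}_i^k$ ⊳ Signal detection during GoF test

15: end if

16: end for

17: end for

18: end for

19: $\hat{S} \leftarrow \mathcal{W}^{-1}(\hat{\sigma}\,\hat{W})$ ⊳ Inverse DWT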

Fig 6. Block diagram of the GoFShrink based on DT-CWT.

https://doi.org/10.1371/journal.pone.0216197.g006

4 Computational complexity

In this section we present the computational cost of the GoFShrink based on the DWT. The computational cost of the GoFShrink based on the DT-CWT will be four times that of the GoFShrink based on the DWT, provided the length of the filters used by both transforms is exactly the same.

The DWT operation on an image (of size N × N) involves separate filtering of the rows and columns, where first rows are processed via 1D low and high pass filters followed by the decimation by 2, and then the same process is applied on the columns of the input matrix.

If M denotes the size of the 1D low and high pass filters, then the computation of the DWT coefficients will take 2M multiplications and 2(M − 1) additions per sample point. Since at the kth level the coefficients in the rows will have been downsampled by 2^(k−1), the total cost of implementing a filter at the kth level will involve 2M(1 − 2^−k) multiplications and 2(M − 1)(1 − 2^−k) additions per sample point. The total number of coefficients processed by the row filters will be N², as there are N rows in the image with each row having N pixels. Hence, the total complexity of implementing the row filters at all scales becomes 2N²M(1 − 2^−k) multiplications and 2N²(M − 1)(1 − 2^−k) additions. After including the computational cost on the image columns, which is the same as that on the rows, the total computational cost of the 2D DWT operation on the noisy image will be 4N²M(1 − 2^−k) multiplications and 4N²(M − 1)(1 − 2^−k) additions. Next, these DWT coefficients will be normalized by the estimated noise standard deviation, which requires N² multiplications.
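To get a feel for these counts, the short Python sketch below evaluates the stated totals for a given image size, filter length and number of levels. The function name `dwt_cost` is hypothetical, and the example values (a 512 × 512 image, 8-tap filters, five levels) are chosen only for illustration.

```python
def dwt_cost(n, m, levels):
    """Multiplications and additions of a 2D DWT on an n x n image, using
    the totals 4 N^2 M (1 - 2^-k) and 4 N^2 (M - 1) (1 - 2^-k) from the text,
    with k taken as the number of decomposition levels."""
    scale = 1.0 - 2.0 ** (-levels)
    mults = 4 * n * n * m * scale
    adds = 4 * n * n * (m - 1) * scale
    return int(round(mults)), int(round(adds))


def normalisation_cost(n):
    """One multiplication per coefficient to normalise by the noise std."""
    return n * n


mults, adds = dwt_cost(512, 8, 5)
print(mults, adds, normalisation_cost(512))  # 8126464 7110656 262144
```

Since the inverse transform is stated to cost the same as the forward one, doubling the `dwt_cost` figures gives the transform cost of a full denoising pass.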

The computation of the empirical CDF is an important part of GoF tests and requires on the order of O(L log L) computations, where L denotes the total number of coefficients in the window used for the GoF test.
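The O(L log L) figure comes from sorting the window once. A minimal Python sketch, with the hypothetical helper name `make_edf`, shows one way to build the empirical distribution function this way:

```python
from bisect import bisect_right


def make_edf(samples):
    """Return the empirical distribution function of `samples`.
    Sorting costs O(L log L); each later evaluation costs O(log L)."""
    ordered = sorted(samples)
    n = len(ordered)

    def edf(t):
        # Fraction of samples that are less than or equal to t.
        return bisect_right(ordered, t) / n

    return edf


edf = make_edf([3, 1, 2, 2])
print(edf(0), edf(1), edf(2), edf(3))  # 0.0 0.25 0.75 1.0
```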

From (10), we can see that computing the AD statistic will require 3N²L multiplications and 2L(L − 1)N² additions for the N² coefficients of the DWT.

At the end, the inverse DWT operation will be performed on the thresholded wavelet coefficients. The inverse DWT operation mirrors the operation of the forward DWT but with different filters having the same length M. Therefore, the computational complexity of the inverse DWT will be exactly the same as the forward DWT operation.

5 Experimental results

This section presents the performance comparison of the proposed algorithms against the state of the art in image denoising. The peak signal to noise ratio (PSNR) has been employed as the measure of quantitative performance, given as

$$\mathrm{PSNR} = 10 \log_{10}\left(\frac{255^2}{\mathrm{MSE}}\right). \tag{22}$$

The mean squared error (MSE) is calculated as

$$\mathrm{MSE} = \frac{1}{N^2} \sum_{p=1}^{N} \sum_{q=1}^{N} \left(s_{p,q} - \hat{s}_{p,q}\right)^2, \tag{23}$$

where $s_{p,q}$ denotes the pixels of the true image $S$ of size N × N and $\hat{s}_{p,q}$ represents the pixels of the denoised image $\hat{S}$. Note that the MSE of the noisy image is equal to the variance of the noise σ².

For qualitative analysis, we employ the structural similarity (SSIM) measure and feature similarity (FSIM) measure. While SSIM evaluates the quality of a recovered image based on the structure, the FSIM evaluates the subjective quality of the recovered image based on how the human visual system (HVS) perceives the quality of an image [59].

The set of input images used for experimentation consisted of standard test images, including the Lena, Barbara, Peppers, Aeroplane and Cameraman images, coupled with images used in other practical applications, such as a medical Brain MRI image, a diffused Multi-focus image and a natural View image. The Brain MRI image was taken from the NIH IMAGE program ImageJ (https://imagej.nih.gov/nih-image/about.html), a public domain software package distributed freely by the National Institutes of Health. The Multi-focus image set was acquired during the study in [60]. The View image was selected because of the higher amount of detail in it and was captured by the authors at the COMSATS University Islamabad campus using a 13 mega-pixel digital camera. These test images were corrupted by Gaussian noise at multiple noise levels corresponding to σ = 10, 20, 30, 40 and 50, which produces noisy images with PSNRs = 28.13, 22.11, 18.59, 16.07 & 14.15 respectively. The Multi-focus image and View image are displayed in Fig 7 along with their noisy versions, while Lena, Barbara, Peppers, Aeroplane, Cameraman and Brain MRI have been provided as supplementary material with this work in S1 Fig.
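The relation between the noise level and the input PSNR can be checked with a few lines of Python. This is a sketch that assumes 8-bit images with a peak value of 255 and a noisy-image MSE equal to σ²; the helper names `mse` and `psnr` are hypothetical.

```python
import math


def mse(reference, estimate):
    """Mean squared error between two equally sized images (lists of rows)."""
    total = 0.0
    count = 0
    for ref_row, est_row in zip(reference, estimate):
        for s, s_hat in zip(ref_row, est_row):
            total += (s - s_hat) ** 2
            count += 1
    return total / count


def psnr(mse_value, peak=255.0):
    """PSNR in dB for a given MSE; peak=255 assumes 8-bit images."""
    return 10.0 * math.log10(peak * peak / mse_value)


# With MSE = sigma^2, sigma = 10, 20, 30 and 50 give input PSNRs that agree
# with the quoted 28.13, 22.11, 18.59 and 14.15 dB to within 0.01 dB.
for sigma in (10, 20, 30, 50):
    print(sigma, round(psnr(sigma ** 2), 2))
```

The same `psnr` function applied to `mse(original, denoised)` gives the output PSNR of a denoising method.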


Fig 7. Selected input images along with their noisy versions at noise level σ = 30 namely, (a) Multi-focus image; (b) View image.

https://doi.org/10.1371/journal.pone.0216197.g007

The performance of the proposed GoFShrink-TI and GoFShrink-DT methods has been evaluated by comparing them against well known state of the art image denoising methods based on different variants of the wavelet transform: BayesShrink (DWT) [13], BiShrink (DT-CWT) [33], Surelet (DWT) [12], NeighSure (DT-CWT) [18] and cSM-EB (CWT) [15]. In addition to the wavelet based methods, sparsity driven methods like PaPCA [24], iTVD [27], aTVD [27] and BeltDen [28] have also been considered for comparison. The computationally expensive non local mean (NLM) filtering method [29] has also been used as a comparative denoising method on practical images.

The DWT based denoising methods including the proposed GoFShrink-TI were implemented using Daubechies wavelet filters of eight taps, namely db8. The noisy images were decomposed into D = 5 wavelet levels. For the DT-CWT based image denoising methods, namely the NeighSure, BiShrink, and the proposed GoFShrink-DT, the dual tree of wavelet filters developed by Kingsbury in [61] for complex wavelets, were employed to decompose the noisy image into D = 5 levels. The parameters corresponding to the other comparative methods were used as specified by authors for best performance. The window size for performing the GoF test in the proposed methods was selected to be 5 × 5, though experiments with other window sizes including 3 × 3, 7 × 7 yielded similar results.

Table 1 presents the PSNR values obtained by applying various denoising methods on the selected test images. These PSNR values represent the average values taken over twenty iterations. The highest PSNR value is highlighted in shaded bold, while the second highest PSNR value is highlighted in bold (without shade) to underline the two best performing denoising algorithms at each noise level. The results in Table 1 demonstrate the superior performance of the proposed GoFShrink-DT against the selected state of the art image denoising methods at most noise levels for most of the test images; the exceptions are discussed below. Note that the GoFShrink-TI showed competitive performance when compared with the other image denoising methods for natural as well as medical images.


Table 1. Comparison of the proposed methods with the state-of-the-art image denoising methods in terms of output PSNR for a range of input noise levels σ = 10 to σ = 50.

https://doi.org/10.1371/journal.pone.0216197.t001

For the input image Barbara (of size 512 × 512), the GoFShrink-DT and the GoFShrink-TI outperformed other denoising methods at all noise levels. The best results were shown by the GoFShrink-DT which beat the rest of the denoising methods including the second best GoFShrink-TI method by a considerable margin. The GoFShrink-DT also demonstrated superior performance for Lena image (of size 512 × 512) at all noise levels while the second best results were shown by GoFShrink-TI at noise levels 10 ≤ σ ≤ 40 and iTVD at σ = 50, which outperformed GoFShrink-TI by a small margin.

For the Aeroplane and Side MRI images, the proposed GoFShrink-DT outperformed all the comparative methods at all noise levels, while the second best results were obtained alternately by GoFShrink-TI and PaPCA at different noise levels. The second best performance was demonstrated by the GoFShrink-TI for the Aeroplane image at noise levels σ = 10, 40 & 50, while the PaPCA yielded second best results for σ = 20 & 30. Similarly, for the Brain MRI image, the GoFShrink-TI offered second best performance at input noise levels σ = 10, 20 & 50, while PaPCA yielded the second highest PSNR values for σ = 30 & 40.

For Peppers image (of size 512 × 512) at σ = 10 & 20, the BeltDen yielded best performance in terms of output PSNRs followed by the NeighSure at σ = 10 and the GoFShrink-DT at σ = 20. For noise levels σ ≥ 30 GoFShrink-DT yielded best results.

For the Cameraman image (of size 256 × 256), the PaPCA method demonstrated the best performance against the rest of the denoising methods for 10 ≤ σ ≤ 40. However, at σ = 50, BeltDen yielded the best results. The GoFShrink-DT showed the second best performance for the Cameraman image at 20 ≤ σ ≤ 40. The NeighSure exhibited second best performance at the noise level σ = 10, while at noise level σ = 50, iTVD yielded the second highest PSNR values. Even though the GoFShrink-TI failed to be among the top two performing methods for the Cameraman image, it showed competitive performance against the best methods.

Similarly, the GoFShrink-DT outperformed the comparative state of the art methods for View and Multi-focus images (of size 512 × 512) at all noise levels. For Multi-focus image, the GoFShrink-TI yielded next best results at noise level σ ≤ 20, while the iTVD showed second best performance at σ = 30 & 40. For the View image, the PaPCA yielded second best results at σ ≤ 20, while the BeltDen, iTVD and aTVD were second best respectively for noise levels σ = 30, 40 & 50.

Table 2 presents the qualitative analysis of the denoised images obtained from the comparative state of the art methods along with the proposed GoFShrink-DT method. For that purpose, we obtain results for the input images ‘Lena’, ‘Plane’, ‘Peppers’ and ‘MRI’. It can be observed that the denoised images obtained from the proposed method yield the highest SSIM and FSIM values on most occasions. In cases where other methods yield better results, the proposed method still remains quite competitive. Among the state of the art, PaPCA and BeltDen yield the best results in terms of the SSIM and FSIM values.


Table 2. Comparison of the proposed methods with the state-of-the-art image denoising methods in terms of structural similarity (SSIM) and feature similarity (FSIM) for a range of input noise levels σ = 10 to σ = 50.

https://doi.org/10.1371/journal.pone.0216197.t002

The above results and discussion demonstrate the efficiency of the GoF based methods against the state of the art denoising methods for a variety of practical input images. The GoFShrink-TI, in particular, also showed competitive performance against the state of the art in image denoising. Among the state of the art methods, PaPCA and iTVD yielded good performance against the proposed methods, while the NeighSure and the Surelet have also been competitive.

To show the visual quality of the images recovered by various denoising methods, we take the specific case of a Brain MRI image in Fig 8, corrupted with WGN at σ = 20. Fig 8(a) shows the noisy version of the Brain MRI image, while Fig 8(b)–8(h) show the corresponding denoised images obtained by employing BiShrink, PaPCA, Surelet, NeighSure, cSM-EB, GoFShrink-TI and GoFShrink-DT, respectively. It can be noticed that the GoFShrink-DT retained the image details and avoided artifacts, thereby providing the best visual quality among the compared denoising methods. The GoFShrink-TI output contains some artifacts, but it also preserves important details better than NeighSure, Surelet and BiShrink, which also yielded artifacts. The cSM-EB performed comparatively better but fails to capture the clarity evident in the GoFShrink-DT results. The PaPCA demonstrated visually pleasing results with fewer artifacts; however, the denoised image is over-smoothed and it is hard to differentiate between smoother regions and inherent image discontinuities. We also computed the difference images corresponding to all the denoised images and then estimated the power of the difference images. The lowest difference-image power was obtained by the proposed methods, i.e. 38.7 & 50.9, while the comparative methods yielded higher values.


Fig 8. Visual results for several state-of-the-art image denoising methods on the Side MRI image of a brain corrupted with the noise level σ = 20.

This figure is composed of (a) noisy image; (b) denoised image from BiShrink; (c) PaPCA; (d) Surelet; (e) NeighSure; (f) cSM-EB; (g) GoFShrink-TI; and (h) GoFShrink-DT.

https://doi.org/10.1371/journal.pone.0216197.g008

In Fig 9, the performance of the proposed GoFShrink-TI and the GoFShrink-DT is compared with the iTVD, Surelet and NeighSure for the Multi-focus image. It can be observed that the denoised image obtained through the proposed GoFShrink-DT bears a striking resemblance to the original image, as it contains the fewest artifacts and recovers all of the important details when compared to the other methods. The second best results were shown by the GoFShrink-TI, which recovered all the details with few artifacts, see Fig 9(f). The NeighSure and the Surelet yielded more artifacts in Fig 9(d) & 9(e), even though image details were preserved. In contrast, the iTVD over-smoothed the detailed regions, leading to a poor estimate of the original image as shown in Fig 9(c). Further evidence of the visual performance of the proposed methods is the low power of their difference images (obtained by subtracting the denoised images from the original), 38.18 and 43.82 respectively, while the comparative methods Surelet and NeighSure yield 45.64 and 54.71 respectively. Even though the iTVD yields lower noise power compared to the GoFShrink-TI, the visual quality of its denoised image is not particularly impressive.


Fig 9. Results of several state of the art image denoising methods on Multi-focus image corrupted with noise with standard deviation σ = 30; (a) original image (b) noisy image (c) denoised image by iTVD and a zoomed in region (d) denoised image by Surelet and a zoomed in region (e) denoised image by NeighSure and a zoomed in region (f) denoised image by GoFShrink-TI and a zoomed in region (g) denoised image by GoFShrink-DT and a zoomed in region.

https://doi.org/10.1371/journal.pone.0216197.g009

Fig 10 shows the original and noisy View image along with the denoised images obtained from the BeltDen, aTVD and cSM-EB and the proposed methods for the input noise level σ = 40. Note that the denoised image obtained from GoFShrink-DT in Fig 10(g) yielded few artifacts with most details intact. The GoFShrink-TI also managed to recover important details when compared against the state of the art methods, but it also yielded a considerable amount of artifacts. The denoised images from other comparative methods, including the BeltDen and the cSM-EB, show significant artifacts. The aTVD yielded fewer artifacts than BeltDen and cSM-EB, although a few line artifacts are still present and image details are missing.


Fig 10. Visual performance comparison of various denoising methods on the View image at higher noise level σ = 40.

This figure is composed of (a) original image; (b) noisy image and denoised images from (c) BeltDen; (d) aTVD; (e) cSM-EB; (f) GoFShrink-TI; and (g) GoFShrink-DT.

https://doi.org/10.1371/journal.pone.0216197.g010

In order to validate our work, the proposed GoFShrink-DT is also compared against the NLM method, which is a computationally intensive state of the art method known for its effective denoising performance. For this purpose, the Brain MRI and Multi-focus images have been used. The denoised images obtained from the NLM and the GoFShrink-DT, at input noise levels σ = 20 & 30 (i.e. noisy MRI image with PSNR = 22.11 & 18.59), are displayed in S2 Fig, which is provided as supplementary material with this work. S2 Fig also reports the corresponding PSNR values of the noisy and the denoised images. The first column of S2 Fig shows the noisy images, while the second and third columns show the denoised images obtained from the NLM and the GoFShrink-DT respectively. It is evident that the NLM method yielded higher PSNRs and also managed to smooth out noise very effectively. However, NLM smooths image discontinuities or edges, thereby losing important details of the MRI image. In contrast, the GoFShrink yielded a comparatively lower PSNR but recovered important signal details which might be useful in clinical diagnosis.

Similar trends can be observed in the bottom two rows of S2 Fig, where the NLM over-smooths the Multi-focus image at input noise levels σ = 20 & 30 while yielding comparatively higher PSNR values than those of the proposed method. However, the proposed GoFShrink gives a sharper denoised image with more signal details.

6 Conclusion

A class of multiscale image denoising algorithms has been proposed which employs the goodness of fit test on multiple image scales obtained from the discrete wavelet transform (DWT) and the dual tree complex wavelet transform (DT-CWT). The Anderson Darling (AD) statistic has been employed, within the framework of the GoF test, on the wavelet coefficients of the noisy image to compute the distance between the empirical distribution function (EDF) of local coefficients and the CDF of reference Gaussian noise. A local thresholding function is then used to classify the wavelet coefficients as belonging to signal or noise depending on the given probability of false alarm (P_fa) and the estimated AD statistic. The signal coefficients are retained while the noise coefficients are discarded to yield the denoised image. While the current work only deals with the case of Gaussian noise, the proposed scheme has the potential to remove any type of noise given prior knowledge of the noise distribution. The proposed methods have been shown to outperform the state-of-the-art image denoising methods on a variety of input images ranging from standard test datasets to medical and diffusion images. The results have revealed that, of the two proposed methods, the GoFShrink-DT (based on DT-CWT) has outperformed the GoFShrink-TI (based on DWT), which was expected given the directional selectivity and translation invariance of the DT-CWT.

Supporting information

S1 Fig. Standard input test images (a) Lena (b) Barbara (c) Peppers (d) Plane (e) Cameraman (f) Brain MRI.

https://doi.org/10.1371/journal.pone.0216197.s001

(TIF)

S2 Fig. Comparison of the denoising performance of the proposed GoFShrink-DT against the NLM method on Multifocus and MRI datasets, whereby first column displays the noisy input images (at σ = 20 & 30) while second and third columns show denoised images by the NLM and the GoFShrink-DT respectively.

In addition, PSNR values of each image have also been reported.

https://doi.org/10.1371/journal.pone.0216197.s002

(TIF)

References

1. Wiener N. Extrapolation, interpolation, and smoothing of stationary time series: with engineering applications. Technology Press; 1950.
2. Donoho DL, Johnstone IM. Minimax estimation via wavelet shrinkage. The annals of Statistics. 1998;26(3):879–921.
3. Mallat S. A wavelet tour of signal processing. Elsevier; 1999.
4. Mallat SG. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Transactions on Pattern Analysis & Machine Intelligence. 1989;(7):674–693.
5. Cohen A, Kovacevic J. Wavelets: The mathematical background. Proceedings of the IEEE. 1996;84(4):514–522.
6. Daubechies I. Ten lectures on wavelets. vol. 61. Siam; 1992.
7. Donoho DL, Johnstone IM. Ideal spatial adaptation by wavelet shrinkage. Biometrika. 1994;81(3):425–455.
8. Donoho DL. De-noising by soft-thresholding. IEEE transactions on information theory. 1995;41(3):613–627.
9. Donoho DL, Johnstone IM, Kerkyacharian G, Picard D. Wavelet shrinkage: asymptopia? Journal of the Royal Statistical Society Series B (Methodological). 1995; p. 301–369.
10. Donoho DL, Johnstone IM. Adapting to unknown smoothness via wavelet shrinkage. Journal of the american statistical association. 1995;90(432):1200–1224.
11. Hao H, Wang H, Rehman N. A joint framework for multivariate signal denoising using multivariate empirical mode decomposition. Signal Processing. 2017;135:263–273.
12. Blu T, Luisier F. The SURE-LET approach to image denoising. IEEE Transactions on Image Processing. 2007;16(11):2778–2786. pmid:17990754
13. Chang SG, Yu B, Vetterli M. Adaptive wavelet thresholding for image denoising and compression. IEEE transactions on image processing. 2000;9(9):1532–1546. pmid:18262991
14. Figueiredo MA, Nowak RD. Wavelet-based image estimation: an empirical Bayes approach using Jeffrey’s noninformative prior. IEEE Transactions on Image Processing. 2001;10(9):1322–1331. pmid:18255547
15. Remenyi N, Nicolis O, Nason G, Vidakovic B. Image denoising with 2D scale-mixing complex wavelet transforms. IEEE Transactions on Image Processing. 2014;23(12):5165–5174. pmid:25312931
16. Cai TT, Silverman BW. Incorporating information on neighbouring coefficients into wavelet estimation. Sankhyā: The Indian Journal of Statistics, Series B. 2001; p. 127–148.
17. Chen G, Bui TD, Krzyżak A. Image denoising with neighbour dependency and customized wavelet and threshold. Pattern recognition. 2005;38(1):115–124.
18. Dengwen Z, Wengang C. Image denoising with an optimal threshold and neighbouring window. Pattern Recognition Letters. 2008;29(11):1694–1697.
19. Tavakoli A, Pourmohammad A. Image denoising based on compressed sensing. International Journal of Computer Theory and Engineering. 2012;4(2):266.
20. Elad M, Aharon M. Image denoising via sparse and redundant representations over learned dictionaries. IEEE Transactions on Image processing. 2006;15(12):3736–3745. pmid:17153947
21. Chatterjee P, Milanfar P. Clustering-based denoising with locally learned dictionaries. IEEE transactions on Image Processing. 2009;18(7):1438–1451. pmid:19447711
22. Dong W, Li X, Zhang L, Shi G. Sparsity-based image denoising via dictionary learning and structural clustering. In: Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on. IEEE; 2011. p. 457–464.
23. Li H, Liu F. Image denoising via sparse and redundant representations over learned dictionaries in wavelet domain. In: 2009 Fifth International Conference on Image and Graphics. IEEE; 2009. p. 754–758.
24. Deledalle CA, Salmon J, Dalalyan AS. Image denoising with patch based PCA: local versus global. In: BMVC. vol. 81; 2011. p. 425–455.
25. Dabov K, Foi A, Katkovnik V, Egiazarian K. Image denoising with block-matching and 3D filtering. In: Image Processing: Algorithms and Systems, Neural Networks, and Machine Learning. vol. 6064. International Society for Optics and Photonics; 2006. p. 606414.
26. Beck A, Teboulle M. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM journal on imaging sciences. 2009;2(1):183–202.
27. Shi Y, Chang Q. Efficient algorithm for isotropic and anisotropic total variation deblurring and denoising. Journal of Applied Mathematics. 2013;2013.
28. Zosso D, Bustin A. A primal-dual projected gradient algorithm for efficient Beltrami regularization. Computer Vision and Image Understanding. 2014; p. 14–52.
29. Buades A, Coll B, Morel JM. A non-local algorithm for image denoising. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. vol. 2. IEEE; 2005. p. 60–65.
30. Chen G, Zhu WP. Signal denoising using neighbouring dual-tree complex wavelet coefficients. IET Signal Processing. 2012;6(2):143–147.
31. Portilla J, Strela V, Wainwright MJ, Simoncelli EP. Image denoising using scale mixtures of Gaussians in the wavelet domain. IEEE Transactions on Image processing. 2003;12(11):1338–1351. pmid:18244692
32. Pizurica A, Philips W. Estimating the probability of the presence of a signal of interest in multiresolution single-and multiband image denoising. IEEE Transactions on image processing. 2006;15(3):654–665. pmid:16519352
33. Sendur L, Selesnick IW. Bivariate shrinkage functions for wavelet-based denoising exploiting interscale dependency. IEEE Transactions on signal processing. 2002;50(11):2744–2756.
34. Coifman RR, Donoho DL. Translation-invariant de-noising. In: Wavelets and statistics. Springer; 1995. p. 125–150.
35. Selesnick IW, Baraniuk RG, Kingsbury NC. The dual-tree complex wavelet transform. IEEE signal processing magazine. 2005;22(6):123–151.
36. Selesnick IW, Li KY. Video denoising using 2D and 3D dual-tree complex wavelet transforms. In: Wavelets: Applications in Signal and Image Processing X. vol. 5207. International Society for Optics and Photonics; 2003. p. 607–619.
37. Ranjani JJ, Thiruvengadam S. Dual-tree complex wavelet transform based SAR despeckling using interscale dependence. IEEE Transactions on Geoscience and Remote Sensing. 2010;48(6):2723–2731.
38. Raj VNP, Venkateswarlu T. Denoising of medical images using dual tree complex wavelet transform. Procedia Technology. 2012;4:238–244.
39. Coupé P, Manjón JV, Gedamu E, Arnold D, Robles M, Collins DL. Robust Rician noise estimation for MR images. Medical image analysis. 2010;14(4):483–493. pmid:20417148
40. Fierro M, Kyung WJ, Ha YH. Dual-tree complex wavelet transform based denoising for random spray image enhancement methods. In: Conference on Colour in Graphics, Imaging, and Vision. vol. 2012. Society for Imaging Science and Technology; 2012. p. 194–199.
41. Varsha A, Basu P. An improved dual tree complex wavelet transform based image denoising using GCV thresholding. In: 2014 First International Conference on Computational Systems and Communications (ICCSC). IEEE; 2014. p. 133–138.
42. ur Rehman N, Abbas SZ, Asif A, Javed A, Naveed K, Mandic DP. Translation invariant multi-scale signal denoising based on goodness-of-fit tests. Signal Processing. 2017;131:220–234.
43. Naveed K, Shaukat B, ur Rehman N. Signal denoising based on dual tree complex wavelet transform and goodness of fit test. In: 2017 22nd International Conference on Digital Signal Processing (DSP). IEEE; 2017. p. 1–5.
44. Naveed K, Shaukat B, ur Rehman N. Dual tree complex wavelet transform-based signal denoising method exploiting neighbourhood dependencies and goodness-of-fit test. Royal Society open science. 2018;5(9):180436. pmid:30839740
45. ur Rehman N, Naveed K, Ehsan S, McDonald-Maier K. Multi-scale image denoising based on goodness of fit (GOF) tests. In: 2016 24th European Signal Processing Conference (EUSIPCO). IEEE; 2016. p. 1548–1552.
46. Naveed K, Ehsan S, McDonald-Maier KD, ur Rehman N. A Multiscale Denoising Framework Using Detection Theory with Application to Images from CMOS/CCD Sensors. Sensors. 2019;19(1):206.
47. D’Agostino RB. Goodness-of-fit-techniques. vol. 68. CRC press; 1986.
48. Stephens MA. EDF statistics for goodness of fit and some comparisons. Journal of the American statistical Association. 1974;69(347):730–737.
49. Plackett RL. Karl Pearson and the chi-squared test. International Statistical Review/Revue Internationale de Statistique. 1983; p. 59–72.
50. Shapiro S, Wilk M. An analysis of variance test for normality. Biometrika. 1965;52(3):591–611.
51. Anderson TW, Darling DA. A test of goodness of fit. Journal of the American statistical association. 1954;49(268):765–769.
52. Cramér H. On the composition of elementary errors: First paper: Mathematical deductions. Scandinavian Actuarial Journal. 1928;1928(1):13–74.
53. Ingster Y, Suslina IA. Nonparametric goodness-of-fit testing under Gaussian models. vol. 169. Springer Science & Business Media; 2012.
54. Yucek T, Arslan H. A survey of spectrum sensing algorithms for cognitive radio applications. IEEE communications surveys & tutorials. 2009;11(1):116–130.
55. Azim AW, Khalid SS, Abrar S. Statistical Spectrum sensing in cognitive radio. In: 2012 10th International Conference on Frontiers of Information Technology. IEEE; 2012. p. 145–152.
56. Wang H, Yang EH, Zhao Z, Zhang W. Spectrum sensing in cognitive radio using goodness of fit testing. IEEE Transactions on Wireless Communications. 2009;8(11):5427–5430.
57. Pearson ES, Hartley HO. Biometrika tables for statisticians. 1966.
58. Kingsbury N. The dual-tree complex wavelet transform: a new efficient tool for image restoration and enhancement. In: 9th European Signal Processing Conference (EUSIPCO 1998). IEEE; 1998. p. 1–4.
59. Zhang L, Zhang L, Mou X, Zhang D. FSIM: A feature similarity index for image quality assessment. IEEE transactions on Image Processing. 2011;20(8):2378–2386. pmid:21292594
60. Rehman N, Ehsan S, Abdullah S, Akhtar M, Mandic D, McDonald-Maier K, et al. Multi-scale pixel-based image fusion using multivariate empirical mode decomposition. Sensors. 2015;15(5):10923–10947. pmid:26007714
61. Kingsbury N. Image processing with complex wavelets. Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 1999;357(1760):2543–2560.

[ref1] 1. Wiener N, of Technology (Cambridge MMI. Extrapolation, interpolation, and smoothing of stationary time series: with engineering applications. Technology Press; 1950.

[ref2] 2. Donoho DL, Johnstone IM. Minimax estimation via wavelet shrinkage. The annals of Statistics. 1998;26(3):879–921.
View Article
Google Scholar

[3] View Article

[4] Google Scholar

[ref3] 3. Mallat S. A wavelet tour of signal processing. Elsevier; 1999.

[ref4] 4. Mallat SG. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Transactions on Pattern Analysis & Machine Intelligence. 1989;(7):674–693.
View Article
Google Scholar

[7] View Article

[8] Google Scholar

[ref5] 5. Cohen A, Kovacevic J. Wavelets: The mathematical background. Proceedings of the IEEE. 1996;84(4):514–522.
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref6] 6. Daubechies I. Ten lectures on wavelets. vol. 61. Siam; 1992.

[ref7] 7. Donoho DL, Johnstone JM. Ideal spatial adaptation by wavelet shrinkage. biometrika. 1994;81(3):425–455.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref8] 8. Donoho DL. De-noising by soft-thresholding. IEEE transactions on information theory. 1995;41(3):613–627.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref9] 9. Donoho DL, Johnstone IM, Kerkyacharian G, Picard D. Wavelet shrinkage: asymptopia? Journal of the Royal Statistical Society Series B (Methodological). 1995; p. 301–369.
View Article
Google Scholar

[20] View Article

[21] Google Scholar

[ref10] 10. Donoho DL, Johnstone IM. Adapting to unknown smoothness via wavelet shrinkage. Journal of the american statistical association. 1995;90(432):1200–1224.
View Article
Google Scholar

[23] View Article

[24] Google Scholar

[ref11] 11. Hao H, Wang H, Rehman N. A joint framework for multivariate signal denoising using multivariate empirical mode decomposition. Signal Processing. 2017;135:263–273.
View Article
Google Scholar

[26] View Article

[27] Google Scholar

[ref12] 12. Blu T, Luisier F. The SURE-LET approach to image denoising. IEEE Transactions on Image Processing. 2007;16(11):2778–2786. pmid:17990754

[ref13] 13. Chang SG, Yu B, Vetterli M. Adaptive wavelet thresholding for image denoising and compression. IEEE transactions on image processing. 2000;9(9):1532–1546. pmid:18262991

[ref14] 14. Figueiredo MA, Nowak RD. Wavelet-based image estimation: an empirical Bayes approach using Jeffreys’ noninformative prior. IEEE Transactions on Image Processing. 2001;10(9):1322–1331. pmid:18255547

[ref15] 15. Remenyi N, Nicolis O, Nason G, Vidakovic B. Image denoising with 2D scale-mixing complex wavelet transforms. IEEE Transactions on Image Processing. 2014;23(12):5165–5174. pmid:25312931

[ref16] 16. Cai TT, Silverman BW. Incorporating information on neighbouring coefficients into wavelet estimation. Sankhyā: The Indian Journal of Statistics, Series B. 2001; p. 127–148.

[ref17] 17. Chen G, Bui TD, Krzyżak A. Image denoising with neighbour dependency and customized wavelet and threshold. Pattern recognition. 2005;38(1):115–124.

[ref18] 18. Dengwen Z, Wengang C. Image denoising with an optimal threshold and neighbouring window. Pattern Recognition Letters. 2008;29(11):1694–1697.

[ref19] 19. Tavakoli A, Pourmohammad A. Image denoising based on compressed sensing. International Journal of Computer Theory and Engineering. 2012;4(2):266.

[ref20] 20. Elad M, Aharon M. Image denoising via sparse and redundant representations over learned dictionaries. IEEE Transactions on Image processing. 2006;15(12):3736–3745. pmid:17153947

[ref21] 21. Chatterjee P, Milanfar P. Clustering-based denoising with locally learned dictionaries. IEEE transactions on Image Processing. 2009;18(7):1438–1451. pmid:19447711

[ref22] 22. Dong W, Li X, Zhang L, Shi G. Sparsity-based image denoising via dictionary learning and structural clustering. In: Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on. IEEE; 2011. p. 457–464.

[ref23] 23. Li H, Liu F. Image denoising via sparse and redundant representations over learned dictionaries in wavelet domain. In: 2009 Fifth International Conference on Image and Graphics. IEEE; 2009. p. 754–758.

[ref24] 24. Deledalle CA, Salmon J, Dalalyan AS. Image denoising with patch based PCA: local versus global. In: BMVC. vol. 81; 2011. p. 425–455.

[ref25] 25. Dabov K, Foi A, Katkovnik V, Egiazarian K. Image denoising with block-matching and 3D filtering. In: Image Processing: Algorithms and Systems, Neural Networks, and Machine Learning. vol. 6064. International Society for Optics and Photonics; 2006. p. 606414.

[ref26] 26. Beck A, Teboulle M. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM journal on imaging sciences. 2009;2(1):183–202.

[ref27] 27. Shi Y, Chang Q. Efficient algorithm for isotropic and anisotropic total variation deblurring and denoising. Journal of Applied Mathematics. 2013;2013.

[ref28] 28. Zosso D, Bustin A. A primal-dual projected gradient algorithm for efficient Beltrami regularization. Computer Vision and Image Understanding. 2014; p. 14–52.

[ref29] 29. Buades A, Coll B, Morel JM. A non-local algorithm for image denoising. In: Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. vol. 2. IEEE; 2005. p. 60–65.

[ref30] 30. Chen G, Zhu WP. Signal denoising using neighbouring dual-tree complex wavelet coefficients. IET Signal Processing. 2012;6(2):143–147.

[ref31] 31. Portilla J, Strela V, Wainwright MJ, Simoncelli EP. Image denoising using scale mixtures of Gaussians in the wavelet domain. IEEE Transactions on Image processing. 2003;12(11):1338–1351. pmid:18244692

[ref32] 32. Pizurica A, Philips W. Estimating the probability of the presence of a signal of interest in multiresolution single-and multiband image denoising. IEEE Transactions on image processing. 2006;15(3):654–665. pmid:16519352

[ref33] 33. Sendur L, Selesnick IW. Bivariate shrinkage functions for wavelet-based denoising exploiting interscale dependency. IEEE Transactions on signal processing. 2002;50(11):2744–2756.

[ref34] 34. Coifman RR, Donoho DL. Translation-invariant de-noising. In: Wavelets and statistics. Springer; 1995. p. 125–150.

[ref35] 35. Selesnick IW, Baraniuk RG, Kingsbury NC. The dual-tree complex wavelet transform. IEEE signal processing magazine. 2005;22(6):123–151.

[ref36] 36. Selesnick IW, Li KY. Video denoising using 2D and 3D dual-tree complex wavelet transforms. In: Wavelets: Applications in Signal and Image Processing X. vol. 5207. International Society for Optics and Photonics; 2003. p. 607–619.

[ref37] 37. Ranjani JJ, Thiruvengadam S. Dual-tree complex wavelet transform based SAR despeckling using interscale dependence. IEEE Transactions on Geoscience and Remote Sensing. 2010;48(6):2723–2731.

[ref38] 38. Raj VNP, Venkateswarlu T. Denoising of medical images using dual tree complex wavelet transform. Procedia Technology. 2012;4:238–244.

[ref39] 39. Coupé P, Manjón JV, Gedamu E, Arnold D, Robles M, Collins DL. Robust Rician noise estimation for MR images. Medical image analysis. 2010;14(4):483–493. pmid:20417148

[ref40] 40. Fierro M, Kyung WJ, Ha YH. Dual-tree complex wavelet transform based denoising for random spray image enhancement methods. In: Conference on Colour in Graphics, Imaging, and Vision. vol. 2012. Society for Imaging Science and Technology; 2012. p. 194–199.

[ref41] 41. Varsha A, Basu P. An improved dual tree complex wavelet transform based image denoising using GCV thresholding. In: 2014 First International Conference on Computational Systems and Communications (ICCSC). IEEE; 2014. p. 133–138.

[ref42] 42. ur Rehman N, Abbas SZ, Asif A, Javed A, Naveed K, Mandic DP. Translation invariant multi-scale signal denoising based on goodness-of-fit tests. Signal Processing. 2017;131:220–234.

[ref43] 43. Naveed K, Shaukat B, ur Rehman N. Signal denoising based on dual tree complex wavelet transform and goodness of fit test. In: 2017 22nd International Conference on Digital Signal Processing (DSP). IEEE; 2017. p. 1–5.

[ref44] 44. Naveed K, Shaukat B, ur Rehman N. Dual tree complex wavelet transform-based signal denoising method exploiting neighbourhood dependencies and goodness-of-fit test. Royal Society open science. 2018;5(9):180436. pmid:30839740

[ref45] 45. ur Rehman N, Naveed K, Ehsan S, McDonald-Maier K. Multi-scale image denoising based on goodness of fit (GOF) tests. In: 2016 24th European Signal Processing Conference (EUSIPCO). IEEE; 2016. p. 1548–1552.

[ref46] 46. Naveed K, Ehsan S, McDonald-Maier KD, ur Rehman N. A Multiscale Denoising Framework Using Detection Theory with Application to Images from CMOS/CCD Sensors. Sensors. 2019;19(1):206.

[ref47] 47. D’Agostino RB. Goodness-of-fit-techniques. vol. 68. CRC press; 1986.

[ref48] 48. Stephens MA. EDF statistics for goodness of fit and some comparisons. Journal of the American statistical Association. 1974;69(347):730–737.

[ref49] 49. Plackett RL. Karl Pearson and the chi-squared test. International Statistical Review/Revue Internationale de Statistique. 1983; p. 59–72.

[ref50] 50. Shapiro S, Wilk M. An analysis of variance test for normality. Biometrika. 1965;52(3):591–611.

[ref51] 51. Anderson TW, Darling DA. A test of goodness of fit. Journal of the American statistical association. 1954;49(268):765–769.

[ref52] 52. Cramér H. On the composition of elementary errors: First paper: Mathematical deductions. Scandinavian Actuarial Journal. 1928;1928(1):13–74.

[ref53] 53. Ingster Y, Suslina IA. Nonparametric goodness-of-fit testing under Gaussian models. vol. 169. Springer Science & Business Media; 2012.

[ref54] 54. Yucek T, Arslan H. A survey of spectrum sensing algorithms for cognitive radio applications. IEEE communications surveys & tutorials. 2009;11(1):116–130.

[ref55] 55. Azim AW, Khalid SS, Abrar S. Statistical Spectrum sensing in cognitive radio. In: 2012 10th International Conference on Frontiers of Information Technology. IEEE; 2012. p. 145–152.

[ref56] 56. Wang H, Yang EH, Zhao Z, Zhang W. Spectrum sensing in cognitive radio using goodness of fit testing. IEEE Transactions on Wireless Communications. 2009;8(11):5427–5430.

[ref57] 57. Pearson ES, Hartley HO. Biometrika tables for statisticians. 1966.

[ref58] 58. Kingsbury N. The dual-tree complex wavelet transform: a new efficient tool for image restoration and enhancement. In: 9th European Signal Processing Conference (EUSIPCO 1998). IEEE; 1998. p. 1–4.

[ref59] 59. Zhang L, Zhang L, Mou X, Zhang D. FSIM: A feature similarity index for image quality assessment. IEEE transactions on Image Processing. 2011;20(8):2378–2386. pmid:21292594

[ref60] 60. Rehman N, Ehsan S, Abdullah S, Akhtar M, Mandic D, McDonald-Maier K, et al. Multi-scale pixel-based image fusion using multivariate empirical mode decomposition. Sensors. 2015;15(5):10923–10947. pmid:26007714

[ref61] 61. Kingsbury N. Image processing with complex wavelets. Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 1999;357(1760):2543–2560.

Supporting information

S1 Fig. Standard Input test images (a) Lena (b) Barbara (c) Peppers (d) Plane (e) Cameraman (g) Brain MRI.
