
A Retinex-based network for image enhancement in low-light environments

  • Ji Wu ,

    Contributed equally to this work with: Ji Wu, Bing Ding

    Roles Conceptualization, Formal analysis, Methodology, Software, Writing – original draft

    Affiliation School of Electrical and Power Engineering, Taiyuan University of Technology, Taiyuan, China

  • Bing Ding ,

    Contributed equally to this work with: Ji Wu, Bing Ding

    Roles Project administration, Visualization, Writing – original draft

    Affiliation School of Electrical and Power Engineering, Taiyuan University of Technology, Taiyuan, China

  • Beining Zhang,

    Roles Investigation, Validation, Writing – review & editing

    Affiliation School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing, China

  • Jie Ding

    Roles Funding acquisition, Investigation, Project administration, Supervision, Validation, Writing – review & editing

    jie.ding@bit.edu.cn

    Affiliation School of Integrated Circuits and Electronics, Beijing Institute of Technology, Beijing, China

Abstract

Most existing low-light image enhancement methods suffer from detail loss, color distortion and excessive noise. To address these issues, this paper proposes a Retinex-based low-light image enhancement network. The network is divided into three parts: a decomposition network, a reflection component denoising network, and an illumination component enhancement network. In the decomposition network, the input image is decomposed into a reflection image and an illumination image. In the reflection component denoising network, a Unet3+ network augmented with coordinate attention (CA) is adopted to denoise the reflection image. In the illumination component enhancement network, an adaptive mapping curve is applied iteratively to enhance the illumination image. Finally, the processed illumination and reflection images are fused based on Retinex theory to obtain the final enhanced image. The experimental results show that the proposed network achieves excellent visual quality in subjective evaluation. It also shows a significant improvement in objective evaluation metrics, including PSNR, SSIM, and NIQE, compared with existing methods on several public datasets.

Introduction

Images play an irreplaceable role in our daily life as a way to obtain information [1]. However, complicated shooting environments, varying lighting conditions and other factors lead to unsatisfactory image acquisition: uneven illumination, low contrast, and large amounts of noise. These defects interfere with image recognition in subsequent processing. Low-light image enhancement technology can make images clearer and reduce recognition costs, so research on image enhancement methods in low-light environments is of great significance.

Existing enhancement methods can be classified into two main categories: traditional image enhancement methods and deep learning-based image enhancement methods. Traditional image enhancement methods are mainly used in industry. The main representative methods include grey-scale transformation, histogram equalization, and the Retinex method [2–7]. Among them, enhancement methods based on Retinex theory are widely used, such as [5–7]. However, traditional methods handle noise poorly and often result in color distortion and unsatisfactory denoising.

Image enhancement methods based on deep learning have developed rapidly in recent years. LLNet was the first application of deep learning to low-light image enhancement [8]. Afterwards, Retinex-Net applied deep neural networks to Retinex theory for image enhancement [9]. However, these methods assumed that "Ground Truth" images exist and therefore ignored the influence of noise on different regions, resulting in poor detail restoration. Methods such as [10–12] avoided the need for "Ground Truth" reflectance and illumination images, but they overlooked detail optimization, such as structure and texture. To reduce reliance on paired datasets, some unsupervised methods were proposed. Zero-DCE became the first low-light enhancement network that operated independently of paired datasets [13]. EnlightenGAN further minimized reliance on paired data [14]. However, due to the lack of guidance from paired datasets, unsupervised methods were unable to effectively learn real-world scene features and had limited generalization capability. Other methods [15–21] attempted to address unrealistic recovery effects and complex network scales by introducing new learning modules and attention mechanisms. Nevertheless, these methods had limitations in their experimental outcomes and lacked robust generalization. For example, in different scenarios, the restoration effects of [13–16, 18, 21] were unstable, and the models lacked constraints and guidance for specific scenes. Under strong light conditions, over-enhancement can be observed in [17, 19, 20]. To address these limitations, we propose a new model based on Retinex. Compared to other advanced methods, the method proposed in this paper shows good restoration in complicated conditions such as extreme low light and over-exposure. It also performs well in image denoising and enhancement without requiring a large amount of training data.

This paper is arranged as follows. The Proposed Network section introduces the structure of the proposed enhancement network. The experimental datasets and training details are presented in the Experimental Process section. The Results and Analysis section analyses the experimental results of different methods. The Conclusions section draws the conclusions.

Proposed network

In this paper, a low-light image enhancement network combining Retinex theory [4] and a convolutional neural network is designed. The principle of Retinex is shown in Fig 1.

According to the Retinex theory, the illumination image represents the lighting conditions and the reflection image represents the texture information of the object. The enhancement of the original image is achieved by multiplying the illumination image and the reflection image. This relationship is expressed by Eq (1).

$S(x,y) = R(x,y) \cdot L(x,y)$ (1)

S(x,y) represents the image information received by the observer. L(x,y) represents the illumination component of the light. R(x,y) represents the reflection component of the object.
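
As a minimal sketch of Eq (1) in PyTorch (the function and variable names are illustrative, not the actual interfaces of this work):

```python
import torch

def retinex_compose(reflectance: torch.Tensor, illumination: torch.Tensor) -> torch.Tensor:
    """Fuse the components per Eq (1): S(x,y) = R(x,y) * L(x,y).

    reflectance:  (B, 3, H, W) tensor in [0, 1], texture/color of the scene.
    illumination: (B, 1, H, W) tensor in [0, 1], broadcast across color channels.
    """
    return reflectance * illumination  # element-wise product
```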

Based on the Retinex theory, an image can be decomposed into a reflection component and an illumination component. A network is built for each component, and an additional network is needed to decompose the image. Therefore, the proposed network can be divided into three parts: the decomposition network, the reflection component denoising network, and the illumination component enhancement network. The overall network structure is shown in Fig 2. The specific network design and the corresponding loss function for each sub-network are described below.

Decomposition network

The structure of KinD [10] is adopted in the decomposition network. However, KinD suffers from over-enhancement and visual defects. Therefore, in the first and third convolutional layers, the original ReLU activation function is replaced by GELU, which exhibits stable optimization and excellent generalization. Compared to ReLU, GELU can better capture complex relationships in image data, helping to preserve the structural and textural information of the image. The smoothness of the GELU function reduces issues such as gradient explosion or vanishing, resulting in superior performance in preserving image detail and handling exposure.
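
A minimal sketch of the activation swap follows; the channel widths and kernel sizes are illustrative assumptions, and only the ReLU-to-GELU replacement in the first and third convolutional layers reflects the modification described above:

```python
import torch.nn as nn

# Illustrative stem of the decomposition network; channel widths and kernel
# sizes are assumptions. Only the ReLU -> GELU swap in the 1st and 3rd
# convolutional layers reflects the modification described above.
decom_stem = nn.Sequential(
    nn.Conv2d(3, 32, kernel_size=3, padding=1),
    nn.GELU(),                                    # replaces ReLU (1st conv layer)
    nn.Conv2d(32, 64, kernel_size=3, padding=1),
    nn.ReLU(inplace=True),                        # 2nd layer keeps its activation
    nn.Conv2d(64, 64, kernel_size=3, padding=1),
    nn.GELU(),                                    # replaces ReLU (3rd conv layer)
)
```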

Furthermore, to improve the accuracy of the network, a structural similarity (SSIM) loss function [22] is added. SSIM is a metric for measuring image quality, mainly used to assess the structural similarity between two images. The SSIM loss covers three aspects of image features: brightness, contrast and structure. By minimizing the SSIM loss, the decomposed image is made closer to the original image while maintaining better perceptual quality. The details of the decomposition network are shown in Fig 3.

Five loss functions are used in the decomposition network: the reconstruction loss, the reflection consistency loss, the illumination smoothing loss, the illumination intercorrelation loss, and the structural similarity loss. The details of these loss functions are given below.

The reconstruction loss is
$L_{rec} = \|R_l \cdot I_l - S_l\|_1 + \|R_h \cdot I_h - S_h\|_1$ (2)

Sl and Sh denote the low-light image and the normal-light image, respectively. Rl and Rh denote the reflection component from the decomposition of the low-light image and the normal-light image, respectively. Il and Ih denote the illumination component from the decomposition of the low-light image and the normal-light image, respectively.

The reflection consistency loss is
$L_{rs} = \|R_l - R_h\|_1$ (3)

The illumination component smoothing loss is
$L_{is} = \left\| \frac{\nabla I_l}{\max(|\nabla R_l|, \epsilon)} \right\|_1 + \left\| \frac{\nabla I_h}{\max(|\nabla R_h|, \epsilon)} \right\|_1$ (4)

∇ is the first-order derivative operator. ϵ is a constant, here it is set to 0.01.

The illumination intercorrelation loss is
$L_{mc} = \| G \cdot \exp(-c \cdot G) \|_1, \quad G = |\nabla I_l| + |\nabla I_h|$ (5)

c is the parameter that controls the shape of the function, here set to 10. G represents the sum of the gradient magnitudes of the two illumination components.

The structural similarity loss is
$L_{SSIM} = 1 - \mathrm{SSIM}(S_{out}, S_h)$ (6)
$\mathrm{SSIM}(S_{out}, S_h) = \frac{(2\mu_{out}\mu_h + c_1)(2\sigma_{out,h} + c_2)}{(\mu_{out}^2 + \mu_h^2 + c_1)(\sigma_{out}^2 + \sigma_h^2 + c_2)}$ (7)

Sout and Sh denote the output image and the normal-light image, respectively. μout and μh denote the mean values of the output image and the normal-light image, σout and σh their standard deviations, and σout,h their covariance. c1 and c2 are constants.
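
A compact PyTorch sketch of Eqs (6) and (7) follows; it takes the local statistics over a uniform window, whereas common implementations use a Gaussian window, so treat it as illustrative:

```python
import torch
import torch.nn.functional as F

def ssim_loss(out: torch.Tensor, ref: torch.Tensor,
              c1: float = 0.01 ** 2, c2: float = 0.03 ** 2,
              window: int = 11) -> torch.Tensor:
    """L_SSIM = 1 - SSIM(S_out, S_h), Eqs (6)-(7), with local statistics taken
    over a uniform window (a Gaussian window is the more common choice)."""
    pad = window // 2
    mu_x = F.avg_pool2d(out, window, 1, pad)                        # local means
    mu_y = F.avg_pool2d(ref, window, 1, pad)
    var_x = F.avg_pool2d(out * out, window, 1, pad) - mu_x ** 2     # local variances
    var_y = F.avg_pool2d(ref * ref, window, 1, pad) - mu_y ** 2
    cov_xy = F.avg_pool2d(out * ref, window, 1, pad) - mu_x * mu_y  # local covariance
    ssim_map = ((2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)) / \
               ((mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2))
    return 1.0 - ssim_map.mean()
```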

The total loss function of the image decomposition network is
$L_{decom} = \lambda_{rec} L_{rec} + \lambda_{rs} L_{rs} + \lambda_{is} L_{is} + \lambda_{mc} L_{mc} + \lambda_{SSIM} L_{SSIM}$ (8)

λrec, λrs, λis, λmc, and λSSIM are the weighting coefficients for the reconstruction loss, reflection consistency loss, illumination smoothing loss, illumination intercorrelation loss, and structural similarity loss, respectively. They are set to 1, 0.009, 0.2, 0.15 and 0.07, respectively.
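
Eq (8) reduces to a weighted sum. In the sketch below, the individual loss terms are hypothetical precomputed values (e.g., from functions like the one above); only the weighting coefficients come from this section:

```python
# Hypothetical precomputed loss terms (l_rec, l_rs, l_is, l_mc, l_ssim);
# only the weighting coefficients below come from the paper.
DECOM_WEIGHTS = {"rec": 1.0, "rs": 0.009, "is": 0.2, "mc": 0.15, "ssim": 0.07}

def decomposition_loss(l_rec, l_rs, l_is, l_mc, l_ssim):
    """Total decomposition-network loss, Eq (8): weighted sum of the five terms."""
    w = DECOM_WEIGHTS
    return (w["rec"] * l_rec + w["rs"] * l_rs + w["is"] * l_is +
            w["mc"] * l_mc + w["ssim"] * l_ssim)
```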

We compare the effects before and after adding the SSIM loss function to KinD. The experimental results are shown in Fig 4. Although a muddy shadow remains in some areas after the SSIM loss function is added, the overall effect is better, and the outcome aligns more closely with the visual perception of the human eye.

Fig 5 shows the effect of replacing the ReLU function with the GELU function in the decomposition network. Fig 5(B) shows that the reflection image is blurry and severely color-distorted when using ReLU. In contrast, in Fig 5(C), using GELU yields more realistic colors and less noise. Fig 5(D) shows overexposure in the illumination image when using ReLU, whereas Fig 5(E) shows a clearer image when using GELU.

Fig 5. Comparison of ReLU function and GELU function in decomposition network.

https://doi.org/10.1371/journal.pone.0303696.g005

Reflective component denoising network

When the low-light image passes through the decomposition network, the reflection image retains the detail information, but the noise in the low-light regions is amplified at the same time. Therefore, the decomposed reflection image must be denoised. The structure of Unet3+ [23] is adopted in the reflective component denoising network. However, Unet3+ does not account for the size of the objects being extracted, which results in a mismatch between the receptive field and the object scale and limits its denoising performance. Therefore, coordinate attention (CA) [24] is added to the encoder part of Unet3+. CA attention combines channel attention with spatial position information to better capture direction and position cues. It helps the network adaptively learn the noise models of different regions in the image and weight the noise estimates, so that the network can more accurately recover the signal and retain more detail. CA attention also provides local attention, letting the network focus on the regions that need processing, and it can be easily inserted into network modules to improve accuracy. The details of the reflective component denoising network are shown in Fig 6.
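
A condensed sketch of a coordinate attention block, following the design in [24], is given below; the reduction ratio and module layout are assumptions based on that paper rather than the exact configuration used here:

```python
import torch
import torch.nn as nn

class CoordAttention(nn.Module):
    """Coordinate attention (Hou et al. [24]) as inserted into the Unet3+
    encoder: direction-aware pooling along H and W, a shared 1x1 transform,
    then per-direction attention maps."""
    def __init__(self, channels: int, reduction: int = 32):
        super().__init__()
        mid = max(8, channels // reduction)
        self.conv1 = nn.Conv2d(channels, mid, 1)
        self.bn = nn.BatchNorm2d(mid)
        self.act = nn.Hardswish()
        self.conv_h = nn.Conv2d(mid, channels, 1)
        self.conv_w = nn.Conv2d(mid, channels, 1)

    def forward(self, x):
        b, c, h, w = x.shape
        x_h = x.mean(dim=3, keepdim=True)                      # (B,C,H,1): pool over width
        x_w = x.mean(dim=2, keepdim=True).permute(0, 1, 3, 2)  # (B,C,W,1): pool over height
        y = self.act(self.bn(self.conv1(torch.cat([x_h, x_w], dim=2))))
        y_h, y_w = torch.split(y, [h, w], dim=2)
        a_h = torch.sigmoid(self.conv_h(y_h))                      # (B,C,H,1) attention
        a_w = torch.sigmoid(self.conv_w(y_w.permute(0, 1, 3, 2)))  # (B,C,1,W) attention
        return x * a_h * a_w  # reweight features by position along both axes
```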

Fig 6. Structure of the reflective component denoising network.

https://doi.org/10.1371/journal.pone.0303696.g006

Two loss functions are used in the reflective component denoising network: the multi-scale structural similarity loss and the detail loss. The details of these loss functions are given below.

The multi-scale structural similarity loss Lms−ssim is
$L_{ms\text{-}ssim} = 1 - \prod_{m=1}^{M} \left( \frac{2\mu_p \mu_g + C_1}{\mu_p^2 + \mu_g^2 + C_1} \right)^{\beta_m} \left( \frac{2\sigma_{pg} + C_2}{\sigma_p^2 + \sigma_g^2 + C_2} \right)^{\gamma_m}$ (9)

M denotes the total number of scales, here it is set to 2. μp and μg denote the mean of the denoised and normal-light reflectance images, respectively. σp and σg denote the standard deviation of the denoised and normal-light reflectance images, respectively. C1 and C2 are constants. σpg denotes the covariance of the denoised and normal-light reflectance images. Both the βm and γm components are set to 0.2856.

The detail loss Lpar is
$L_{par} = \|R_h - R_L\|_1$ (10)

Rh denotes the reflectance image of the normal-light image. RL denotes the denoised reflectance image. ‖·‖1 denotes the L1 norm constraint on their difference.

The total loss function of the reflectance component denoising network is
$L_{denoise} = \lambda_{ms\text{-}ssim} L_{ms\text{-}ssim} + \lambda_{par} L_{par}$ (11)

λms−ssim and λpar are the weighting coefficients of the multi-scale structural similarity loss and detail loss, respectively. λms−ssim is set to 1 and λpar is set to 0.009.
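
A sketch of the total denoising loss, Eq (11), is shown below; it relies on the third-party pytorch-msssim package for the multi-scale term and leaves the scale count and weights at the package defaults rather than the M = 2, 0.2856 setting described above:

```python
import torch
from pytorch_msssim import ms_ssim  # third-party: pip install pytorch-msssim

def denoising_loss(pred_r: torch.Tensor, ref_r: torch.Tensor,
                   w_msssim: float = 1.0, w_par: float = 0.009) -> torch.Tensor:
    """Total denoising-network loss, Eq (11): the MS-SSIM term of Eq (9) plus
    the L1 detail term of Eq (10). Scale count/weights are left at the package
    defaults here rather than the M = 2, 0.2856 setting described above."""
    l_msssim = 1.0 - ms_ssim(pred_r, ref_r, data_range=1.0)
    l_par = torch.mean(torch.abs(ref_r - pred_r))  # ||R_h - R_L||_1, mean-reduced
    return w_msssim * l_msssim + w_par * l_par
```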

We compare the effects before and after adding CA attention to Unet3+. The experimental results are shown in Fig 7. The image recovered by Unet3+ alone is brighter, but it suffers from increased blurriness and noise. Although adding CA attention reduces image brightness slightly, it significantly improves the denoising effect, and the colors in the image are fuller and more realistic.

Illumination component enhancement network

The illumination image represents the distribution of light in the image. Zero-DCE is lightweight and offers excellent brightness enhancement. Although Zero-DCE's own denoising effect is ordinary, the reflection component denoising network proposed in this paper already provides good denoising. Therefore, Zero-DCE is adopted in the illumination component enhancement network. Zero-DCE improves the curve fit through multiple iterations, and the number of iterations n affects the experimental results. Therefore, in this work the best iteration number n is determined through multiple experiments. The details of the illumination component enhancement network are shown in Fig 8.
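
The iterative mapping applied to the illumination component can be sketched as follows; the function and argument names are illustrative, with the curve-parameter maps assumed to come from the Zero-DCE estimation network:

```python
import torch

def enhance_illumination(illum, curve_maps):
    """Iteratively apply the Zero-DCE quadratic mapping curve:
        LE_n(x) = LE_{n-1}(x) + A_n * LE_{n-1}(x) * (1 - LE_{n-1}(x))
    Each element of curve_maps is a per-pixel parameter map A_n predicted by
    the estimation network; len(curve_maps) is the iteration number n (6 here).
    """
    out = illum
    for a in curve_maps:
        out = out + a * out * (1.0 - out)  # one curve application
    return out

# Usage sketch: six all-zero maps leave the illumination unchanged.
i = torch.rand(1, 1, 64, 64)
assert torch.allclose(enhance_illumination(i, [torch.zeros_like(i)] * 6), i)
```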

Fig 8. Structure of the illumination component enhancement network.

https://doi.org/10.1371/journal.pone.0303696.g008

Four loss functions are used in the illumination component enhancement network: the exposure control loss, the color constancy loss, the illumination smoothing loss, and the spatial consistency loss. The details of these loss functions are given below.

The exposure control loss Lexp is
$L_{exp} = \frac{1}{M} \sum_{k=1}^{M} |Y_k - E|$ (12)

E is a constant, set to 0.6. M is the total number of non-overlapping local regions. Yk is the average intensity of the k-th region.

The color constancy loss Lcol is
$L_{col} = \sum_{(p,q) \in \varepsilon} (J^p - J^q)^2, \quad \varepsilon = \{(R,G),(R,B),(G,B)\}$ (13)

Jp and Jq are the luminance averages of color channels p and q, respectively. (p,q) traverses all pairwise combinations of the three color channels.

The illumination smoothing loss LtvA is
$L_{tvA} = \frac{1}{N} \sum_{n=1}^{N} \sum_{c \in \{R,G,B\}} \left( |\nabla_x A_n^c| + |\nabla_y A_n^c| \right)^2$ (14)

N denotes the iteration times. A_n^c is the curve parameter map of channel c at iteration n. ∇x and ∇y denote the gradient operators in the horizontal and vertical directions, respectively.

The spatial consistency loss Lspa is
$L_{spa} = \frac{1}{K} \sum_{i=1}^{K} \sum_{j \in \Omega(i)} \left( |Y_i - Y_j| - |I_i - I_j| \right)^2$ (15)

Y denotes the pixel value after enhancement. I denotes the pixel value before enhancement. K is the number of local regions, and Ω(i) is the set of regions neighboring region i.

The total loss function of the illumination component enhancement network is
$L_{enhance} = W_{exp} L_{exp} + W_{col} L_{col} + W_{tvA} L_{tvA} + W_{spa} L_{spa}$ (16)

Wexp, Wcol, WtvA, and Wspa are the weighting factors for the exposure control loss, color constancy loss, illumination smoothing loss, and spatial consistency loss, respectively. They are set to 10, 5, 200 and 1, respectively.
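
Two of the four terms, Eqs (12) and (13), are simple enough to sketch directly, together with the weighted total of Eq (16); the remaining two terms are assumed computed elsewhere:

```python
import torch
import torch.nn.functional as F

def exposure_loss(y: torch.Tensor, e: float = 0.6, patch: int = 16) -> torch.Tensor:
    """Exposure control loss, Eq (12): mean |Y_k - E| over non-overlapping regions."""
    y_k = F.avg_pool2d(y.mean(dim=1, keepdim=True), patch)  # region-mean intensity
    return torch.mean(torch.abs(y_k - e))

def color_constancy_loss(y: torch.Tensor) -> torch.Tensor:
    """Color constancy loss, Eq (13): squared differences of channel-mean pairs."""
    j = y.mean(dim=(2, 3))  # (B, 3) per-channel luminance averages
    r, g, b = j[:, 0], j[:, 1], j[:, 2]
    return ((r - g) ** 2 + (r - b) ** 2 + (g - b) ** 2).mean()

def enhancement_loss(l_exp, l_col, l_tva, l_spa):
    """Weighted total of Eq (16) with the weights given above."""
    return 10 * l_exp + 5 * l_col + 200 * l_tva + 1 * l_spa
```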

The results of different iteration numbers in the illumination component enhancement network are shown in Table 1 and Fig 9. Table 1 shows that the best results are achieved when n is 6. Fig 9 also verifies that when n is 6 the image best matches human subjective visual perception.

Fig 9. Comparison results of different iteration times n.

https://doi.org/10.1371/journal.pone.0303696.g009

Table 1. Comparison results of different iteration times n.

https://doi.org/10.1371/journal.pone.0303696.t001

Experimental process

Experimental data sets

In the training process, 485 image pairs from the LOL dataset are used as the training set, and the remaining 15 pairs are used as the test set. To verify the model's effectiveness, the paired datasets VE-LOL-L, SID and ELD are used as additional test sets. The unpaired datasets DICM and MEF are also used as test sets.

Details of training process

The experiments are carried out under the PyTorch 1.10.1 framework, with Python 3.7 and CUDA 11.1. The Adam optimizer is used in the training process. Low-light image enhancement is accomplished by the proposed three sub-networks. The experimental details and steps are illustrated below.

  1. Input the normal-light image and the low-light image into the decomposition network for decomposition. The learning rate of the decomposition network is set to 0.004. The batch size is set to 32.
  2. Input the decomposed reflection image into the reflection component denoising network for denoising. The learning rate of the reflection component denoising network is set to 0.001. The batch size is set to 1.
  3. Input the decomposed illumination image into the illumination component enhancement network for enhancement. The learning rate of the illumination component enhancement network is set to 0.001. The iteration times n is set to 6. The batch size is set to 8.
  4. Multiply the denoised reflection image R(x,y) and the enhanced illumination image L(x,y) to obtain the final image (see the sketch below).
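
The four steps can be condensed into a short inference sketch; the sub-network objects and their call signatures are hypothetical placeholders rather than the actual module interfaces of this work:

```python
import torch

@torch.no_grad()
def enhance(low, decom_net, denoise_net, enhance_net):
    """End-to-end inference following steps 1-4; the three sub-network objects
    and their call signatures are hypothetical placeholders."""
    r, i = decom_net(low)       # step 1: decompose into reflectance and illumination
    r_clean = denoise_net(r)    # step 2: denoise the reflection component
    i_bright = enhance_net(i)   # step 3: enhance the illumination component (n = 6)
    return r_clean * i_bright   # step 4: Retinex fusion, Eq (1)
```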

Results and analysis

Both subjective and objective evaluations are employed to assess the effects of image enhancement. To validate the necessity of each sub-network, we have conducted ablation experiments. In the objective evaluation, various representative metrics are used: peak signal-to-noise ratio (PSNR), structural similarity (SSIM), and no-reference metrics such as the Natural Image Quality Evaluator (NIQE), Perceptual Index (PI), Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE), and Neural Image Assessment (NIMA).

Subjective visual evaluation

In the subjective visual evaluation, KinD, Retinex-Net, SCI, URetinex-Net, and Zero-DCE are selected for comparison. The final results on the test sets are shown in Figs 10–15.

Fig 10. Comparison results of different methods on LOL test set.

https://doi.org/10.1371/journal.pone.0303696.g010

Fig 11. Comparison results of different methods on VE-LOL-L test set.

https://doi.org/10.1371/journal.pone.0303696.g011

Fig 12. Comparison results of different methods on DICM test set.

https://doi.org/10.1371/journal.pone.0303696.g012

Fig 13. Comparison results of different methods on MEF test set.

https://doi.org/10.1371/journal.pone.0303696.g013

Fig 14. Comparison results of different methods on SID test set.

https://doi.org/10.1371/journal.pone.0303696.g014

Fig 15. Comparison results of different methods on ELD test set.

https://doi.org/10.1371/journal.pone.0303696.g015

From Figs 10–15, it can be seen that the proposed network and the other five methods are all able to enhance low-light images. However, each method has its own disadvantages. Figs 10(C) and 13(C) show that the Retinex-Net enhanced images are too noisy, with serious distortion in the color recovery, and the overall effect is unsatisfactory. Retinex-Net is very likely to cause serious global color distortion: although some of the noise can be removed, the global color shift is obvious.

Figs 10(B) and 12(B) show overexposure when KinD recovers a brighter light source. As shown in Fig 15(B), the upper part of the image is also over-enhanced. KinD can lead to image over-enhancement or distortion that does not match the actual scene.

As shown in Fig 11(D), SCI produces a dim recovery of the indoor archery image. In Fig 13(D), the SCI result is also darker in the indoor bookshelf area compared with the results of the other methods. As shown in Fig 15(D), the brightness of the black background area is hardly enhanced at all, while in Fig 14(D) it is over-enhanced, differing significantly from the image under normal light. SCI can thus lead to abnormal brightness enhancement in some cases, making the image look unreal.

In Fig 10(F), Zero-DCE recovers the indoor sports stadium image with distorted colors in the upper-right region, and the scene remains dark. Fig 13(F) shows that the bookshelf part (row 3, column 2) exhibits a muddy shadow when Zero-DCE is used to recover the indoor image. The reason is that Zero-DCE can only enhance the image as a whole, not specific areas within it. Under strong reflection and extreme contrast conditions, Zero-DCE performs poorly in image enhancement, as also shown in Fig 12(F).

Using URetinex-Net, Fig 12(E) shows that the clouds in the image are barely recovered; they are covered by over-enhanced white. As shown in Fig 14(E), the details of the two chairs and the water bottle are lost, and the overall visual effect is blurry. URetinex-Net performs Retinex decomposition and reconstruction directly on the low-light image, so it cannot completely recover extremely low-light images. Problems such as distortion remain in recovering image details, which affect image quality and visual effect.

From Figs 10–15, the network proposed in this paper performs better than the other five methods, and its results are more in line with the visual perception of the human eye. This demonstrates the effectiveness and generalization of the proposed network.

Objective metric evaluation

In the objective evaluation, several rigorous objective metrics are used to assess performance comprehensively: PSNR, SSIM, NIQE, PI, BRISQUE, and NIMA. Among these, higher values of PSNR, SSIM, and NIMA indicate better image quality, while lower values of NIQE, PI, and BRISQUE indicate better visual quality. Bold values in the tables represent the best outcomes.

To maximize accuracy, 15 images are selected from the LOL dataset and 10 images each from the VE-LOL-L, SID, ELD, DICM, and MEF datasets as test sets. The average values of the six methods are calculated on the different datasets. The experimental results are shown in Tables 2–7.
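
For reference, the two full-reference metrics can be computed per image pair as in the hedged sketch below (assuming scikit-image 0.19 or later for the channel_axis argument); the reported numbers are then averages of such scores over each test set:

```python
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def full_reference_scores(pred, gt):
    """PSNR and SSIM for one enhanced image against its ground truth.
    Both arrays: (H, W, 3) floats in [0, 1]. Assumes scikit-image >= 0.19
    for the channel_axis argument."""
    psnr = peak_signal_noise_ratio(gt, pred, data_range=1.0)
    ssim = structural_similarity(gt, pred, channel_axis=-1, data_range=1.0)
    return psnr, ssim
```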

Table 2. Comparison results of different methods on LOL dataset.

https://doi.org/10.1371/journal.pone.0303696.t002

Table 3. Comparison results of different methods on VE-LOL-L dataset.

https://doi.org/10.1371/journal.pone.0303696.t003

Table 4. Comparison results of different methods on SID dataset.

https://doi.org/10.1371/journal.pone.0303696.t004

Table 5. Comparison results of different methods on ELD dataset.

https://doi.org/10.1371/journal.pone.0303696.t005

Table 6. Comparison results of different methods on DICM dataset.

https://doi.org/10.1371/journal.pone.0303696.t006

Table 7. Comparison results of different methods on MEF dataset.

https://doi.org/10.1371/journal.pone.0303696.t007

The experimental results on the paired datasets LOL and VE-LOL-L are shown in Tables 2 and 3. As shown in Table 2, the proposed network achieves the best values of 22.4568, 0.8243, and 4.7531 in PSNR, SSIM, and NIMA respectively, which are 13.18%, 8.52%, and 3.5% higher than the second-best values. According to Table 3, the proposed network achieves the best values of 21.3067, 0.8943, and 29.3789 in PSNR, SSIM, and BRISQUE respectively, which are 6.75%, 0.85% and 8.9% better than the second-best values.

The experimental results on the paired datasets SID and ELD are presented in Tables 4 and 5. Table 4 shows that in SSIM, PI, BRISQUE, and NIMA, our method improves on the second-best values by 17.04%, 6.48%, 3.59%, and 8.47%. In Table 5, our method improves on the second-best values by 10.48%, 11.88%, 2.97%, and 10.02% in PSNR, SSIM, NIQE, and BRISQUE.

The experimental results on the unpaired datasets DICM and MEF are shown in Tables 6 and 7. From Table 6, it can be seen that the proposed network achieves the best values of 2.1752 and 14.2458 in PI and BRISQUE, respectively, which are 17.74% and 4.68% better than the second-best values. As shown in Table 7, the proposed network achieves the best PI value of 2.5957. Although the metrics other than PI are not optimal, the results achieve a more balanced performance.

Ablation study

To verify the necessity of the method framework proposed in this paper, we conducted ablation experiments by separately removing the denoising network and the enhancement network. The results of the experiments are shown in Fig 16 and Table 8.

Fig 16. Ablation studies on the framework.

‘w/o Denois.’ denotes our method without reflective component denoising network. ‘w/o Enhance.’ denotes our method without illumination component enhancement network.

https://doi.org/10.1371/journal.pone.0303696.g016

Table 8. Comparison of ablation results for different modules.

https://doi.org/10.1371/journal.pone.0303696.t008

From Fig 16(B), it can be observed that when the enhancement sub-network is present but the denoising sub-network is absent, the overall image exhibits excessive noise and unclear details, although there is some improvement in brightness. From Fig 16(C), it can be seen that when the denoising sub-network is present but the enhancement sub-network is absent, the image noise is effectively reduced, but the overall brightness is hardly enhanced, resulting in a dull color appearance. In Fig 16(D), the contrast of the image is effectively improved, with clearer details and noticeable noise reduction. This clearly shows that all the components are important for achieving better performance.

The objective comparisons of the ablation results for each module are presented in Table 8. Removing either the denoising network or the enhancement network leads to relatively poor performance across multiple metrics. In contrast, our proposed network achieves the best results across all metrics, further demonstrating the effectiveness of the method proposed in this paper.

Conclusions

To further improve the effect of low-light image enhancement, a Retinex-based image enhancement network for low-light environments is proposed. A new loss function, a CA attention mechanism, and an adaptive iteration method are introduced in the proposed network. Experiments show that most objective metrics are improved. At the same time, the proposed network has a better denoising effect, and the visual effect is more in line with human visual perception. This demonstrates the effectiveness and generalization of the network proposed in this paper.

References

  1. Wen S, Hu X, Ma J, Sun F, Fang B. Autonomous robot navigation using Retinex algorithm for multiscale image adaptability in low-light environment. Intelligent Service Robotics. 2019;12:359–69.
  2. Srinivas K, Bhandari AK. Low light image enhancement with adaptive sigmoid transfer function. IET Image Processing. 2020;14(4):668–78.
  3. Pizer SM, Amburn EP, Austin JD, Cromartie R, Geselowitz A, Greer T, et al. Adaptive histogram equalization and its variations. Computer Vision, Graphics, and Image Processing. 1987;39(3):355–68.
  4. Land EH. The retinex theory of color vision. Scientific American. 1977;237(6):108–29. pmid:929159
  5. Jobson DJ, Rahman Z-u, Woodell GA. Properties and performance of a center/surround retinex. IEEE Transactions on Image Processing. 1997;6(3):451–62. pmid:18282940
  6. Rahman Z-u, Jobson DJ, Woodell GA. Multi-scale retinex for color image enhancement. Proceedings of the 3rd IEEE International Conference on Image Processing. 1996;3:1003–6.
  7. Jobson DJ, Rahman Z-u, Woodell GA. A multiscale retinex for bridging the gap between color images and the human observation of scenes. IEEE Transactions on Image Processing. 1997;6(7):965–76.
  8. Lore KG, Akintayo A, Sarkar S. LLNet: A deep autoencoder approach to natural low-light image enhancement. Pattern Recognition. 2017;61:650–62.
  9. Wei C, Wang W, Yang W, Liu J. Deep retinex decomposition for low-light enhancement. arXiv preprint arXiv:1808.04560. 2018.
  10. Zhang Y, Zhang J, Guo X. Kindling the darkness: A practical low-light image enhancer. Proceedings of the 27th ACM International Conference on Multimedia. 2019:1632–40.
  11. Lv F, Lu F, Wu J, Lim C. MBLLEN: Low-light image/video enhancement using CNNs. BMVC. 2018;220(1):4.
  12. Wang R, Zhang Q, Fu C-W, Shen X, Zheng W-S, Jia J. Underexposed photo enhancement using deep illumination estimation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019:6849–57.
  13. Guo C, Li C, Guo J, Loy CC, Hou J, Kwong S, et al. Zero-reference deep curve estimation for low-light image enhancement. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020:1780–9.
  14. Jiang Y, Gong X, Liu D, Cheng Y, Fang C, Shen X, et al. EnlightenGAN: Deep light enhancement without paired supervision. IEEE Transactions on Image Processing. 2021;30:2340–9. pmid:33481709
  15. Lu K, Zhang L. TBEFN: A two-branch exposure-fusion network for low-light image enhancement. IEEE Transactions on Multimedia. 2020;23:4093–105.
  16. Zheng C, Shi D, Shi W. Adaptive unfolding total variation network for low-light image enhancement. Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021:4439–48.
  17. Ma L, Ma T, Liu R, Fan X, Luo Z. Toward fast, flexible, and robust low-light image enhancement. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022:5637–46.
  18. Jiang K, Wang Z, Wang Z, Chen C, Yi P, Lu T, et al. Degrade is upgrade: Learning degradation for low-light image enhancement. Proceedings of the AAAI Conference on Artificial Intelligence. 2022;36(1):1078–86.
  19. Wu W, Weng J, Zhang P, Wang X, Yang W, Jiang J. URetinex-Net: Retinex-based deep unfolding network for low-light image enhancement. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022:5901–10.
  20. Wang T, Zhang K, Shen T, Luo W, Stenger B, Lu T. Ultra-high-definition low-light image enhancement: A benchmark and transformer-based method. Proceedings of the AAAI Conference on Artificial Intelligence. 2023;37(3):2654–62.
  21. Fu Z, Yang Y, Tu X, Huang Y, Ding X, Ma K-K. Learning a simple low-light image enhancer from paired low-light instances. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023:22252–61.
  22. Wang Z, Bovik AC, Sheikh HR, Simoncelli EP. Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing. 2004;13(4):600–12. pmid:15376593
  23. Huang H, Lin L, Tong R, Hu H, Zhang Q, Iwamoto Y, et al. UNet 3+: A full-scale connected UNet for medical image segmentation. ICASSP 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2020:1055–9.
  24. Hou Q, Zhou D, Feng J. Coordinate attention for efficient mobile network design. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021:13713–22.