[go: up one dir, main page]

CN110728643A - Low-illumination band noise image optimization method based on convolutional neural network - Google Patents

Low-illumination band noise image optimization method based on convolutional neural network Download PDF

Info

Publication number
CN110728643A
CN110728643A CN201910993809.1A CN201910993809A CN110728643A CN 110728643 A CN110728643 A CN 110728643A CN 201910993809 A CN201910993809 A CN 201910993809A CN 110728643 A CN110728643 A CN 110728643A
Authority
CN
China
Prior art keywords
layer
neural network
deconvolution
image
convolutional neural
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910993809.1A
Other languages
Chinese (zh)
Inventor
吴海申
李启明
王礼凯
徐璐
吕玥齐
康信杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Maritime University
Original Assignee
Shanghai Maritime University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Maritime University filed Critical Shanghai Maritime University
Priority to CN201910993809.1A priority Critical patent/CN110728643A/en
Publication of CN110728643A publication Critical patent/CN110728643A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/70Denoising; Smoothing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Processing (AREA)
  • Image Analysis (AREA)

Abstract

本发明公开一种基于卷积神经网络的低照度带噪声图像优化方法,包含:S1、构建LLED‑Net卷积神经网络,其包括10个卷积层与10层相应的镜像反卷积层,使用跳越连接从卷积层连接到对应的镜像反卷积层;S2、收集正常光照图像,并通过人工合成相应的低照度带噪声图像,将图像进行数据增强扩充数量后作为训练数据,使用训练数据训练卷积神经网络模型,动态调整网络模型的学习率进行训练,得到训练好的LLED‑Net卷积神经网络;S3、通过训练好的LLED‑Net卷积神经网络模型对采集到的真实低照度带噪声图像进行重构优化,实现重构高质量图像。本发明可在低照度噪声图像的噪声强度、亮度未知情况下,使用模型自动学习图像的特征,提高图像亮度、去除图像噪声,重构高质量图片。

Figure 201910993809

The present invention discloses a low-illuminance and noise-based image optimization method based on a convolutional neural network. Use skip connections to connect from the convolutional layer to the corresponding mirror deconvolutional layer; S2, collect normal illumination images, and artificially synthesize the corresponding low-illumination and noisy images, and use the data to enhance and expand the images as training data. The training data trains the convolutional neural network model, dynamically adjusts the learning rate of the network model for training, and obtains the trained LLED-Net convolutional neural network; S3, through the trained LLED-Net convolutional neural network model, the collected real Reconstruction and optimization of low-illumination and noisy images to achieve high-quality image reconstruction. The present invention can use the model to automatically learn the characteristics of the image under the condition that the noise intensity and brightness of the low-illumination noise image are unknown, improve the image brightness, remove the image noise, and reconstruct a high-quality picture.

Figure 201910993809

Description

一种基于卷积神经网络的低照度带噪声图像优化方法A low-illumination and noisy image optimization method based on convolutional neural network

技术领域technical field

本发明涉及图像处理领域,具体涉及一种基于卷积神经网络的低照度带噪声图像优化方法。The invention relates to the field of image processing, in particular to a low-illuminance and noisy image optimization method based on a convolutional neural network.

背景技术Background technique

运用相机等摄像设备拍摄高质量的图片与视频在许多情况下都有着很大的作用,但是并非所有拍摄到的图像的图片都有着良好的质量。由于拍摄时的光照不足以及在传输过程中各种电噪声,机械噪声,信道噪声和其他噪声干扰,通常会导致捕捉的图像整体偏暗,同时还伴有图像噪声,造成图像降质,因此很难清楚地识别物体或纹理,因此很有必要改善低亮度图片的质量。Taking high-quality pictures and videos with camera and other imaging equipment can be useful in many situations, but not all images captured are of good quality. Due to insufficient lighting at the time of shooting and various electrical noise, mechanical noise, channel noise and other noise interference during the transmission process, the captured image is usually dark as a whole, accompanied by image noise, resulting in image degradation. It is difficult to clearly identify objects or textures, so it is necessary to improve the quality of low-light images.

目前,国内外改善低亮度噪声图像的主流算法是直方图均衡化(Histogramequalization,HE)、对比度受限自适应直方图均衡化(Contrast limited adaptivehistogram equalization,CLAHE)、伽马校正(Gamma correction,GC),这些方法虽然改善了低亮度噪声图像的图片质量,但效果仍不理想,并且有着各自的缺点,容易造成图片的色彩失真,图像出现大量白块等问题。At present, the mainstream algorithms for improving low luminance noise images at home and abroad are histogram equalization (HE), contrast limited adaptive histogram equalization (CLAHE), and gamma correction (GC). , although these methods improve the image quality of low-luminance noise images, the effect is still not ideal, and they have their own shortcomings, which are easy to cause color distortion in the image, and a large number of white blocks in the image.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于提供一种基于卷积神经网络的低照度带噪声图像优化方法,其可以在低照度噪声图像的噪声强度、亮度未知的情况下,使用模型自动学习图像的特征,提高图像亮度、去除图像噪声,重构高质量图片。The purpose of the present invention is to provide a low-illumination noise image optimization method based on a convolutional neural network, which can automatically learn the characteristics of the image by using the model under the condition that the noise intensity and brightness of the low-illumination noise image are unknown, and improve the image brightness. , remove image noise and reconstruct high-quality images.

为了达到上述目的,本发明通过以下技术方案实现:In order to achieve the above object, the present invention realizes through the following technical solutions:

一种基于卷积神经网络的低照度带噪声图像优化方法,包含:A low-light and noisy image optimization method based on convolutional neural network, including:

步骤S1、构建LLED-Net卷积神经网络;所述LLED-Net卷积神经网络为卷积-反卷积结构,包括N个卷积层与N层相应的镜像反卷积层,使用跳越连接从卷积层连接到对应的镜像反卷积层,所述跳越连接是指单个卷积层的输出传递给下一层卷积层,且将输出传递给对应的镜像反卷积层;所述LLED-Net卷积神经网络使用结构相似性损失作为卷积神经网络的损失函数;Step S1, constructing an LLED-Net convolutional neural network; the LLED-Net convolutional neural network is a convolution-deconvolution structure, including N convolutional layers and N-layer corresponding mirrored deconvolutional layers, using skipping The connection is connected from the convolution layer to the corresponding mirror deconvolution layer, and the skip connection means that the output of a single convolution layer is passed to the next layer of convolution layer, and the output is passed to the corresponding mirror deconvolution layer; The LLED-Net convolutional neural network uses structural similarity loss as the loss function of the convolutional neural network;

步骤S2、收集正常光照图像,并通过人工合成相应的低照度带噪声图像,将该低照度带噪声图像进行数据增强扩充数量后作为训练数据,使用所述训练数据训练所述LLED-Net卷积神经网络模型,动态调整网络模型的学习率进行训练,得到训练好的LLED-Net卷积神经网络;Step S2: Collect normal illumination images, and artificially synthesize corresponding low-illuminance noisy images, use the low-illuminance noisy images for data enhancement and expansion as training data, and use the training data to train the LLED-Net convolution. Neural network model, dynamically adjust the learning rate of the network model for training, and obtain a trained LLED-Net convolutional neural network;

步骤S3、通过训练好的LLED-Net卷积神经网络模型对采集到的真实低照度带噪声图像进行重构优化,实现重构高质量图像。Step S3: Reconstructing and optimizing the collected real low-illuminance image with noise by using the trained LLED-Net convolutional neural network model to achieve high-quality image reconstruction.

优选地,所述LLED-Net卷积神经网络包含十层用于处理图像的卷积层与十层相应的镜像反卷积层;所述卷积层作为特征提取器,经过所述卷积层转发后,后接10个反卷积层恢复图像中的细节,反卷积层特征图数目与相应的卷积层特征图数目镜像相同;所述LLED-Net卷积神经网络模型使用跳越连接从卷积层连接到对应的镜像反卷积层,将层层传递的卷积层特征图逐项求和又作为反卷积层的输入,并通过跳越连接进行校正后再传递到下一层反卷积层。Preferably, the LLED-Net convolutional neural network includes ten convolutional layers for processing images and ten corresponding mirrored deconvolutional layers; the convolutional layers are used as feature extractors, and the convolutional layers pass through the convolutional layers. After forwarding, 10 deconvolution layers are followed to restore the details in the image, and the number of deconvolution layer feature maps is the same as the number of corresponding convolution layer feature maps; the LLED-Net convolutional neural network model uses skip connections Connect from the convolutional layer to the corresponding mirror deconvolutional layer, sum up the convolutional layer feature maps transmitted layer by layer and use it as the input of the deconvolutional layer, and correct it through skip connections before passing it to the next layer deconvolution layer.

优选地,所述LLED-Net卷积神经网络的结构中,进一步包含:Preferably, the structure of the LLED-Net convolutional neural network further includes:

十层卷积层从输入端至输出端依次记作为第一层卷积层、第二层卷积层…直至第十层卷积层;The ten-layer convolutional layer is recorded as the first convolutional layer, the second convolutional layer... until the tenth convolutional layer from the input end to the output end;

十层反卷积层从输入端至输出端依次记作为第一层反卷积层、第二层反卷积层…直至第十层反卷积层;The ten deconvolution layers are sequentially recorded as the first deconvolution layer, the second deconvolution layer... until the tenth deconvolution layer from the input end to the output end;

第i层卷积层对应地与第11-i层反卷积层进行跳越连接,且1≤i≤10。The i-th convolutional layer is correspondingly skip-connected with the 11-i-th deconvolutional layer, and 1≤i≤10.

优选地,所述LLED-Net卷积神经网络的结构中,进一步包含:Preferably, the structure of the LLED-Net convolutional neural network further includes:

第一层卷积层至第四层卷积层的卷积核数目均为128,第五层卷积层至第七层卷积层的卷积核数目均为256,第八层卷积层至第十层卷积层的卷积核数目均为512;The number of convolution kernels from the first convolutional layer to the fourth convolutional layer is 128, the number of convolutional kernels from the fifth convolutional layer to the seventh convolutional layer is 256, and the eighth convolutional layer The number of convolution kernels up to the tenth convolution layer is 512;

第一层反卷积层至第二层反卷积层的卷积核数目均为512,第三层反卷积层至第五层反卷积层的卷积核数目均为256,第六层反卷积层至第九层反卷积层的卷积核数目均128,第十层反卷积层的卷积核数目为3。The number of convolution kernels from the first deconvolution layer to the second deconvolution layer is 512, the number of convolution kernels from the third deconvolution layer to the fifth deconvolution layer is 256, and the number of convolution kernels of the sixth deconvolution layer is 256. The number of convolution kernels from the deconvolution layer to the ninth deconvolution layer is 128, and the number of convolution kernels of the tenth deconvolution layer is 3.

优选地,所述的基于卷积神经网络的低照度带噪声图像优化方法,进一步包含以下过程:经过每个卷积或者反卷积操作后采用padding进行补零操作;每个卷积层或反卷积层操作后使用非线性整流函数进行激活;所有的卷积层和反卷积层的卷积核大小被设置为3×3。Preferably, the convolutional neural network-based low-illumination image optimization method with noise further includes the following process: after each convolution or deconvolution operation, padding is used to perform a zero-filling operation; each convolution layer or inverse The convolutional layers are activated with a non-linear rectification function after operation; the kernel size of all convolutional and deconvolutional layers is set to 3 × 3.

优选地,所述步骤S1中,进一步包含:Preferably, in the step S1, it further comprises:

将结构相似性SSIM损失作为所述LLED-Net卷积神经网络的损失函数,结构相似性的公式如下:Taking the structural similarity SSIM loss as the loss function of the LLED-Net convolutional neural network, the formula for structural similarity is as follows:

Figure BDA0002239123590000031
Figure BDA0002239123590000031

其中,μx是x的平均值;μy是y的平均值;

Figure BDA0002239123590000034
是x的方差;
Figure BDA0002239123590000033
是y的方差;σxy是x与y的协方差;c1=(k1L)2,c2=(k2L)2,是用来维持稳定的常数;L是像素值的动态范围;k1=0.01,k2=0.03;where μ x is the mean value of x; μ y is the mean value of y;
Figure BDA0002239123590000034
is the variance of x;
Figure BDA0002239123590000033
is the variance of y; σ xy is the covariance of x and y; c 1 =(k 1 L) 2 , c 2 =(k 2 L) 2 , are constants used to maintain stability; L is the dynamic range of pixel values ; k 1 =0.01, k 2 =0.03;

损失函数公式如下:The loss function formula is as follows:

其中,N代表训练数据集数量;x代表人工合成的低照度带噪声图像;X代表人工合成的低照度带噪声图像数据集;y代表正常光照图像;Y代表正常光照图像数据集。Among them, N represents the number of training data sets; x represents the artificially synthesized low-illuminance image with noise; X represents the artificially synthesized low-illumination image dataset with noise; y represents the normal illumination image; Y represents the normal illumination image dataset.

优选地,所述步骤S2中,进一步包含:Preferably, in the step S2, it further comprises:

步骤S21、选取多张在正常光照下拍摄的无噪声图像,将所有的无噪声图像依次顺时针旋转90°、180°、270°,再进行水平翻转,并将所有图像再进行数据增强扩充数据后成为待处理训练数据;Step S21: Select multiple noise-free images taken under normal lighting, rotate all noise-free images clockwise by 90°, 180°, and 270° in turn, and then perform horizontal flipping, and then perform data enhancement and data expansion on all images. Then it becomes the training data to be processed;

步骤S22、对经所述步骤S21处理后的每张图像加上高斯噪声,再进行非线性调整变为低亮度图片,则处理后的图片成为随机亮度且带有随机高斯噪声的低质量图像;Step S22, adding Gaussian noise to each image processed in step S21, and then performing non-linear adjustment to become a low-brightness image, then the processed image becomes a low-quality image with random brightness and random Gaussian noise;

步骤S23、对经过所述步骤S22处理后的每张低质量图像随机切割像素为41×41的图片块,将得到的图片集作为训练数据集;Step S23, randomly cutting a picture block with a pixel of 41×41 for each low-quality image processed in the step S22, and using the obtained picture set as a training data set;

步骤S24、将经步骤S23得到的训练数据集输入所述LLED-Net卷积神经网络,实现正向传播;其中,每遍历4000个样本为一代,每两代将学习率降为原先的0.1,迭代次数为10代,最后得到训练好的LLED-Net卷积神经网络模型。Step S24, input the training data set obtained in step S23 into the LLED-Net convolutional neural network to realize forward propagation; wherein, each traversed 4000 samples is one generation, and the learning rate is reduced to the original 0.1 every two generations, The number of iterations is 10 generations, and finally the trained LLED-Net convolutional neural network model is obtained.

优选地,所述步骤S22中,进一步包含:Preferably, in the step S22, it further comprises:

高斯噪声的噪声强度σ在(0,25)范围内随机选择;图片亮度值在γ(2,5)之间随机选取。The noise intensity σ of Gaussian noise is randomly selected in the range of (0, 25); the image brightness value is randomly selected between γ (2, 5).

与现有技术相比,本发明的有益效果在于:本发明构建端到端的卷积神经网络,人工合成训练数据,无需人为的干预,从训练数据中自主的学习图像的主要的特征,构建去除噪声、提高图像亮度的最优模型;本发明无需知道待处理图像的亮度与噪声强度,能够使用训练好的模型快速处理未知低照度噪声图像,重建高质量图像,泛化效果好、具有高鲁棒性;本发明处理效果极好,无论是重建图像的亮度、色彩还原度、图像纹理、噪声去除度都比以前的技术有较大的进步。Compared with the prior art, the beneficial effects of the present invention are: the present invention constructs an end-to-end convolutional neural network, artificially synthesizes training data, without human intervention, autonomously learns the main features of the image from the training data, constructs and removes noise and the optimal model for improving image brightness; the present invention does not need to know the brightness and noise intensity of the image to be processed, can use the trained model to quickly process unknown low-illuminance noise images, reconstruct high-quality images, and has good generalization effect and high robustness. Robustness; the processing effect of the present invention is excellent, and the brightness of the reconstructed image, the degree of color restoration, the image texture, and the degree of noise removal have been greatly improved compared with the previous technology.

附图说明Description of drawings

图1为本发明的基于卷积神经网络的低照度带噪声图像优化方法的流程示意图;1 is a schematic flowchart of a method for optimizing a low-illuminance image with noise based on a convolutional neural network of the present invention;

图2为本发明的基于卷积神经网络的低照度带噪声图像优化方法中整体模型的结构图;2 is a structural diagram of an overall model in the method for optimizing a low-illuminance image with noise based on a convolutional neural network of the present invention;

图3为本发明的低照度噪声图像;Fig. 3 is the low illumination noise image of the present invention;

图4为本发明的经过LLED-Net卷积神经网络模型处理后的图像。FIG. 4 is an image processed by the LLED-Net convolutional neural network model of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments It is only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

如图1-图4所示,本发明公开了一种基于卷积神经网络的低照度带噪声图像优化方法,包含以下步骤:As shown in FIG. 1-FIG. 4, the present invention discloses a method for optimizing low-illuminance images with noise based on convolutional neural network, which includes the following steps:

步骤S1、构建LLED-Net卷积神经网络;其中,所述LLED-Net卷积神经网络包括N个(例如10个)卷积层与N层(10层)相应的镜像反卷积层,使用跳越连接从各个卷积层连接到对应的镜像反卷积层,如图2所示;并使用结构相似性(SSIM)损失作为卷积神经网络的损失函数,如图1所示。Step S1, constructing an LLED-Net convolutional neural network; wherein, the LLED-Net convolutional neural network includes N (for example, 10) convolutional layers and N layers (10 layers) corresponding mirror deconvolution layers, using Skip connections connect from each convolutional layer to the corresponding mirror deconvolutional layer, as shown in Figure 2; and use the Structural Similarity (SSIM) loss as the loss function of the convolutional neural network, as shown in Figure 1.

步骤S2、收集正常光照图像,并通过人工合成相应的低照度带噪声图像,将该低照度带噪声图像进行数据增强扩充数量后作为训练数据(训练数据为低照度带噪声图像集),使用训练数据训练所述LLED-Net卷积神经网络模型,动态调整网络模型的学习率进行训练,得到训练好的LLED-Net卷积神经网络。Step S2, collecting normal illumination images, and artificially synthesizing corresponding low-illuminance and noisy images, performing data enhancement and expansion on the low-illumination and noisy images as training data (the training data is a set of low-illumination and noisy images), and using the training data The LLED-Net convolutional neural network model is trained with data, and the learning rate of the network model is dynamically adjusted for training to obtain a trained LLED-Net convolutional neural network.

步骤S3、通过训练好的LLED-Net卷积神经网络模型对采集到的真实低照度带噪声图像进行重构优化,实现重构高质量图像。Step S3: Reconstructing and optimizing the collected real low-illuminance image with noise by using the trained LLED-Net convolutional neural network model to achieve high-quality image reconstruction.

本实施例中,所述LLED-Net卷积神经网络模型为卷积-反卷积结构,先用10个卷积层处理图像,卷积层的特征图数目逐渐增多;卷积层作为特征提取器,提取图像中物体的主要特征,同时提高图像亮度、去除图像噪声;在经过卷积层转发后,后接10个反卷积层来恢复图像中的细节,反卷积层特征图数目与相应的卷积层特征图数目镜像相同,用来重构高质量图像;跳越连接是指不仅将单个卷积层的输出传递给下一层卷积层,还将输出传递给相对应的镜像反卷积层,即本发明使用跳越连接从卷积层连接到对应的镜像反卷积层。将层层传递的卷积特征图逐项求和又作为反卷积的输入,并通过跳越连接进行校正后再传递到下一层反卷积层;避免卷积神经网络在训练过程中发生梯度消失和梯度爆炸,使网络在训练过程更加平稳,增加网络收敛速度,减少训练时间。In this embodiment, the LLED-Net convolutional neural network model is a convolution-deconvolution structure. First, 10 convolutional layers are used to process the image, and the number of feature maps of the convolutional layer is gradually increased; the convolutional layer is used as feature extraction. It extracts the main features of the objects in the image, improves the image brightness and removes image noise; after forwarding through the convolution layer, 10 deconvolution layers are followed to restore the details in the image. The number of feature maps in the deconvolution layer is the same as The corresponding convolutional layer feature maps have the same number of mirrors, which are used to reconstruct high-quality images; skip connections refer to not only passing the output of a single convolutional layer to the next convolutional layer, but also passing the output to the corresponding mirror The deconvolution layer, that is, the present invention uses skip connections to connect from the convolution layer to the corresponding mirror deconvolution layer. The convolutional feature maps passed layer by layer are summed item by item and used as the input of deconvolution, corrected by skip connection and then passed to the next deconvolution layer; avoid the occurrence of convolutional neural network in the training process Gradient disappearance and gradient explosion make the network more stable in the training process, increase the network convergence speed and reduce the training time.

进一步地,所述LLED-Net卷积神经网络网络结构为:Further, the LLED-Net convolutional neural network network structure is:

(1)如图2所示,10层卷积层沿输入端至输出端依次记作为卷积层1、卷积层2…卷积层9、卷积层10;10层反卷积层沿输入端至输出端依次记作为反卷积层11、反卷积层12…反卷积层19、反卷积层20。对应地,卷积层1使用跳越连接与反卷积层20连接,卷积层2使用跳越连接与反卷积层19连接,卷积层3使用跳越连接与反卷积层18连接…卷积层10使用跳越连接与反卷积层11连接。(1) As shown in Figure 2, the 10-layer convolutional layers are sequentially recorded as convolutional layer 1, convolutional layer 2...convolutional layer 9, and convolutional layer 10 along the input end to the output end; The input end to the output end are denoted as the deconvolution layer 11 , the deconvolution layer 12 , the deconvolution layer 19 , and the deconvolution layer 20 in sequence. Correspondingly, convolutional layer 1 is connected to deconvolution layer 20 using skip connections, convolutional layer 2 is connected to deconvolution layer 19 using skip connections, and convolutional layer 3 is connected to deconvolution layer 18 using skip connections. ...convolutional layer 10 is connected to deconvolutional layer 11 using skip connections.

(2)卷积层使用卷积核对图片进行卷积操作,卷积层1至4层卷积核数目为128,卷积层5至7层卷积核数目为256,卷积层8至10层卷积核数目为512;反卷积层使用反卷积核对图片进行反卷积操作,反卷积层11至12层反卷积核数目为512,与卷积层8至10层核数相同,反卷积层13至15层反卷积核数目为256,与卷积层5至7层核数相同,反卷积层16至19层反卷积核数目为128,与卷积层1至4层核数相同,反卷积层20层反卷积核数目为3,进行图片的重构工作。(2) The convolution layer uses convolution kernels to perform convolution operations on the image. The number of convolution kernels in convolution layers 1 to 4 is 128, the number of convolution kernels in convolution layers 5 to 7 is 256, and the number of convolution layers 8 to 10 The number of layer convolution kernels is 512; the deconvolution layer uses deconvolution kernels to perform deconvolution operations on the image. The number of deconvolution kernels in deconvolution layers 11 to 12 is 512, and the number of kernels in convolution layers 8 to 10 layers is 512. The same, the number of deconvolution kernels in the deconvolution layers 13 to 15 is 256, which is the same as the number of kernels in the convolution layers 5 to 7, and the number of deconvolution kernels in the deconvolution layers 16 to 19 is 128, which is the same as the convolution layer. The number of kernels in layers 1 to 4 is the same, and the number of deconvolution kernels in the 20-layer deconvolution layer is 3, which is used for image reconstruction.

本实施例中,经过每个卷积操作后或者反卷积操作后都采用padding进行补零操作。每个卷积层或反卷积层后再使用非线性整流函数(Relu)进行激活(即在每个卷积操作后再使用非线性整流函数进行激活,每个反卷积操作后也再使用非线性整流函数进行激活)。所有卷积层和反卷积层的卷积核大小被设置为3×3。In this embodiment, padding is used to perform a zero-filling operation after each convolution operation or after a deconvolution operation. After each convolution layer or deconvolution layer, use the nonlinear rectification function (Relu) for activation (that is, use the nonlinear rectification function for activation after each convolution operation, and use it again after each deconvolution operation. The nonlinear rectification function is activated). The kernel size of all convolutional and deconvolutional layers is set to 3 × 3.

所述步骤S1中,进一步包含以下过程:In the step S1, the following process is further included:

将结构相似性(SSIM)损失作为所述LLED-Net卷积神经网络的损失函数,结构相似性的公式为:Taking structural similarity (SSIM) loss as the loss function of the LLED-Net convolutional neural network, the formula for structural similarity is:

Figure BDA0002239123590000061
Figure BDA0002239123590000061

其中,μx是x的平均值;μy是y的平均值;

Figure BDA0002239123590000062
是x的方差;
Figure BDA0002239123590000063
是y的方差;σxy是x与y的协方差;c1=(k1L)2,c2=(k2L)2,是用来维持稳定的常数;L是像素值的动态范围;k1=0.01,k2=0.03。where μ x is the mean value of x; μ y is the mean value of y;
Figure BDA0002239123590000062
is the variance of x;
Figure BDA0002239123590000063
is the variance of y; σ xy is the covariance of x and y; c 1 =(k 1 L) 2 , c 2 =(k 2 L) 2 , are constants used to maintain stability; L is the dynamic range of pixel values ; k 1 =0.01, k 2 =0.03.

另,损失函数公式如下:In addition, the loss function formula is as follows:

Figure BDA0002239123590000064
Figure BDA0002239123590000064

其中,N代表训练数据集数量;x代表单个人工合成的低照度带噪声图像;X代表人工合成的低照度带噪声图像数据集;y代表单个正常光照图像;Y代表正常光照图像数据集。Among them, N represents the number of training data sets; x represents a single artificially synthesized low-illuminance image with noise; X represents a synthetic low-illumination image with noise dataset; y represents a single normal illumination image; Y represents a normal illumination image dataset.

所述步骤S2中,进一步包含以下过程:In the step S2, the following process is further included:

步骤S21、选取多张(例如300张)在正常光照下拍摄的无噪声图像,将所有图像依次顺时针旋转90°、180°、270°,再将所有图像进行水平翻转,且所有图像进行数据增强扩充数据后成为待处理训练数据。Step S21, select multiple (for example, 300) noise-free images taken under normal lighting, rotate all images clockwise by 90°, 180°, and 270° in turn, then flip all images horizontally, and perform data analysis on all images. The augmented data becomes the training data to be processed.

步骤S22、对经所述步骤S21处理后的每张图像加上高斯噪声,噪声强度σ在(0,25)范围内随机选择;再对每个图片进行非线性调整变为低亮度图片,图片亮度值γ在(2,5)之间随机选取中,处理后的图片成为随机亮度并且带有随机高斯噪声的低质量图像。上述对图片进行的处理都为matlab软件实现。In step S22, Gaussian noise is added to each image processed in step S21, and the noise intensity σ is randomly selected within the range of (0, 25); The brightness value γ is randomly selected between (2, 5), and the processed image becomes a low-quality image with random brightness and random Gaussian noise. The above image processing is realized by matlab software.

步骤S23、对经过所述步骤S22处理后的每张图片随机切割像素为41×41的图片块,将得到的图片集作为训练数据集。Step S23: Randomly cut a picture block with a pixel size of 41×41 for each picture processed in the step S22, and use the obtained picture set as a training data set.

步骤S24、将经步骤S23得到的训练数据集输入LLED-Net卷积神经网络,实现正向传播;每遍历4000个样本为一代,每两代将学习率降为原先的0.1,迭代次数为10代;最后得到训练好的LLED-Net卷积神经网络模型。Step S24, input the training data set obtained in step S23 into the LLED-Net convolutional neural network to realize forward propagation; each traversal of 4000 samples is one generation, the learning rate is reduced to the original 0.1 every two generations, and the number of iterations is 10 Generation; finally get the trained LLED-Net convolutional neural network model.

所述步骤S3中,进一步包含以下过程:In the step S3, the following process is further included:

将采集到了低照度噪声图片输入训练好的LLED-Net卷积神经网络模型,得到优化后的重建图像,与原始图像相比,重建后的图像的亮度、色彩真实度、纹理细节、图像质量都有很大提高,如图3所示。The collected low-illumination noise pictures are input into the trained LLED-Net convolutional neural network model to obtain an optimized reconstructed image. Compared with the original image, the brightness, color fidelity, texture details and image quality of the reconstructed image are all better. There is a great improvement, as shown in Figure 3.

综上所述,本发明构建端到端的卷积神经网络,人工合成训练数据,无需人为的干预,从训练数据中自主的学习图像的主要的特征,构建去除噪声、提高图像亮度的最优模型;无需知道待处理图像的亮度与噪声强度,能够使用训练好的模型快速处理未知的低照度噪声图像,重建高质量图像,泛化效果好、具有高鲁棒性;处理效果极好,无论是重建图像的亮度、色彩还原度、图像纹理、噪声去除度都比以前的技术有较大的进步。To sum up, the present invention constructs an end-to-end convolutional neural network, artificially synthesizes training data, without human intervention, autonomously learns the main features of the image from the training data, and constructs an optimal model for removing noise and improving image brightness. ; No need to know the brightness and noise intensity of the image to be processed, can use the trained model to quickly process unknown low-illumination noise images, reconstruct high-quality images, and have good generalization effect and high robustness; The processing effect is excellent, whether it is The brightness, color reproduction, image texture, and noise removal of the reconstructed image are all improved compared to the previous techniques.

尽管本发明的内容已经通过上述优选实施例作了详细介绍,但应当认识到上述的描述不应被认为是对本发明的限制。在本领域技术人员阅读了上述内容后,对于本发明的多种修改和替代都将是显而易见的。因此,本发明的保护范围应由所附的权利要求来限定。While the content of the present invention has been described in detail by way of the above preferred embodiments, it should be appreciated that the above description should not be construed as limiting the present invention. Various modifications and alternatives to the present invention will be apparent to those skilled in the art upon reading the foregoing. Accordingly, the scope of protection of the present invention should be defined by the appended claims.

Claims (8)

1. A low-illumination noisy image optimization method based on a convolutional neural network is characterized by comprising the following steps:
step S1, constructing an LLED-Net convolution neural network; the LLED-Net convolutional neural network is of a convolutional-deconvolution structure and comprises N convolutional layers and N corresponding mirror image deconvolution layers, and the convolutional layers are connected to the corresponding mirror image deconvolution layers by using jump connection, wherein the jump connection means that the output of a single convolutional layer is transmitted to the next convolutional layer and the output is transmitted to the corresponding mirror image deconvolution layer; the LLED-Net convolutional neural network uses the loss of structural similarity as a loss function of the convolutional neural network;
step S2, collecting normal illumination images, artificially synthesizing corresponding low-illumination-intensity-band noise images, performing data enhancement and quantity expansion on the low-illumination-intensity-band noise images to obtain training data, training the LLED-Net convolutional neural network model by using the training data, and dynamically adjusting the learning rate of the network model to train to obtain a trained LLED-Net convolutional neural network;
and step S3, reconstructing and optimizing the acquired real low-illumination band noise image through the trained LLED-Net convolutional neural network model, and realizing the reconstruction of a high-quality image.
2. The convolutional neural network-based low-illumination noisy image optimization method of claim 1,
the LLED-Net convolutional neural network comprises ten convolutional layers for processing images and ten corresponding mirror image deconvolution layers;
the convolutional layer is used as a feature extractor, after being forwarded by the convolutional layer, details in the images are restored by connecting 10 deconvolution layers, and the number of deconvolution layer feature images is the same as the number of corresponding convolutional layer feature images in a mirror image manner;
the LLED-Net convolutional neural network model is connected to a corresponding mirror image deconvolution layer from a convolution layer by using jump connection, the convolution layer characteristic graphs transmitted layer by layer are summed item by item and are used as the input of the deconvolution layer, and the convolution layer characteristic graphs are corrected by the jump connection and then transmitted to the next deconvolution layer.
3. The convolutional neural network-based low-illumination noisy image optimization method of claim 2,
the structure of the LLED-Net convolutional neural network further comprises:
the ten convolutional layers are sequentially recorded as a first convolutional layer, a second convolutional layer … to a tenth convolutional layer from the input end to the output end;
the ten deconvolution layers are sequentially recorded as a first layer deconvolution layer, a second layer deconvolution layer … to a tenth layer deconvolution layer from the input end to the output end;
the ith convolution layer is correspondingly connected with the 11 th-i th deconvolution layer in a jump way, and i is more than or equal to 1 and less than or equal to 10.
4. The convolutional neural network-based low-illumination noisy image optimization method of claim 3,
the structure of the LLED-Net convolutional neural network further comprises:
the number of convolution kernels from the first layer of convolution layer to the fourth layer of convolution layer is 128, the number of convolution kernels from the fifth layer of convolution layer to the seventh layer of convolution layer is 256, and the number of convolution kernels from the eighth layer of convolution layer to the tenth layer of convolution layer is 512;
the number of convolution kernels from the first layer of deconvolution layer to the second layer of deconvolution layer is 512, the number of convolution kernels from the third layer of deconvolution layer to the fifth layer of deconvolution layer is 256, the number of convolution kernels from the sixth layer of deconvolution layer to the ninth layer of deconvolution layer is 128, and the number of convolution kernels from the tenth layer of deconvolution layer is 3.
5. The convolutional neural network-based low-illumination noisy image optimization method of claim 1,
further comprising the following processes:
after each convolution or deconvolution operation, padding is adopted for zero padding operation;
activating each convolution layer or each deconvolution layer by using a nonlinear rectification function after operation;
the convolution kernel size of all convolutional and deconvolution layers is set to 3 × 3.
6. The convolutional neural network-based low-illumination noisy image optimization method of claim 1,
the step S1 further includes:
taking the structural similarity SSIM loss as a loss function of the LLED-Net convolutional neural network, wherein the structural similarity formula is as follows:
wherein, muxIs the average value of x; mu.syIs the average value of y;is the variance of x;is the variance of y; sigmaxyIs the covariance of x and y; c. C1=(k1L)2,c2=(k2L)2Is a constant used to maintain stability; l is the dynamic range of the pixel value; k is a radical of1=0.01,k2=0.03;
The loss function is formulated as follows:
Figure FDA0002239123580000031
wherein N represents the number of training data sets; x represents a artificially synthesized low-illumination noisy image; x represents a synthetic low-illumination noisy image dataset; y represents a normal illumination image; y represents a normal-light image dataset.
7. The convolutional neural network-based low-illumination noisy image optimization method of claim 1,
the step S2 further includes:
s21, selecting a plurality of noise-free images shot under normal illumination, clockwise rotating all the noise-free images by 90 degrees, 180 degrees and 270 degrees, horizontally turning, and performing data enhancement and data expansion on all the images to obtain training data to be processed;
step S22, Gaussian noise is added to each image processed in the step S21, then nonlinear adjustment is carried out to change the image into a low-brightness image, and the processed image becomes a low-quality image with random brightness and random Gaussian noise;
step S23, randomly cutting picture blocks with 41 × 41 pixels for each low-quality image processed in the step S22, and taking the obtained picture set as a training data set;
step S24, inputting the training data set obtained in the step S23 into the LLED-Net convolution neural network to realize forward propagation; wherein, every 4000 samples are traversed to be one generation, every two generations reduce the learning rate to be 0.1 of the original, the iteration times is 10 generations, and finally the trained LLED-Net convolution neural network model is obtained.
8. The convolutional neural network-based low-illumination noisy image optimization method of claim 7,
the step S22 further includes:
the noise intensity σ of the gaussian noise is randomly selected in the range of (0, 25);
the picture brightness values are randomly chosen between gamma (2, 5).
CN201910993809.1A 2019-10-18 2019-10-18 Low-illumination band noise image optimization method based on convolutional neural network Pending CN110728643A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910993809.1A CN110728643A (en) 2019-10-18 2019-10-18 Low-illumination band noise image optimization method based on convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910993809.1A CN110728643A (en) 2019-10-18 2019-10-18 Low-illumination band noise image optimization method based on convolutional neural network

Publications (1)

Publication Number Publication Date
CN110728643A true CN110728643A (en) 2020-01-24

Family

ID=69221560

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910993809.1A Pending CN110728643A (en) 2019-10-18 2019-10-18 Low-illumination band noise image optimization method based on convolutional neural network

Country Status (1)

Country Link
CN (1) CN110728643A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114004761A (en) * 2021-10-29 2022-02-01 福州大学 An image optimization method that combines deep learning night vision enhancement and filter noise reduction
CN114372941A (en) * 2021-12-16 2022-04-19 佳源科技股份有限公司 Low-illumination image enhancement method, device, equipment and medium
GB2607158A (en) * 2021-03-08 2022-11-30 Nvidia Corp Neural network training technique
CN116433517A (en) * 2023-03-31 2023-07-14 中国人民解放军国防科技大学 Low-illumination image enhancement method and system based on noise autoregressive model
CN119205537A (en) * 2024-09-26 2024-12-27 深圳职业技术大学 Low illumination image enhancement method based on convolutional neural network

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170213321A1 (en) * 2016-01-22 2017-07-27 Siemens Healthcare Gmbh Deep Unfolding Algorithm For Efficient Image Denoising Under Varying Noise Conditions
CN108876737A (en) * 2018-06-06 2018-11-23 武汉大学 A kind of image de-noising method of joint residual error study and structural similarity
CN109242788A (en) * 2018-08-21 2019-01-18 福州大学 One kind being based on coding-decoding convolutional neural networks low-light (level) image optimization method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170213321A1 (en) * 2016-01-22 2017-07-27 Siemens Healthcare Gmbh Deep Unfolding Algorithm For Efficient Image Denoising Under Varying Noise Conditions
CN108876737A (en) * 2018-06-06 2018-11-23 武汉大学 A kind of image de-noising method of joint residual error study and structural similarity
CN109242788A (en) * 2018-08-21 2019-01-18 福州大学 One kind being based on coding-decoding convolutional neural networks low-light (level) image optimization method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘超等: "超低照度下微光图像的深度卷积自编码网络复原", 《光学精密工程》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2607158A (en) * 2021-03-08 2022-11-30 Nvidia Corp Neural network training technique
CN114004761A (en) * 2021-10-29 2022-02-01 福州大学 An image optimization method that combines deep learning night vision enhancement and filter noise reduction
CN114372941A (en) * 2021-12-16 2022-04-19 佳源科技股份有限公司 Low-illumination image enhancement method, device, equipment and medium
CN114372941B (en) * 2021-12-16 2024-04-26 佳源科技股份有限公司 Low-light image enhancement method, device, equipment and medium
CN116433517A (en) * 2023-03-31 2023-07-14 中国人民解放军国防科技大学 Low-illumination image enhancement method and system based on noise autoregressive model
CN119205537A (en) * 2024-09-26 2024-12-27 深圳职业技术大学 Low illumination image enhancement method based on convolutional neural network

Similar Documents

Publication Publication Date Title
CN110728643A (en) Low-illumination band noise image optimization method based on convolutional neural network
CN113228094B (en) Image Processor
WO2021208122A1 (en) Blind video denoising method and device based on deep learning
CN114140353A (en) Swin-Transformer image denoising method and system based on channel attention
CN106709875B (en) A Compressed Low-Resolution Image Restoration Method Based on Joint Deep Network
Chang et al. Low-light image restoration with short-and long-exposure raw pairs
CN107481278B (en) Image bit depth expansion method and device based on combination frame
CN108876735A (en) A kind of blind denoising method of true picture based on depth residual error network
CN112465727A (en) Low-illumination image enhancement method without normal illumination reference based on HSV color space and Retinex theory
CN111369466B (en) Image distortion correction enhancement method of convolutional neural network based on deformable convolution
CN112308803B (en) A self-supervised low-light image enhancement and denoising method based on deep learning
CN108805836A (en) Method for correcting image based on the reciprocating HDR transformation of depth
CN110619593A (en) Double-exposure video imaging system based on dynamic scene
CN111612722A (en) Low-illumination image processing method based on simplified Unet full-convolution neural network
CN113450290B (en) Low-illumination image enhancement method and system based on image inpainting technology
CN103295194A (en) Brightness-controllable and detail-preservation tone mapping method
CN111915513B (en) Image denoising method based on improved adaptive neural network
CN113096029A (en) High dynamic range image generation method based on multi-branch codec neural network
CN112053309A (en) Image enhancement method and image enhancement device
CN111105376A (en) Single-exposure high dynamic range image generation method based on dual-branch neural network
CN113052764A (en) Video sequence super-resolution reconstruction method based on residual connection
CN112819705B (en) Real image denoising method based on mesh structure and long-distance correlation
CN107392879A (en) A kind of low-light (level) monitoring image Enhancement Method based on reference frame
CN116757959A (en) A HDR image reconstruction method based on Raw domain
CN115082315B (en) Demosaicing method applicable to low-illumination small-pixel CFA sampling and edge computing equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200124