CN110728643A

CN110728643A - Low-illumination band noise image optimization method based on convolutional neural network

Info

Publication number: CN110728643A
Application number: CN201910993809.1A
Authority: CN
Inventors: 吴海申; 李启明; 王礼凯; 徐璐; 吕玥齐; 康信杰
Original assignee: Shanghai Maritime University
Current assignee: Shanghai Maritime University
Priority date: 2019-10-18
Filing date: 2019-10-18
Publication date: 2020-01-24

Abstract

The present invention discloses a low-illuminance and noise-based image optimization method based on a convolutional neural network. Use skip connections to connect from the convolutional layer to the corresponding mirror deconvolutional layer; S2, collect normal illumination images, and artificially synthesize the corresponding low-illumination and noisy images, and use the data to enhance and expand the images as training data. The training data trains the convolutional neural network model, dynamically adjusts the learning rate of the network model for training, and obtains the trained LLED-Net convolutional neural network; S3, through the trained LLED-Net convolutional neural network model, the collected real Reconstruction and optimization of low-illumination and noisy images to achieve high-quality image reconstruction. The present invention can use the model to automatically learn the characteristics of the image under the condition that the noise intensity and brightness of the low-illumination noise image are unknown, improve the image brightness, remove the image noise, and reconstruct a high-quality picture.

Description

A low-illumination and noisy image optimization method based on convolutional neural network

技术领域technical field

本发明涉及图像处理领域，具体涉及一种基于卷积神经网络的低照度带噪声图像优化方法。The invention relates to the field of image processing, in particular to a low-illuminance and noisy image optimization method based on a convolutional neural network.

背景技术Background technique

运用相机等摄像设备拍摄高质量的图片与视频在许多情况下都有着很大的作用，但是并非所有拍摄到的图像的图片都有着良好的质量。由于拍摄时的光照不足以及在传输过程中各种电噪声，机械噪声，信道噪声和其他噪声干扰，通常会导致捕捉的图像整体偏暗，同时还伴有图像噪声，造成图像降质，因此很难清楚地识别物体或纹理，因此很有必要改善低亮度图片的质量。Taking high-quality pictures and videos with camera and other imaging equipment can be useful in many situations, but not all images captured are of good quality. Due to insufficient lighting at the time of shooting and various electrical noise, mechanical noise, channel noise and other noise interference during the transmission process, the captured image is usually dark as a whole, accompanied by image noise, resulting in image degradation. It is difficult to clearly identify objects or textures, so it is necessary to improve the quality of low-light images.

目前，国内外改善低亮度噪声图像的主流算法是直方图均衡化(Histogramequalization，HE)、对比度受限自适应直方图均衡化(Contrast limited adaptivehistogram equalization，CLAHE)、伽马校正(Gamma correction，GC)，这些方法虽然改善了低亮度噪声图像的图片质量，但效果仍不理想，并且有着各自的缺点，容易造成图片的色彩失真，图像出现大量白块等问题。At present, the mainstream algorithms for improving low luminance noise images at home and abroad are histogram equalization (HE), contrast limited adaptive histogram equalization (CLAHE), and gamma correction (GC). , although these methods improve the image quality of low-luminance noise images, the effect is still not ideal, and they have their own shortcomings, which are easy to cause color distortion in the image, and a large number of white blocks in the image.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于提供一种基于卷积神经网络的低照度带噪声图像优化方法，其可以在低照度噪声图像的噪声强度、亮度未知的情况下，使用模型自动学习图像的特征，提高图像亮度、去除图像噪声，重构高质量图片。The purpose of the present invention is to provide a low-illumination noise image optimization method based on a convolutional neural network, which can automatically learn the characteristics of the image by using the model under the condition that the noise intensity and brightness of the low-illumination noise image are unknown, and improve the image brightness. , remove image noise and reconstruct high-quality images.

为了达到上述目的，本发明通过以下技术方案实现：In order to achieve the above object, the present invention realizes through the following technical solutions:

一种基于卷积神经网络的低照度带噪声图像优化方法，包含：A low-light and noisy image optimization method based on convolutional neural network, including:

步骤S1、构建LLED-Net卷积神经网络；所述LLED-Net卷积神经网络为卷积-反卷积结构，包括N个卷积层与N层相应的镜像反卷积层，使用跳越连接从卷积层连接到对应的镜像反卷积层，所述跳越连接是指单个卷积层的输出传递给下一层卷积层，且将输出传递给对应的镜像反卷积层；所述LLED-Net卷积神经网络使用结构相似性损失作为卷积神经网络的损失函数；Step S1, constructing an LLED-Net convolutional neural network; the LLED-Net convolutional neural network is a convolution-deconvolution structure, including N convolutional layers and N-layer corresponding mirrored deconvolutional layers, using skipping The connection is connected from the convolution layer to the corresponding mirror deconvolution layer, and the skip connection means that the output of a single convolution layer is passed to the next layer of convolution layer, and the output is passed to the corresponding mirror deconvolution layer; The LLED-Net convolutional neural network uses structural similarity loss as the loss function of the convolutional neural network;

步骤S2、收集正常光照图像，并通过人工合成相应的低照度带噪声图像，将该低照度带噪声图像进行数据增强扩充数量后作为训练数据，使用所述训练数据训练所述LLED-Net卷积神经网络模型，动态调整网络模型的学习率进行训练，得到训练好的LLED-Net卷积神经网络；Step S2: Collect normal illumination images, and artificially synthesize corresponding low-illuminance noisy images, use the low-illuminance noisy images for data enhancement and expansion as training data, and use the training data to train the LLED-Net convolution. Neural network model, dynamically adjust the learning rate of the network model for training, and obtain a trained LLED-Net convolutional neural network;

步骤S3、通过训练好的LLED-Net卷积神经网络模型对采集到的真实低照度带噪声图像进行重构优化，实现重构高质量图像。Step S3: Reconstructing and optimizing the collected real low-illuminance image with noise by using the trained LLED-Net convolutional neural network model to achieve high-quality image reconstruction.

优选地，所述LLED-Net卷积神经网络包含十层用于处理图像的卷积层与十层相应的镜像反卷积层；所述卷积层作为特征提取器，经过所述卷积层转发后，后接10个反卷积层恢复图像中的细节，反卷积层特征图数目与相应的卷积层特征图数目镜像相同；所述LLED-Net卷积神经网络模型使用跳越连接从卷积层连接到对应的镜像反卷积层，将层层传递的卷积层特征图逐项求和又作为反卷积层的输入，并通过跳越连接进行校正后再传递到下一层反卷积层。Preferably, the LLED-Net convolutional neural network includes ten convolutional layers for processing images and ten corresponding mirrored deconvolutional layers; the convolutional layers are used as feature extractors, and the convolutional layers pass through the convolutional layers. After forwarding, 10 deconvolution layers are followed to restore the details in the image, and the number of deconvolution layer feature maps is the same as the number of corresponding convolution layer feature maps; the LLED-Net convolutional neural network model uses skip connections Connect from the convolutional layer to the corresponding mirror deconvolutional layer, sum up the convolutional layer feature maps transmitted layer by layer and use it as the input of the deconvolutional layer, and correct it through skip connections before passing it to the next layer deconvolution layer.

优选地，所述LLED-Net卷积神经网络的结构中，进一步包含：Preferably, the structure of the LLED-Net convolutional neural network further includes:

十层卷积层从输入端至输出端依次记作为第一层卷积层、第二层卷积层…直至第十层卷积层；The ten-layer convolutional layer is recorded as the first convolutional layer, the second convolutional layer... until the tenth convolutional layer from the input end to the output end;

十层反卷积层从输入端至输出端依次记作为第一层反卷积层、第二层反卷积层…直至第十层反卷积层；The ten deconvolution layers are sequentially recorded as the first deconvolution layer, the second deconvolution layer... until the tenth deconvolution layer from the input end to the output end;

第i层卷积层对应地与第11-i层反卷积层进行跳越连接，且1≤i≤10。The i-th convolutional layer is correspondingly skip-connected with the 11-i-th deconvolutional layer, and 1≤i≤10.

第一层卷积层至第四层卷积层的卷积核数目均为128，第五层卷积层至第七层卷积层的卷积核数目均为256，第八层卷积层至第十层卷积层的卷积核数目均为512；The number of convolution kernels from the first convolutional layer to the fourth convolutional layer is 128, the number of convolutional kernels from the fifth convolutional layer to the seventh convolutional layer is 256, and the eighth convolutional layer The number of convolution kernels up to the tenth convolution layer is 512;

第一层反卷积层至第二层反卷积层的卷积核数目均为512，第三层反卷积层至第五层反卷积层的卷积核数目均为256，第六层反卷积层至第九层反卷积层的卷积核数目均128，第十层反卷积层的卷积核数目为3。The number of convolution kernels from the first deconvolution layer to the second deconvolution layer is 512, the number of convolution kernels from the third deconvolution layer to the fifth deconvolution layer is 256, and the number of convolution kernels of the sixth deconvolution layer is 256. The number of convolution kernels from the deconvolution layer to the ninth deconvolution layer is 128, and the number of convolution kernels of the tenth deconvolution layer is 3.

优选地，所述的基于卷积神经网络的低照度带噪声图像优化方法，进一步包含以下过程：经过每个卷积或者反卷积操作后采用padding进行补零操作；每个卷积层或反卷积层操作后使用非线性整流函数进行激活；所有的卷积层和反卷积层的卷积核大小被设置为3×3。Preferably, the convolutional neural network-based low-illumination image optimization method with noise further includes the following process: after each convolution or deconvolution operation, padding is used to perform a zero-filling operation; each convolution layer or inverse The convolutional layers are activated with a non-linear rectification function after operation; the kernel size of all convolutional and deconvolutional layers is set to 3 × 3.

优选地，所述步骤S1中，进一步包含：Preferably, in the step S1, it further comprises:

将结构相似性SSIM损失作为所述LLED-Net卷积神经网络的损失函数，结构相似性的公式如下：Taking the structural similarity SSIM loss as the loss function of the LLED-Net convolutional neural network, the formula for structural similarity is as follows:

其中，μ_x是x的平均值；μ_y是y的平均值；

是x的方差；

是y的方差；σ_xy是x与y的协方差；c₁＝(k₁L)²，c₂＝(k₂L)²，是用来维持稳定的常数；L是像素值的动态范围；k₁＝0.01，k₂＝0.03；where μ _x is the mean value of x; μ _y is the mean value of y;

is the variance of x;

is the variance of y; σ _xy is the covariance of x and y; c ₁ =(k ₁ L) ² , c ₂ =(k ₂ L) ² , are constants used to maintain stability; L is the dynamic range of pixel values ; k ₁ =0.01, k ₂ =0.03;

损失函数公式如下：The loss function formula is as follows:

其中，N代表训练数据集数量；x代表人工合成的低照度带噪声图像；X代表人工合成的低照度带噪声图像数据集；y代表正常光照图像；Y代表正常光照图像数据集。Among them, N represents the number of training data sets; x represents the artificially synthesized low-illuminance image with noise; X represents the artificially synthesized low-illumination image dataset with noise; y represents the normal illumination image; Y represents the normal illumination image dataset.

优选地，所述步骤S2中，进一步包含：Preferably, in the step S2, it further comprises:

步骤S21、选取多张在正常光照下拍摄的无噪声图像，将所有的无噪声图像依次顺时针旋转90°、180°、270°，再进行水平翻转，并将所有图像再进行数据增强扩充数据后成为待处理训练数据；Step S21: Select multiple noise-free images taken under normal lighting, rotate all noise-free images clockwise by 90°, 180°, and 270° in turn, and then perform horizontal flipping, and then perform data enhancement and data expansion on all images. Then it becomes the training data to be processed;

步骤S22、对经所述步骤S21处理后的每张图像加上高斯噪声，再进行非线性调整变为低亮度图片，则处理后的图片成为随机亮度且带有随机高斯噪声的低质量图像；Step S22, adding Gaussian noise to each image processed in step S21, and then performing non-linear adjustment to become a low-brightness image, then the processed image becomes a low-quality image with random brightness and random Gaussian noise;

步骤S23、对经过所述步骤S22处理后的每张低质量图像随机切割像素为41×41的图片块，将得到的图片集作为训练数据集；Step S23, randomly cutting a picture block with a pixel of 41×41 for each low-quality image processed in the step S22, and using the obtained picture set as a training data set;

步骤S24、将经步骤S23得到的训练数据集输入所述LLED-Net卷积神经网络，实现正向传播；其中，每遍历4000个样本为一代，每两代将学习率降为原先的0.1，迭代次数为10代，最后得到训练好的LLED-Net卷积神经网络模型。Step S24, input the training data set obtained in step S23 into the LLED-Net convolutional neural network to realize forward propagation; wherein, each traversed 4000 samples is one generation, and the learning rate is reduced to the original 0.1 every two generations, The number of iterations is 10 generations, and finally the trained LLED-Net convolutional neural network model is obtained.

优选地，所述步骤S22中，进一步包含：Preferably, in the step S22, it further comprises:

高斯噪声的噪声强度σ在(0，25)范围内随机选择；图片亮度值在γ(2，5)之间随机选取。The noise intensity σ of Gaussian noise is randomly selected in the range of (0, 25); the image brightness value is randomly selected between γ (2, 5).

与现有技术相比，本发明的有益效果在于：本发明构建端到端的卷积神经网络，人工合成训练数据，无需人为的干预，从训练数据中自主的学习图像的主要的特征，构建去除噪声、提高图像亮度的最优模型；本发明无需知道待处理图像的亮度与噪声强度，能够使用训练好的模型快速处理未知低照度噪声图像，重建高质量图像，泛化效果好、具有高鲁棒性；本发明处理效果极好，无论是重建图像的亮度、色彩还原度、图像纹理、噪声去除度都比以前的技术有较大的进步。Compared with the prior art, the beneficial effects of the present invention are: the present invention constructs an end-to-end convolutional neural network, artificially synthesizes training data, without human intervention, autonomously learns the main features of the image from the training data, constructs and removes noise and the optimal model for improving image brightness; the present invention does not need to know the brightness and noise intensity of the image to be processed, can use the trained model to quickly process unknown low-illuminance noise images, reconstruct high-quality images, and has good generalization effect and high robustness. Robustness; the processing effect of the present invention is excellent, and the brightness of the reconstructed image, the degree of color restoration, the image texture, and the degree of noise removal have been greatly improved compared with the previous technology.

附图说明Description of drawings

图1为本发明的基于卷积神经网络的低照度带噪声图像优化方法的流程示意图；1 is a schematic flowchart of a method for optimizing a low-illuminance image with noise based on a convolutional neural network of the present invention;

图2为本发明的基于卷积神经网络的低照度带噪声图像优化方法中整体模型的结构图；2 is a structural diagram of an overall model in the method for optimizing a low-illuminance image with noise based on a convolutional neural network of the present invention;

图3为本发明的低照度噪声图像；Fig. 3 is the low illumination noise image of the present invention;

图4为本发明的经过LLED-Net卷积神经网络模型处理后的图像。FIG. 4 is an image processed by the LLED-Net convolutional neural network model of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚，下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments It is only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

如图1-图4所示，本发明公开了一种基于卷积神经网络的低照度带噪声图像优化方法，包含以下步骤：As shown in FIG. 1-FIG. 4, the present invention discloses a method for optimizing low-illuminance images with noise based on convolutional neural network, which includes the following steps:

步骤S1、构建LLED-Net卷积神经网络；其中，所述LLED-Net卷积神经网络包括N个(例如10个)卷积层与N层(10层)相应的镜像反卷积层，使用跳越连接从各个卷积层连接到对应的镜像反卷积层，如图2所示；并使用结构相似性(SSIM)损失作为卷积神经网络的损失函数，如图1所示。Step S1, constructing an LLED-Net convolutional neural network; wherein, the LLED-Net convolutional neural network includes N (for example, 10) convolutional layers and N layers (10 layers) corresponding mirror deconvolution layers, using Skip connections connect from each convolutional layer to the corresponding mirror deconvolutional layer, as shown in Figure 2; and use the Structural Similarity (SSIM) loss as the loss function of the convolutional neural network, as shown in Figure 1.

步骤S2、收集正常光照图像，并通过人工合成相应的低照度带噪声图像，将该低照度带噪声图像进行数据增强扩充数量后作为训练数据(训练数据为低照度带噪声图像集)，使用训练数据训练所述LLED-Net卷积神经网络模型，动态调整网络模型的学习率进行训练，得到训练好的LLED-Net卷积神经网络。Step S2, collecting normal illumination images, and artificially synthesizing corresponding low-illuminance and noisy images, performing data enhancement and expansion on the low-illumination and noisy images as training data (the training data is a set of low-illumination and noisy images), and using the training data The LLED-Net convolutional neural network model is trained with data, and the learning rate of the network model is dynamically adjusted for training to obtain a trained LLED-Net convolutional neural network.

本实施例中，所述LLED-Net卷积神经网络模型为卷积-反卷积结构，先用10个卷积层处理图像，卷积层的特征图数目逐渐增多；卷积层作为特征提取器，提取图像中物体的主要特征，同时提高图像亮度、去除图像噪声；在经过卷积层转发后，后接10个反卷积层来恢复图像中的细节，反卷积层特征图数目与相应的卷积层特征图数目镜像相同，用来重构高质量图像；跳越连接是指不仅将单个卷积层的输出传递给下一层卷积层，还将输出传递给相对应的镜像反卷积层，即本发明使用跳越连接从卷积层连接到对应的镜像反卷积层。将层层传递的卷积特征图逐项求和又作为反卷积的输入，并通过跳越连接进行校正后再传递到下一层反卷积层；避免卷积神经网络在训练过程中发生梯度消失和梯度爆炸，使网络在训练过程更加平稳，增加网络收敛速度，减少训练时间。In this embodiment, the LLED-Net convolutional neural network model is a convolution-deconvolution structure. First, 10 convolutional layers are used to process the image, and the number of feature maps of the convolutional layer is gradually increased; the convolutional layer is used as feature extraction. It extracts the main features of the objects in the image, improves the image brightness and removes image noise; after forwarding through the convolution layer, 10 deconvolution layers are followed to restore the details in the image. The number of feature maps in the deconvolution layer is the same as The corresponding convolutional layer feature maps have the same number of mirrors, which are used to reconstruct high-quality images; skip connections refer to not only passing the output of a single convolutional layer to the next convolutional layer, but also passing the output to the corresponding mirror The deconvolution layer, that is, the present invention uses skip connections to connect from the convolution layer to the corresponding mirror deconvolution layer. The convolutional feature maps passed layer by layer are summed item by item and used as the input of deconvolution, corrected by skip connection and then passed to the next deconvolution layer; avoid the occurrence of convolutional neural network in the training process Gradient disappearance and gradient explosion make the network more stable in the training process, increase the network convergence speed and reduce the training time.

进一步地，所述LLED-Net卷积神经网络网络结构为：Further, the LLED-Net convolutional neural network network structure is:

(1)如图2所示，10层卷积层沿输入端至输出端依次记作为卷积层1、卷积层2…卷积层9、卷积层10；10层反卷积层沿输入端至输出端依次记作为反卷积层11、反卷积层12…反卷积层19、反卷积层20。对应地，卷积层1使用跳越连接与反卷积层20连接，卷积层2使用跳越连接与反卷积层19连接，卷积层3使用跳越连接与反卷积层18连接…卷积层10使用跳越连接与反卷积层11连接。(1) As shown in Figure 2, the 10-layer convolutional layers are sequentially recorded as convolutional layer 1, convolutional layer 2...convolutional layer 9, and convolutional layer 10 along the input end to the output end; The input end to the output end are denoted as the deconvolution layer 11 , the deconvolution layer 12 , the deconvolution layer 19 , and the deconvolution layer 20 in sequence. Correspondingly, convolutional layer 1 is connected to deconvolution layer 20 using skip connections, convolutional layer 2 is connected to deconvolution layer 19 using skip connections, and convolutional layer 3 is connected to deconvolution layer 18 using skip connections. ...convolutional layer 10 is connected to deconvolutional layer 11 using skip connections.

(2)卷积层使用卷积核对图片进行卷积操作，卷积层1至4层卷积核数目为128，卷积层5至7层卷积核数目为256，卷积层8至10层卷积核数目为512；反卷积层使用反卷积核对图片进行反卷积操作，反卷积层11至12层反卷积核数目为512，与卷积层8至10层核数相同，反卷积层13至15层反卷积核数目为256，与卷积层5至7层核数相同，反卷积层16至19层反卷积核数目为128，与卷积层1至4层核数相同，反卷积层20层反卷积核数目为3，进行图片的重构工作。(2) The convolution layer uses convolution kernels to perform convolution operations on the image. The number of convolution kernels in convolution layers 1 to 4 is 128, the number of convolution kernels in convolution layers 5 to 7 is 256, and the number of convolution layers 8 to 10 The number of layer convolution kernels is 512; the deconvolution layer uses deconvolution kernels to perform deconvolution operations on the image. The number of deconvolution kernels in deconvolution layers 11 to 12 is 512, and the number of kernels in convolution layers 8 to 10 layers is 512. The same, the number of deconvolution kernels in the deconvolution layers 13 to 15 is 256, which is the same as the number of kernels in the convolution layers 5 to 7, and the number of deconvolution kernels in the deconvolution layers 16 to 19 is 128, which is the same as the convolution layer. The number of kernels in layers 1 to 4 is the same, and the number of deconvolution kernels in the 20-layer deconvolution layer is 3, which is used for image reconstruction.

本实施例中，经过每个卷积操作后或者反卷积操作后都采用padding进行补零操作。每个卷积层或反卷积层后再使用非线性整流函数(Relu)进行激活(即在每个卷积操作后再使用非线性整流函数进行激活，每个反卷积操作后也再使用非线性整流函数进行激活)。所有卷积层和反卷积层的卷积核大小被设置为3×3。In this embodiment, padding is used to perform a zero-filling operation after each convolution operation or after a deconvolution operation. After each convolution layer or deconvolution layer, use the nonlinear rectification function (Relu) for activation (that is, use the nonlinear rectification function for activation after each convolution operation, and use it again after each deconvolution operation. The nonlinear rectification function is activated). The kernel size of all convolutional and deconvolutional layers is set to 3 × 3.

所述步骤S1中，进一步包含以下过程：In the step S1, the following process is further included:

将结构相似性(SSIM)损失作为所述LLED-Net卷积神经网络的损失函数，结构相似性的公式为：Taking structural similarity (SSIM) loss as the loss function of the LLED-Net convolutional neural network, the formula for structural similarity is:

其中，μ_x是x的平均值；μ_y是y的平均值；

是x的方差；

是y的方差；σ_xy是x与y的协方差；c₁＝(k₁L)²，c₂＝(k₂L)²，是用来维持稳定的常数；L是像素值的动态范围；k₁＝0.01，k₂＝0.03。where μ _x is the mean value of x; μ _y is the mean value of y;

is the variance of x;

is the variance of y; σ _xy is the covariance of x and y; c ₁ =(k ₁ L) ² , c ₂ =(k ₂ L) ² , are constants used to maintain stability; L is the dynamic range of pixel values ; k ₁ =0.01, k ₂ =0.03.

另，损失函数公式如下：In addition, the loss function formula is as follows:

其中，N代表训练数据集数量；x代表单个人工合成的低照度带噪声图像；X代表人工合成的低照度带噪声图像数据集；y代表单个正常光照图像；Y代表正常光照图像数据集。Among them, N represents the number of training data sets; x represents a single artificially synthesized low-illuminance image with noise; X represents a synthetic low-illumination image with noise dataset; y represents a single normal illumination image; Y represents a normal illumination image dataset.

所述步骤S2中，进一步包含以下过程：In the step S2, the following process is further included:

步骤S21、选取多张(例如300张)在正常光照下拍摄的无噪声图像，将所有图像依次顺时针旋转90°、180°、270°，再将所有图像进行水平翻转，且所有图像进行数据增强扩充数据后成为待处理训练数据。Step S21, select multiple (for example, 300) noise-free images taken under normal lighting, rotate all images clockwise by 90°, 180°, and 270° in turn, then flip all images horizontally, and perform data analysis on all images. The augmented data becomes the training data to be processed.

步骤S22、对经所述步骤S21处理后的每张图像加上高斯噪声，噪声强度σ在(0，25)范围内随机选择；再对每个图片进行非线性调整变为低亮度图片，图片亮度值γ在(2，5)之间随机选取中，处理后的图片成为随机亮度并且带有随机高斯噪声的低质量图像。上述对图片进行的处理都为matlab软件实现。In step S22, Gaussian noise is added to each image processed in step S21, and the noise intensity σ is randomly selected within the range of (0, 25); The brightness value γ is randomly selected between (2, 5), and the processed image becomes a low-quality image with random brightness and random Gaussian noise. The above image processing is realized by matlab software.

步骤S23、对经过所述步骤S22处理后的每张图片随机切割像素为41×41的图片块，将得到的图片集作为训练数据集。Step S23: Randomly cut a picture block with a pixel size of 41×41 for each picture processed in the step S22, and use the obtained picture set as a training data set.

步骤S24、将经步骤S23得到的训练数据集输入LLED-Net卷积神经网络，实现正向传播；每遍历4000个样本为一代，每两代将学习率降为原先的0.1，迭代次数为10代；最后得到训练好的LLED-Net卷积神经网络模型。Step S24, input the training data set obtained in step S23 into the LLED-Net convolutional neural network to realize forward propagation; each traversal of 4000 samples is one generation, the learning rate is reduced to the original 0.1 every two generations, and the number of iterations is 10 Generation; finally get the trained LLED-Net convolutional neural network model.

所述步骤S3中，进一步包含以下过程：In the step S3, the following process is further included:

将采集到了低照度噪声图片输入训练好的LLED-Net卷积神经网络模型，得到优化后的重建图像，与原始图像相比，重建后的图像的亮度、色彩真实度、纹理细节、图像质量都有很大提高，如图3所示。The collected low-illumination noise pictures are input into the trained LLED-Net convolutional neural network model to obtain an optimized reconstructed image. Compared with the original image, the brightness, color fidelity, texture details and image quality of the reconstructed image are all better. There is a great improvement, as shown in Figure 3.

综上所述，本发明构建端到端的卷积神经网络，人工合成训练数据，无需人为的干预，从训练数据中自主的学习图像的主要的特征，构建去除噪声、提高图像亮度的最优模型；无需知道待处理图像的亮度与噪声强度，能够使用训练好的模型快速处理未知的低照度噪声图像，重建高质量图像，泛化效果好、具有高鲁棒性；处理效果极好，无论是重建图像的亮度、色彩还原度、图像纹理、噪声去除度都比以前的技术有较大的进步。To sum up, the present invention constructs an end-to-end convolutional neural network, artificially synthesizes training data, without human intervention, autonomously learns the main features of the image from the training data, and constructs an optimal model for removing noise and improving image brightness. ; No need to know the brightness and noise intensity of the image to be processed, can use the trained model to quickly process unknown low-illumination noise images, reconstruct high-quality images, and have good generalization effect and high robustness; The processing effect is excellent, whether it is The brightness, color reproduction, image texture, and noise removal of the reconstructed image are all improved compared to the previous techniques.

尽管本发明的内容已经通过上述优选实施例作了详细介绍，但应当认识到上述的描述不应被认为是对本发明的限制。在本领域技术人员阅读了上述内容后，对于本发明的多种修改和替代都将是显而易见的。因此，本发明的保护范围应由所附的权利要求来限定。While the content of the present invention has been described in detail by way of the above preferred embodiments, it should be appreciated that the above description should not be construed as limiting the present invention. Various modifications and alternatives to the present invention will be apparent to those skilled in the art upon reading the foregoing. Accordingly, the scope of protection of the present invention should be defined by the appended claims.

Claims

1. A low-illumination noisy image optimization method based on a convolutional neural network is characterized by comprising the following steps:

step S1, constructing an LLED-Net convolution neural network; the LLED-Net convolutional neural network is of a convolutional-deconvolution structure and comprises N convolutional layers and N corresponding mirror image deconvolution layers, and the convolutional layers are connected to the corresponding mirror image deconvolution layers by using jump connection, wherein the jump connection means that the output of a single convolutional layer is transmitted to the next convolutional layer and the output is transmitted to the corresponding mirror image deconvolution layer; the LLED-Net convolutional neural network uses the loss of structural similarity as a loss function of the convolutional neural network;

step S2, collecting normal illumination images, artificially synthesizing corresponding low-illumination-intensity-band noise images, performing data enhancement and quantity expansion on the low-illumination-intensity-band noise images to obtain training data, training the LLED-Net convolutional neural network model by using the training data, and dynamically adjusting the learning rate of the network model to train to obtain a trained LLED-Net convolutional neural network;

and step S3, reconstructing and optimizing the acquired real low-illumination band noise image through the trained LLED-Net convolutional neural network model, and realizing the reconstruction of a high-quality image.

2. The convolutional neural network-based low-illumination noisy image optimization method of claim 1,

the LLED-Net convolutional neural network comprises ten convolutional layers for processing images and ten corresponding mirror image deconvolution layers;

the convolutional layer is used as a feature extractor, after being forwarded by the convolutional layer, details in the images are restored by connecting 10 deconvolution layers, and the number of deconvolution layer feature images is the same as the number of corresponding convolutional layer feature images in a mirror image manner;

the LLED-Net convolutional neural network model is connected to a corresponding mirror image deconvolution layer from a convolution layer by using jump connection, the convolution layer characteristic graphs transmitted layer by layer are summed item by item and are used as the input of the deconvolution layer, and the convolution layer characteristic graphs are corrected by the jump connection and then transmitted to the next deconvolution layer.

3. The convolutional neural network-based low-illumination noisy image optimization method of claim 2,

the structure of the LLED-Net convolutional neural network further comprises:

the ten convolutional layers are sequentially recorded as a first convolutional layer, a second convolutional layer … to a tenth convolutional layer from the input end to the output end;

the ten deconvolution layers are sequentially recorded as a first layer deconvolution layer, a second layer deconvolution layer … to a tenth layer deconvolution layer from the input end to the output end;

the ith convolution layer is correspondingly connected with the 11 th-i th deconvolution layer in a jump way, and i is more than or equal to 1 and less than or equal to 10.

4. The convolutional neural network-based low-illumination noisy image optimization method of claim 3,

the structure of the LLED-Net convolutional neural network further comprises:

the number of convolution kernels from the first layer of convolution layer to the fourth layer of convolution layer is 128, the number of convolution kernels from the fifth layer of convolution layer to the seventh layer of convolution layer is 256, and the number of convolution kernels from the eighth layer of convolution layer to the tenth layer of convolution layer is 512;

the number of convolution kernels from the first layer of deconvolution layer to the second layer of deconvolution layer is 512, the number of convolution kernels from the third layer of deconvolution layer to the fifth layer of deconvolution layer is 256, the number of convolution kernels from the sixth layer of deconvolution layer to the ninth layer of deconvolution layer is 128, and the number of convolution kernels from the tenth layer of deconvolution layer is 3.

5. The convolutional neural network-based low-illumination noisy image optimization method of claim 1,

further comprising the following processes:

after each convolution or deconvolution operation, padding is adopted for zero padding operation;

activating each convolution layer or each deconvolution layer by using a nonlinear rectification function after operation;

the convolution kernel size of all convolutional and deconvolution layers is set to 3 × 3.

6. The convolutional neural network-based low-illumination noisy image optimization method of claim 1,

the step S1 further includes:

taking the structural similarity SSIM loss as a loss function of the LLED-Net convolutional neural network, wherein the structural similarity formula is as follows:

wherein, mu_xIs the average value of x; mu.s_yIs the average value of y;is the variance of x;is the variance of y; sigma_xyIs the covariance of x and y; c. C₁＝(k₁L)²，c₂＝(k₂L)²Is a constant used to maintain stability; l is the dynamic range of the pixel value; k is a radical of₁＝0.01，k₂＝0.03；

The loss function is formulated as follows:

wherein N represents the number of training data sets; x represents a artificially synthesized low-illumination noisy image; x represents a synthetic low-illumination noisy image dataset; y represents a normal illumination image; y represents a normal-light image dataset.

7. The convolutional neural network-based low-illumination noisy image optimization method of claim 1,

the step S2 further includes:

s21, selecting a plurality of noise-free images shot under normal illumination, clockwise rotating all the noise-free images by 90 degrees, 180 degrees and 270 degrees, horizontally turning, and performing data enhancement and data expansion on all the images to obtain training data to be processed;

step S22, Gaussian noise is added to each image processed in the step S21, then nonlinear adjustment is carried out to change the image into a low-brightness image, and the processed image becomes a low-quality image with random brightness and random Gaussian noise;

step S23, randomly cutting picture blocks with 41 × 41 pixels for each low-quality image processed in the step S22, and taking the obtained picture set as a training data set;

step S24, inputting the training data set obtained in the step S23 into the LLED-Net convolution neural network to realize forward propagation; wherein, every 4000 samples are traversed to be one generation, every two generations reduce the learning rate to be 0.1 of the original, the iteration times is 10 generations, and finally the trained LLED-Net convolution neural network model is obtained.

8. The convolutional neural network-based low-illumination noisy image optimization method of claim 7,

the step S22 further includes:

the noise intensity σ of the gaussian noise is randomly selected in the range of (0, 25);

the picture brightness values are randomly chosen between gamma (2, 5).