[go: up one dir, main page]

CN111583130A - Method for recovering license plate image for LPR - Google Patents

Method for recovering license plate image for LPR Download PDF

Info

Publication number
CN111583130A
CN111583130A CN202010291139.1A CN202010291139A CN111583130A CN 111583130 A CN111583130 A CN 111583130A CN 202010291139 A CN202010291139 A CN 202010291139A CN 111583130 A CN111583130 A CN 111583130A
Authority
CN
China
Prior art keywords
network
license plate
training
lpr
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010291139.1A
Other languages
Chinese (zh)
Inventor
杨海东
陈俊杰
黄坤山
彭文瑜
林玉山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN202010291139.1A priority Critical patent/CN111583130A/en
Publication of CN111583130A publication Critical patent/CN111583130A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/73Deblurring; Sharpening
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • G06V20/625License plates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

本发明公开了一种用于LPR的恢复车牌图像的方法,包括:对已知数据集中的图像进行一系列操作后,将图像按比例划分成训练集、验证集和测试集;建立用于车牌识别的图像恢复网络的模型,用训练集对其训练,得到相应的训练模型;用验证集来检验训练模型的准确率,进而来调节模型的超参数,优化模型来获得更好性能;将测试集图像输入到已确定好的优选模型,测试其泛化性能,观察车牌图像的恢复效果如何。本方案对车牌图像恢复网络的结构进行重新设计,增加了辅助网络来优化图像的恢复质量,使LPR的鲁棒性显著增加;另外通过去噪和校正网络相结合的方法,获得了很好的效果,因而车牌识别的准确率也相当高,是一个又快又准的识别网络。

Figure 202010291139

The invention discloses a method for recovering a license plate image for LPR. The recognized image restores the model of the network, trains it with the training set, and obtains the corresponding training model; uses the validation set to test the accuracy of the training model, and then adjusts the hyperparameters of the model and optimizes the model to obtain better performance; Set the image input to the determined optimal model, test its generalization performance, and observe how the recovery effect of the license plate image is. In this scheme, the structure of the license plate image restoration network is redesigned, an auxiliary network is added to optimize the restoration quality of the image, and the robustness of the LPR is significantly increased. Therefore, the accuracy of license plate recognition is also quite high, and it is a fast and accurate recognition network.

Figure 202010291139

Description

一种用于LPR的恢复车牌图像的方法A method for recovering license plate images for LPR

技术领域technical field

本发明涉及图像处理技术领域,尤其涉及一种用于LPR的恢复车牌图像的方法。The invention relates to the technical field of image processing, and in particular, to a method for recovering a license plate image for LPR.

背景技术Background technique

随着国家的经济实力不断提升,人民的生活水平也得到很大的提高,越来越多的家庭拥有自己的私家车。随之而来的交通问题也日趋增多,如今交通车辆管理的问题是城市管理中的重要难题之一。为此,智能交通系统(Intelligent Transportation System,以下简称ITS)应运而生。With the continuous improvement of the country's economic strength, people's living standards have also been greatly improved, and more and more families have their own private cars. The accompanying traffic problems are also increasing day by day. Today, the problem of traffic vehicle management is one of the important problems in urban management. To this end, the Intelligent Transportation System (hereinafter referred to as ITS) came into being.

ITS是将现有的科学技术(如计算机计数,传感器技术,图像处理技术等)结合起来用于交通运输、服务控制等方面,而车牌识别(License Plate Recognition,以下简称LPR)是其中的一个重要的基础环节。目前比较常用的方法是利用车牌的颜色和纹理特征来定位车牌区域,再对图像进行适当的增强处理,再对图像进行识别得到车牌信息。ITS is a combination of existing science and technology (such as computer counting, sensor technology, image processing technology, etc.) for transportation, service control, etc., and license plate recognition (License Plate Recognition, hereinafter referred to as LPR) is one of the important basic link. At present, the commonly used method is to use the color and texture features of the license plate to locate the license plate area, and then perform appropriate enhancement processing on the image, and then recognize the image to obtain the license plate information.

由于卷积神经网络(简称CNN)的发展和应用,许多计算机视觉领域的任务得到了较大的发展,同时基于CNN的许多LPR方法也被应用于解决识别现实世界中的车牌图像。然而,现有的很多方法都是基于捕获到的高质量的图像来实现的。至于在面对那些在恶劣环境下(如强光,夜晚,雾霾等)捕获的,或运动下采集到的模糊的、倾斜的等较低质量的图像,仍然难以识别。Due to the development and application of Convolutional Neural Networks (CNN for short), many tasks in the field of computer vision have been greatly developed, and many LPR methods based on CNN have also been applied to solve the problem of recognizing license plate images in the real world. However, many existing methods are based on captured high-quality images. As for low-quality images captured in harsh environments (such as strong light, night, haze, etc.), or captured in motion, it is still difficult to identify.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于克服现有技术的不足,提供一种鲁棒性强、识别准确率高、性能好的车牌图像恢复的方法。The purpose of the present invention is to overcome the deficiencies of the prior art, and to provide a method for recovering a license plate image with strong robustness, high recognition accuracy and good performance.

本发明的目的通过下述技术方案实现:The object of the present invention is achieved through the following technical solutions:

一种用于LPR的恢复车牌图像的方法,该方法主要包括如下具体步骤:A method for recovering a license plate image for LPR, the method mainly includes the following specific steps:

步骤S1:对已知数据集中的图像进行一系列操作后,将图像按比例划分成训练集、验证集和测试集。Step S1: After a series of operations are performed on the images in the known data set, the images are divided into training sets, validation sets and test sets in proportion.

具体的,所述S1步骤还包括如下步骤:Specifically, the step S1 further includes the following steps:

步骤S11:采用若干知名的车牌识别数据集VTLP,将训练集、验证集和测试集按6:2:2的比例来进行划分。Step S11: Using several well-known license plate recognition data sets VTLP, the training set, the verification set and the test set are divided according to the ratio of 6:2:2.

步骤S12:为了增加训练数据的量,再对训练集通过采取角度不同的旋转产生四张子图片,并通过尺寸变换和分割方法进行加倍;原来一张训练图记为IH,分成的四张旋转后的子图记为

Figure BDA0002450439400000021
i∈{-30°,-15°,+15°,+30°},尺寸变换后的子图记为
Figure BDA0002450439400000022
采取逐像素的二值分割后的图片记为
Figure BDA0002450439400000023
字符计数值为C。Step S12: In order to increase the amount of training data, the training set is then rotated with different angles to generate four sub-images, which are doubled by size transformation and segmentation methods; the original training image is denoted as I H , and divided into four sub-images. The rotated subgraph is denoted as
Figure BDA0002450439400000021
i∈{-30°,-15°,+15°,+30°}, the size-transformed subgraph is denoted as
Figure BDA0002450439400000022
The image after pixel-by-pixel binary segmentation is denoted as
Figure BDA0002450439400000023
The character count value is C.

作为本发明的优选方案,所述步骤S1中,一系列操作包括均值化、去雾、裁剪操作。As a preferred solution of the present invention, in the step S1, a series of operations include averaging, dehazing, and cropping operations.

步骤S2:建立用于车牌识别的图像恢复网络的模型,用训练集对其训练,得到相应的训练模型。Step S2: Establish a model of an image restoration network for license plate recognition, train it with a training set, and obtain a corresponding training model.

具体的,所述S2步骤还包括如下步骤:Specifically, the step S2 further includes the following steps:

S21:设置恢复网络的主干网络和辅助网络,主干网络包括两个子网络,辅助网络包括两个解码器模块,再分别训练子网络和模块;S21: Set the backbone network and auxiliary network of the restoration network, the backbone network includes two sub-networks, the auxiliary network includes two decoder modules, and then train the sub-networks and modules respectively;

S22:训练降噪子网络MDS22: Train the noise reduction sub-network MD ;

S23:训练校正子网络MRS23: training the correction sub-network MR ;

S24:训练像素分割模块ASS24: Train the pixel segmentation module A S ;

S25:训练字符技术模块ACS25: training character technology module A C ;

S26:把四个损失函数权重相加。S26: Add the weights of the four loss functions.

步骤S3:用验证集来检验训练模型的准确率,进而来调节模型的超参数,优化模型来获得更好性能。Step S3: Use the validation set to test the accuracy of the training model, then adjust the hyperparameters of the model, and optimize the model to obtain better performance.

进一步的,所述S3步骤还包括:在经过步骤S2得到训练模型之后,通过与校正子网络的连接,LPR网络能够获取到校正模块提供的输出图像结果

Figure BDA0002450439400000024
进而调节模型的超参数。Further, the step S3 further includes: after the training model is obtained through step S2, the LPR network can obtain the output image result provided by the correction module through the connection with the correction sub-network
Figure BDA0002450439400000024
Then adjust the hyperparameters of the model.

步骤S4:将测试集图像输入到已确定好的优选模型,测试其泛化性能,观察车牌图像的恢复效果如何。Step S4: Input the test set image into the determined optimal model, test its generalization performance, and observe how the recovery effect of the license plate image is.

进一步的,所述S4步骤还包括:将测试集图片提供给车牌识别网络,分别经过车牌图像恢复网络和LPR网络得到识别的结果LPRresultFurther, the step S4 further includes: providing the test set pictures to the license plate recognition network, and obtaining the recognition result LPR result through the license plate image restoration network and the LPR network respectively.

本发明的工作过程和原理是:本发明提供的一种用于LPR的恢复车牌图像的方法对车牌图像恢复网络的结构进行重新设计,增加了辅助网络来优化图像的恢复质量,使LPR的鲁棒性显著增加;另外本方案提供的恢复网络采取去噪和校正网络相结合的方法,获得了很好的效果,而且LPR网络采取的是目前准确率高且识别快速的检测器,因而车牌识别的准确率也相当高,是一个又快又准的识别网络。The working process and principle of the present invention are as follows: a method for recovering a license plate image for LPR provided by the present invention redesigns the structure of the license plate image recovery network, adds an auxiliary network to optimize the image recovery quality, and makes the LPR robust The robustness is significantly increased; in addition, the restoration network provided by this scheme adopts the method of combining denoising and correction network, and obtains good results, and the LPR network adopts the current detector with high accuracy and fast recognition, so the license plate recognition The accuracy rate is also quite high, and it is a fast and accurate recognition network.

与现有技术相比,本发明还具有以下优点:Compared with the prior art, the present invention also has the following advantages:

(1)本发明所提供的用于LPR的恢复车牌图像的方法能够很好地处理恢复低质量的车牌图片,并且恢复后的图像能被目前流行的LPR网络准确识别,适用于不同环境下的捕获得到的车牌图片。(1) The method for recovering license plate images for LPR provided by the present invention can handle and restore low-quality license plate images well, and the recovered images can be accurately recognized by the currently popular LPR network, which is suitable for different environments. Capture the obtained license plate image.

(2)本发明所提供的用于LPR的恢复车牌图像的方法针对低质量的图像,经过一系列基于深度学习的端到端的卷积神经网络来恢复图像,再把恢复后的图像拿到LPR网络上进行识别,能够显著降低干扰因素,快速、准确识别车牌。(2) The method for recovering license plate images for LPR provided by the present invention is aimed at low-quality images, and recovers images through a series of end-to-end convolutional neural networks based on deep learning, and then takes the recovered images to LPR. Identifying on the network can significantly reduce interference factors and quickly and accurately identify license plates.

(3)本发明所提供的用于LPR的恢复车牌图像的方法,针对车牌图像恢复网络的结构进行设计,增加了辅助网络来优化图像的恢复质量,并且能增强车牌识别时的鲁棒性。本发明的恢复网络采取去噪和校正网络相结合,再加以辅助网络的结构来最大化恢复图像的质量,该结构是独特而创新的。(3) The method for recovering a license plate image for LPR provided by the present invention is designed for the structure of a license plate image recovery network, an auxiliary network is added to optimize the image recovery quality, and the robustness of the license plate recognition can be enhanced. The restoration network of the present invention adopts the combination of denoising and correction network, and adds an auxiliary network structure to maximize the quality of the restored image, which is unique and innovative.

附图说明Description of drawings

图1是本发明所提供的用于LPR的恢复车牌图像的方法的流程图。FIG. 1 is a flowchart of a method for recovering a license plate image for LPR provided by the present invention.

图2是本发明所提供的整个车牌识别网络的结构示意图。FIG. 2 is a schematic structural diagram of the entire license plate recognition network provided by the present invention.

图3是本发明所提供的基于U-Net网络的结构示意图。FIG. 3 is a schematic structural diagram of a U-Net-based network provided by the present invention.

具体实施方式Detailed ways

为使本发明的目的、技术方案及优点更加清楚、明确,以下参照附图并举实施例对本发明作进一步说明。In order to make the objectives, technical solutions and advantages of the present invention clearer and clearer, the present invention will be further described below with reference to the accompanying drawings and examples.

实施例1:Example 1:

如图1至图3所示,本实施例公开了一种用于LPR的恢复车牌图像的方法,该方法主要包括如下具体步骤:As shown in FIG. 1 to FIG. 3 , the present embodiment discloses a method for recovering a license plate image for LPR, and the method mainly includes the following specific steps:

步骤S1:对已知数据集中的图像进行一系列操作后,将图像按比例划分成训练集、验证集和测试集。Step S1: After a series of operations are performed on the images in the known data set, the images are divided into training sets, validation sets and test sets in proportion.

具体的,所述S1步骤还包括如下步骤:Specifically, the step S1 further includes the following steps:

步骤S11:采用若干知名的车牌识别数据集VTLP,将训练集、验证集和测试集按6:2:2的比例来进行划分。Step S11: Using several well-known license plate recognition data sets VTLP, the training set, the verification set and the test set are divided according to the ratio of 6:2:2.

步骤S12:为了增加训练数据的量,再对训练集通过采取角度不同的旋转产生四张子图片,并通过尺寸变换和分割方法进行加倍;原来一张训练图记为IH,分成的四张旋转后的子图记为

Figure BDA0002450439400000041
i∈{-30°,-15°,+15°,+30°},尺寸变换后的子图记为
Figure BDA0002450439400000042
采取逐像素的二值分割后的图片记为
Figure BDA0002450439400000043
字符计数值为C。Step S12: In order to increase the amount of training data, the training set is then rotated with different angles to generate four sub-images, which are doubled by size transformation and segmentation methods; the original training image is denoted as I H , and divided into four sub-images. The rotated subgraph is denoted as
Figure BDA0002450439400000041
i∈{-30°,-15°,+15°,+30°}, the size-transformed subgraph is denoted as
Figure BDA0002450439400000042
The image after pixel-by-pixel binary segmentation is denoted as
Figure BDA0002450439400000043
The character count value is C.

作为本发明的优选方案,所述步骤S1中,一系列操作包括均值化、去雾、裁剪操作。As a preferred solution of the present invention, in the step S1, a series of operations include averaging, dehazing, and cropping operations.

步骤S2:建立用于车牌识别的图像恢复网络的模型,用训练集对其训练,得到相应的训练模型。Step S2: Establish a model of an image restoration network for license plate recognition, train it with a training set, and obtain a corresponding training model.

具体的,所述S2步骤还包括如下步骤:Specifically, the step S2 further includes the following steps:

S21:设置恢复网络的主干网络和辅助网络,主干网络包括两个子网络,辅助网络包括两个解码器模块,再分别训练子网络和模块;S21: Set the backbone network and auxiliary network of the restoration network, the backbone network includes two sub-networks, the auxiliary network includes two decoder modules, and then train the sub-networks and modules respectively;

S22:训练降噪子网络MDS22: Train the noise reduction sub-network MD ;

S23:训练校正子网络MRS23: training the correction sub-network MR ;

S24:训练像素分割模块ASS24: Train the pixel segmentation module A S ;

S25:训练字符技术模块ACS25: training character technology module A C ;

S26:把四个损失函数权重相加。S26: Add the weights of the four loss functions.

步骤S3:用验证集来检验训练模型的准确率,进而来调节模型的超参数,优化模型来获得更好性能。Step S3: Use the validation set to test the accuracy of the training model, then adjust the hyperparameters of the model, and optimize the model to obtain better performance.

进一步的,所述S3步骤还包括:在经过步骤S2得到训练模型之后,通过与校正子网络的连接,LPR网络能够获取到校正模块提供的输出图像结果

Figure BDA0002450439400000044
进而调节模型的超参数。Further, the step S3 further includes: after the training model is obtained through step S2, the LPR network can obtain the output image result provided by the correction module through the connection with the correction sub-network
Figure BDA0002450439400000044
Then adjust the hyperparameters of the model.

步骤S4:将测试集图像输入到已确定好的优选模型,测试其泛化性能,观察车牌图像的恢复效果如何。Step S4: Input the test set image into the determined optimal model, test its generalization performance, and observe how the recovery effect of the license plate image is.

进一步的,所述S4步骤还包括:将测试集图片提供给车牌识别网络,分别经过车牌图像恢复网络和LPR网络得到识别的结果LPRresultFurther, the step S4 further includes: providing the test set pictures to the license plate recognition network, and obtaining the recognition result LPR result through the license plate image restoration network and the LPR network respectively.

本发明的工作过程和原理是:本发明提供的一种用于LPR的恢复车牌图像的方法对车牌图像恢复网络的结构进行重新设计,增加了辅助网络来优化图像的恢复质量,使LPR的鲁棒性显著增加;另外本方案提供的恢复网络采取去噪和校正网络相结合的方法,获得了很好的效果,而且LPR网络采取的是目前准确率高且识别快速的检测器,因而车牌识别的准确率也相当高,是一个又快又准的识别网络。The working process and principle of the present invention are as follows: a method for recovering a license plate image for LPR provided by the present invention redesigns the structure of the license plate image recovery network, adds an auxiliary network to optimize the image recovery quality, and makes the LPR robust The robustness is significantly increased; in addition, the restoration network provided by this scheme adopts the method of combining denoising and correction network, and obtains good results, and the LPR network adopts the current high-accuracy and fast-recognizing detector, so license plate recognition The accuracy rate is also quite high, and it is a fast and accurate recognition network.

实施例2:Example 2:

参照图1到图3,本实施例公开了一种用于LPR的恢复车牌图像的方法包括以下步骤:Referring to FIG. 1 to FIG. 3 , the present embodiment discloses a method for recovering a license plate image for LPR, including the following steps:

S1:对已知数据集中的图像进行一系列如均值化、去雾、裁剪等操作后,生成的图片大小为572*572将图像按比例划分成训练集、验证集和测试集。S1: After performing a series of operations such as averaging, dehazing, and cropping on the images in the known data set, the size of the generated image is 572*572, and the image is divided into training set, validation set and test set proportionally.

具体步骤包括如下:The specific steps include the following:

S11:采用数据集VTLP,里面包含10650张车牌图片,然后把它按6:2:2的比例划分为训练集、验证集和测试集,分别包含6390、2130和2130张车牌图片。S11: The data set VTLP is used, which contains 10650 license plate images, and then it is divided into training set, validation set and test set according to the ratio of 6:2:2, containing 6390, 2130 and 2130 license plate images respectively.

S12:然后再对带正确标签的训练集进行扩容,一张标签图IH通过旋转变换扩充成四张子图

Figure BDA0002450439400000051
i∈{-30°,-15°,+15°,+30°},进行尺寸变换后生成四张子
Figure BDA0002450439400000052
二值分割后的子图记为
Figure BDA0002450439400000053
把字符标签记为C;S12: Then expand the training set with the correct label, and a label image I H is expanded into four sub-images through rotation transformation
Figure BDA0002450439400000051
i∈{-30°,-15°,+15°,+30°}, after the size transformation, four sheets are generated
Figure BDA0002450439400000052
The subgraph after binary segmentation is denoted as
Figure BDA0002450439400000053
Let the character label be C;

S2:建立用于车牌识别的图像恢复网络的模型,用训练集对其对其训练,得到相应的训练模型,具体步骤包括如下:S2: Establish an image restoration network model for license plate recognition, train it with a training set, and obtain a corresponding training model. The specific steps include the following:

S21:恢复网络包括主干网络和辅助网络,其中主干网络包括两个子网络,第一个子网络以低质量图像为输入,输出为恢复图像。第二个为校正网络,对来自降噪网络的输出结果进行校正。辅助网络包括文本计数模块和像素分割模块,他们的输入是降噪网络和校正网络的编码器图片特征之和,输出分别为字符数和二值分割后的图片。本发明的恢复网络都是采用U-Net结构,降噪网络和校正网络都包含编码器和解码器模块,而计数网络和分割网络则只有解码器模块。两个子网络都是基于U-Net结构搭建的,如图三所示即为U-Net结构,之所以选择这个结构,是因为U-Net结构可以提升图像中目标的细节信息,而不会对图像生成产生负面影响。我们采用基于U-Net的结构,同时添加了跳跃连接,可以共享图像低级语义信息。S21: The restoration network includes a backbone network and an auxiliary network, wherein the backbone network includes two sub-networks, and the first sub-network takes a low-quality image as input and outputs a restored image. The second is the correction network, which corrects the output from the noise reduction network. The auxiliary network includes a text counting module and a pixel segmentation module. Their input is the sum of the encoder image features of the noise reduction network and the correction network, and the output is the number of characters and the image after binary segmentation, respectively. The restoration network of the present invention adopts the U-Net structure, the noise reduction network and the correction network both include encoder and decoder modules, while the counting network and the segmentation network have only the decoder module. Both sub-networks are built based on the U-Net structure, as shown in Figure 3, which is the U-Net structure. The reason for choosing this structure is that the U-Net structure can improve the details of the target in the image. Image generation has a negative impact. We adopt a U-Net-based structure while adding skip connections, which can share low-level semantic information of images.

S211:主干网络的两个子网络都包括编码器和解码器,其中编码器的网络结构包括第一卷积层conv1(1,2)(3×3×32)×2,步长2;第二卷积层conv2(1,2)(3×3×64)×2,步长2;第三卷积层conv3(1,2)(3×3×128)×2,步长2;第四卷积层conv4(1,2)(3×3×256)×2,步长2;第五卷积层conv5(1,2)(3×3×512)×2,步长2;第一池化层pool1(2×2)max pooling,步长2;第二池化层pool2(2×2)max pooling,步长2;第三池化层pool3(2×2)max pooling,步长2;第四池化层pool4(2×2)max pooling,步长2;要注意的是每个卷积层前都有加一个BN模块,卷积层后连接的激活函数用的都是LeakyReLU函数,每个卷积层中经过第二个卷积核后生成的特征图都会通过跳跃连接引出到解码器网络中上采样后的卷积层后的特征图相级联,每两个卷积层之间都连有一个相应的最大池化层。S211: Both sub-networks of the backbone network include an encoder and a decoder, wherein the network structure of the encoder includes a first convolutional layer conv1(1,2)(3×3×32)×2, with a stride of 2; the second Convolutional layer conv2(1,2)(3×3×64)×2, stride 2; third convolutional layer conv3(1,2)(3×3×128)×2, stride 2; fourth Convolutional layer conv4(1,2)(3×3×256)×2, stride 2; fifth convolutional layer conv5(1,2)(3×3×512)×2, stride 2; Pooling layer pool1(2×2)max pooling, stride 2; second pooling layer pool2(2×2)max pooling, stride 2; third pooling layer pool3(2×2)max pooling, stride 2; The fourth pooling layer pool4 (2×2) max pooling, step size 2; it should be noted that a BN module is added before each convolutional layer, and the activation function connected after the convolutional layer uses LeakyReLU function, the feature map generated after the second convolution kernel in each convolutional layer will be extracted to the decoder network through skip connections. There is a corresponding max pooling layer between the layers.

S212:主干网络的子网络采用的两个解码器以及辅助网络的像素分割模块采用的解码器的网络结构都是相同的。其中解码器包括:第六卷积层conv6(1,2)(3×3×256)×2,步长2;第七卷积层conv7(1,2)(3×3×128)×2,步长2;第八卷积层conv8(1,2)(3×3×64)×2,步长2;第九卷积层conv9(1,2)(3×3×32)×2,步长2;第十卷积层conv10(1×1×3),步长1;第一上采样层upsample:把con5_2和conv4_2级联;第二上采样层upsample:把con6_2和conv3_2级联;第三上采样层upsample:把con7_2和conv2_2级联;第四上采样层upsample:把con8_2和conv1_2级联。上采样层分别连在两个卷积层之间,且每个卷积层后都经过LeakyReLU激活函数。S212: The network structures of the two decoders adopted by the sub-network of the backbone network and the decoder adopted by the pixel segmentation module of the auxiliary network are the same. The decoder includes: sixth convolutional layer conv6(1,2)(3×3×256)×2, stride 2; seventh convolutional layer conv7(1,2)(3×3×128)×2 , stride 2; the eighth convolutional layer conv8(1,2)(3×3×64)×2, stride 2; the ninth convolutional layer conv9(1,2)(3×3×32)×2 , stride 2; tenth convolution layer conv10 (1×1×3), stride 1; first upsampling layer upsample: concatenate con5_2 and conv4_2; second upsampling layer upsample: concatenate con6_2 and conv3_2 ; The third upsampling layer upsample: concatenate con7_2 and conv2_2; the fourth upsampling layer upsample: concatenate con8_2 and conv1_2. The upsampling layer is connected between two convolutional layers, and each convolutional layer is passed through the LeakyReLU activation function.

S213:辅助网络的文本计数模块采用CNN来进行文本数量的预测来完成简单的分类任务,其中解码器网络结构包括五个卷积层,五个卷积层的尺寸分别为(N×N×512),(1×1×256),(1×1×128),(1×1×64),(1×1×1)。S213: The text counting module of the auxiliary network uses CNN to predict the number of texts to complete a simple classification task, wherein the decoder network structure includes five convolutional layers, and the dimensions of the five convolutional layers are (N×N×512 ), (1×1×256), (1×1×128), (1×1×64), (1×1×1).

S214:主干网络的连接方式是:输入到恢复网络的低质量图片先经过降噪子网络,然后再把输出结果输入到校正子网络。在这其中,降噪网络和校正网络的编码器后会分别引出一个输出,这两个编码器的输出特征图的和记为一个特征集F然后分别喂给辅助网络中的计数解码器和分割解码器。文本计数能够准确地分开车牌的每一个文本字符,预测文本的数量,使得图片更适用于后面的LPR网络来进行检测。像素分割可以使得车牌的图片更清晰,更易于进行文本识别。S214: The connection mode of the backbone network is as follows: the low-quality picture input to the restoration network first passes through the noise reduction sub-network, and then the output result is input to the correction sub-network. In this, the encoder of the noise reduction network and the correction network will lead to an output respectively. The sum of the output feature maps of the two encoders is recorded as a feature set F and then fed to the counting decoder and segmentation in the auxiliary network respectively. decoder. The text count can accurately separate each text character of the license plate, predict the number of text, and make the image more suitable for the subsequent LPR network for detection. Pixel segmentation can make the picture of the license plate clearer and easier for text recognition.

S22:子图

Figure BDA0002450439400000061
i∈{-30°,-15°,+15°,+30°}喂给降噪子网络MD用来训练,其中w是降噪网络的参数,此子网络的损失函数表示如下:S22: Subgraph
Figure BDA0002450439400000061
i∈{-30°,-15°,+15°,+30°} is fed to the noise reduction sub-network MD for training, where w is the parameter of the noise reduction network, and the loss function of this sub-network is expressed as follows:

Figure BDA0002450439400000062
Figure BDA0002450439400000062

S23:子图

Figure BDA0002450439400000063
喂给校正子网络MR用来训练,其中w是校正子网络的参数,其损失函数表达如下:S23: Subgraph
Figure BDA0002450439400000063
It is fed to the syndrome network MR for training, where w is the parameter of the syndrome network, and its loss function is expressed as follows:

Figure BDA0002450439400000064
Figure BDA0002450439400000064

S24:子图

Figure BDA0002450439400000071
喂给像素分割解码器模块AS用以训练,其中F表示主网络的两个子网络的编码器求和之后的特征集合,
Figure BDA0002450439400000072
表明该某个像素的是车牌区域的概率。
Figure BDA0002450439400000073
表明像素真实分类,交叉熵损失函数公式如下:S24: Subgraph
Figure BDA0002450439400000071
is fed to the pixel segmentation decoder module A S for training, where F represents the feature set after the summation of the encoders of the two sub-networks of the main network,
Figure BDA0002450439400000072
Indicates the probability that a certain pixel is a license plate area.
Figure BDA0002450439400000073
Indicates the true classification of pixels, and the formula of the cross-entropy loss function is as follows:

Figure BDA0002450439400000074
Figure BDA0002450439400000074

S25:计数标签值C喂给像素分割解码器模块AC用以训练,Cpred表示预测值,CG,T表示标签值,损失函数如下:S25: The count label value C is fed to the pixel segmentation decoder module A C for training, C pred represents the predicted value, C G, T represent the label value, and the loss function is as follows:

Figure BDA0002450439400000075
Figure BDA0002450439400000075

S26:最后整个恢复网络的损失函数可表示为的所有子目标函数的权重和:S26: Finally, the loss function of the entire restoration network can be expressed as the weight sum of all sub-objective functions:

Figure BDA0002450439400000076
Figure BDA0002450439400000076

S27:根据这个总的损失函数可以调参训练出最优的模型S27: According to this total loss function, parameters can be adjusted to train the optimal model

S3:在每一个epoch完成后,用验证集来检验训练模型的准确率,进而来调节模型的超参数,优化模型来获得更好性能,具体步骤如下:S3: After each epoch is completed, use the validation set to test the accuracy of the training model, and then adjust the hyperparameters of the model and optimize the model to obtain better performance. The specific steps are as follows:

S31:根据S2得到的训练模型,把验证集的车牌图片输入到恢复网络中,得到输出结果

Figure BDA0002450439400000077
然后输入到LPR网络中进行目标检测,可以得到图片的准确率结果,然后根据结果可以适当来调节超参数优化模型。S31: According to the training model obtained in S2, input the license plate picture of the verification set into the recovery network, and obtain the output result
Figure BDA0002450439400000077
Then input it into the LPR network for target detection, and the accuracy results of the pictures can be obtained, and then the hyperparameter optimization model can be appropriately adjusted according to the results.

S4:将测试集图像输入到已确定好的优选模型,测试其泛化性能,观察车牌图像的恢复效果如何,具体步骤包括:S4: Input the test set image into the determined optimal model, test its generalization performance, and observe the recovery effect of the license plate image. The specific steps include:

S41:将测试集的图片Itest,再次输入到恢复网络,其输出结果输入到LPR网络,得到的输出可以表示为:S41: Input the image I test of the test set to the restoration network again, and input the output result to the LPR network, and the obtained output can be expressed as:

LPRresult=LPR(MR(MD(Itest)))LPR result = LPR(MR ( M D (I test )))

S42:然后通过判断结果的准确率和现有的SOTA车牌识别网络模型来比较。总结得知本发明的网络性能的优秀程度,是目前采用一样数据集的网络结构中的最优网络,识别的准确率也高达93%。S42: Then compare the accuracy of the judgment result with the existing SOTA license plate recognition network model. It is concluded that the network performance of the present invention is excellent, and it is the optimal network in the network structure using the same data set at present, and the recognition accuracy rate is as high as 93%.

上述实施例为本发明较佳的实施方式,但本发明的实施方式并不受上述实施例的限制,其他的任何未背离本发明的精神实质与原理下所作的改变、修饰、替代、组合、简化,均应为等效的置换方式,都包含在本发明的保护范围之内。The above-mentioned embodiments are preferred embodiments of the present invention, but the embodiments of the present invention are not limited by the above-mentioned embodiments, and any other changes, modifications, substitutions, combinations, The simplification should be equivalent replacement manners, which are all included in the protection scope of the present invention.

Claims (6)

1. A method for recovering a license plate image for LPR is characterized by comprising the following steps:
step S1: after a series of operations are carried out on the images in the known data set, the images are proportionally divided into a training set, a verification set and a test set;
step S2: establishing a model of an image recovery network for license plate recognition, and training the model by using a training set to obtain a corresponding training model;
step S3: the verification set is used for checking the accuracy of the training model, so that the hyper-parameters of the model are adjusted, and the model is optimized to obtain better performance;
step S4: and inputting the test set image into the determined optimal model, testing the generalization performance of the test set image, and observing the recovery effect of the license plate image.
2. The method for restoring a license plate image for LPR of claim 1, wherein said step of S1 further comprises the steps of:
step S11: adopting a plurality of famous license plate recognition data sets VTLP, and carrying out verification and test on a training set, a verification set and a test set according to the following steps of 6:2:2 to divide;
step S12: to increase the amount of training data, the pairsThe training set generates four sub-pictures by adopting rotation with different angles, and doubles the four sub-pictures by a size conversion and segmentation method; the original training picture is marked as IHThe divided four rotated subgraphs are
Figure FDA0002450439390000011
The subgraph after the size transformation is marked as
Figure FDA0002450439390000012
The picture after binary segmentation by pixel is recorded as
Figure FDA0002450439390000013
The character count value is C.
3. The method for restoring a license plate image for LPR of claim 1, wherein said step of S2 further comprises the steps of:
s21: setting a main network and an auxiliary network of a recovery network, wherein the main network comprises two sub-networks, and the auxiliary network comprises two decoder modules and then respectively trains the sub-networks and the modules;
s22: training noise reduction subnetwork MD
S23: training the syndrome network MR
S24: training pixel segmentation module AS
S25: training character technology module AC
S26: the four loss function weights are added.
4. The method for restoring a license plate image for LPR of claim 1, wherein said step of S3 further comprises: after obtaining the training model through step S2, the LPR network can obtain the output image result provided by the correction module through connection with the syndrome network
Figure FDA0002450439390000014
And then adjust the modelAnd (4) super-parameter.
5. The method for restoring a license plate image for LPR of claim 1, wherein said step of S4 further comprises: providing the test set picture to a license plate recognition network, and respectively obtaining recognition results LPR through a license plate image recovery network and an LPR networkresult
6. The method for restoring a license plate image for LPR of claim 1, wherein said sequence of operations of step S1 comprises averaging, defogging and cropping.
CN202010291139.1A 2020-04-14 2020-04-14 Method for recovering license plate image for LPR Pending CN111583130A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010291139.1A CN111583130A (en) 2020-04-14 2020-04-14 Method for recovering license plate image for LPR

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010291139.1A CN111583130A (en) 2020-04-14 2020-04-14 Method for recovering license plate image for LPR

Publications (1)

Publication Number Publication Date
CN111583130A true CN111583130A (en) 2020-08-25

Family

ID=72111549

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010291139.1A Pending CN111583130A (en) 2020-04-14 2020-04-14 Method for recovering license plate image for LPR

Country Status (1)

Country Link
CN (1) CN111583130A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111882004A (en) * 2020-09-28 2020-11-03 北京易真学思教育科技有限公司 Model training method, question judging method, device, equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100177982A1 (en) * 2009-01-09 2010-07-15 Sony Corporation Image processing device, image processing method, program, and imaging device
CN110866561A (en) * 2019-11-18 2020-03-06 佛山市南海区广工大数控装备协同创新研究院 Plastic bottle color sorting method based on image recognition

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100177982A1 (en) * 2009-01-09 2010-07-15 Sony Corporation Image processing device, image processing method, program, and imaging device
CN110866561A (en) * 2019-11-18 2020-03-06 佛山市南海区广工大数控装备协同创新研究院 Plastic bottle color sorting method based on image recognition

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
YOUNKWAN LEE等: "SNIDER: Single Noisy Image Denoising and Rectification for Improving License Plate Recognition" *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111882004A (en) * 2020-09-28 2020-11-03 北京易真学思教育科技有限公司 Model training method, question judging method, device, equipment and storage medium
CN111882004B (en) * 2020-09-28 2021-01-05 北京易真学思教育科技有限公司 Model training method, question judging method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN114841972B (en) Transmission line defect recognition method based on saliency map and semantic embedding feature pyramid
WO2023083280A1 (en) Scene text recognition method and device
CN113592736A (en) Semi-supervised image deblurring method based on fusion attention mechanism
CN112396607A (en) Streetscape image semantic segmentation method for deformable convolution fusion enhancement
CN112581409B (en) Image defogging method based on end-to-end multiple information distillation network
CN108764063A (en) A kind of pyramidal remote sensing image time critical target identifying system of feature based and method
CN109886200B (en) A Generative Adversarial Network-Based Detection Method for Unmanned Vehicle Lane Lines
CN111179167A (en) An image super-resolution method based on multi-stage attention enhancement network
CN113505640B (en) A small-scale pedestrian detection method based on multi-scale feature fusion
CN112232351B (en) A license plate recognition system based on deep neural network
CN112364838B (en) Method for improving handwriting OCR performance by utilizing synthesized online text image
Song et al. RoboFusion: Towards robust multi-modal 3D object detection via SAM
CN106886778B (en) License plate character segmentation and recognition method in monitoring scene
CN110647820B (en) Low-resolution face recognition method based on feature space super-resolution mapping
CN113838064A (en) A Cloud Removal Method Using Multitemporal Remote Sensing Data Based on Branch GAN
CN114187581A (en) A fine-grained detection method for driver distraction based on unsupervised learning
CN117351360A (en) Remote sensing image road extraction method based on attention mechanism improvement
CN115131414B (en) UAV image alignment method, electronic device and storage medium based on deep learning
CN117197756A (en) Hidden danger area intrusion detection method, device, equipment and storage medium
CN117115616A (en) A real-time low-light image target detection method based on convolutional neural network
CN110503049B (en) A method for estimating the number of vehicles in satellite video based on generative adversarial network
CN116363535A (en) Ship detection method in unmanned aerial vehicle aerial image based on convolutional neural network
CN111583130A (en) Method for recovering license plate image for LPR
CN114898304A (en) Vehicle tracking method and device, road side equipment and network side equipment
CN113591861A (en) Image processing method, image processing device, computing equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200825

RJ01 Rejection of invention patent application after publication