CN114066808B - Pavement defect detection method and system based on deep learning - Google Patents
Pavement defect detection method and system based on deep learning Download PDFInfo
- Publication number
- CN114066808B CN114066808B CN202111182085.6A CN202111182085A CN114066808B CN 114066808 B CN114066808 B CN 114066808B CN 202111182085 A CN202111182085 A CN 202111182085A CN 114066808 B CN114066808 B CN 114066808B
- Authority
- CN
- China
- Prior art keywords
- convolution
- defects
- image
- crack
- road surface
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000007547 defect Effects 0.000 title claims abstract description 91
- 238000001514 detection method Methods 0.000 title claims abstract description 45
- 238000013135 deep learning Methods 0.000 title claims abstract description 18
- 238000000034 method Methods 0.000 claims abstract description 15
- 238000013528 artificial neural network Methods 0.000 claims abstract description 14
- 238000000605 extraction Methods 0.000 claims abstract description 14
- 239000000284 extract Substances 0.000 claims abstract description 13
- 238000003709 image segmentation Methods 0.000 claims abstract description 13
- 230000006870 function Effects 0.000 claims description 19
- 238000004364 calculation method Methods 0.000 claims description 14
- 238000010276 construction Methods 0.000 claims description 14
- 238000012549 training Methods 0.000 claims description 13
- 230000011218 segmentation Effects 0.000 claims description 10
- 238000005070 sampling Methods 0.000 claims description 9
- 230000004913 activation Effects 0.000 claims description 8
- 238000011156 evaluation Methods 0.000 claims description 8
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 7
- 238000011176 pooling Methods 0.000 claims description 6
- 238000011158 quantitative evaluation Methods 0.000 claims description 6
- 238000013139 quantization Methods 0.000 claims description 6
- 230000003044 adaptive effect Effects 0.000 claims description 3
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 230000006835 compression Effects 0.000 claims description 3
- 238000007906 compression Methods 0.000 claims description 3
- 238000010586 diagram Methods 0.000 description 7
- 238000012423 maintenance Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 6
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 239000010426 asphalt Substances 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000013341 scale-up Methods 0.000 description 2
- 101001121408 Homo sapiens L-amino-acid oxidase Proteins 0.000 description 1
- 101000827703 Homo sapiens Polyphosphoinositide phosphatase Proteins 0.000 description 1
- 102100026388 L-amino-acid oxidase Human genes 0.000 description 1
- 102100023591 Polyphosphoinositide phosphatase Human genes 0.000 description 1
- 101100012902 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FIG2 gene Proteins 0.000 description 1
- 101100233916 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR5 gene Proteins 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 230000010339 dilation Effects 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000003708 edge detection Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 238000011897 real-time detection Methods 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/0002—Inspection of images, e.g. flaw detection
- G06T7/0004—Industrial image inspection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/11—Region-based segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/60—Analysis of geometric attributes
- G06T7/62—Analysis of geometric attributes of area, perimeter, diameter or volume
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/60—Analysis of geometric attributes
- G06T7/66—Analysis of geometric attributes of image moments or centre of gravity
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Geometry (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Quality & Reliability (AREA)
- Image Analysis (AREA)
- Investigating Materials By The Use Of Optical Means Adapted For Particular Applications (AREA)
Abstract
Description
技术领域Technical Field
本发明涉及路面缺陷检测技术领域,尤其涉及一种基于深度学习的路面缺陷检测方法及系统。The present invention relates to the technical field of road surface defect detection, and in particular to a road surface defect detection method and system based on deep learning.
背景技术Background Art
随着交通运输业的快速发展,我国已经形成具有一定规模的公路交通网。在使用过程中产生的大量表面缺陷标志着现有公路已进入大面积检测和养护阶段。而公路交通网的愈发多样化和复杂化,致使该领域对缺陷检测方法的性能要求越来越高。因此,能够及时发现和更加精准且快速地实现裂缝检测对公路养护和行车安全具有重大意义。With the rapid development of the transportation industry, my country has formed a road network of a certain scale. The large number of surface defects generated during use indicates that the existing roads have entered the stage of large-scale inspection and maintenance. The increasing diversity and complexity of the road network has led to higher and higher performance requirements for defect detection methods in this field. Therefore, being able to detect cracks in a timely manner and more accurately and quickly is of great significance to road maintenance and driving safety.
路面裂缝是最为常见的病害之一,对公路安全产生了巨大威胁。能够及时发现、修复和阻止进一步恶化成为了交通部门的一项重要任务。早期路面缺陷检测方法主要有摄像测量法、探地雷达法、激光测量和红外线测量法等。这些方法通常采用人工评估,实地获取数据再回到实验室进行数据分析。这种评估方法不仅增加了成本,费时费力,而且检测精度低,达不到要求。之后对路面缺陷检测技术的研究主要集中在机器学习方面,其中包括定向梯度直方图、支持向量机、Canny边缘检测和Otsu阈值分割等检测方法。这类基于阈值的检测方法在简单背景下检测效果较好。但现如今路况复杂程度不断加深,一些诸如油渍、阴影和异物等原因严重影响着检测精度,制约着图像分割技术的发展。从而满足不了实际检测要求。Pavement cracks are one of the most common diseases, posing a huge threat to highway safety. Being able to detect, repair and prevent further deterioration in a timely manner has become an important task for the transportation department. Early pavement defect detection methods mainly include video measurement, ground penetrating radar, laser measurement and infrared measurement. These methods usually use manual evaluation, obtain data on the spot and then return to the laboratory for data analysis. This evaluation method not only increases costs, is time-consuming and labor-intensive, but also has low detection accuracy and does not meet the requirements. Later, research on pavement defect detection technology focused on machine learning, including detection methods such as oriented gradient histogram, support vector machine, Canny edge detection and Otsu threshold segmentation. This type of threshold-based detection method has a good detection effect under simple backgrounds. However, nowadays, the complexity of road conditions continues to deepen, and some factors such as oil stains, shadows and foreign matter seriously affect the detection accuracy and restrict the development of image segmentation technology. Therefore, it cannot meet the actual detection requirements.
随着深度学习的发展,各种深度神经网路被用于道路缺陷检测领域,能够挖掘出更深层次的特征,在很大程度上滤除复杂背景带来的影响,成为了道路缺陷检测领域的主流趋势。典型的全卷积网络(FCN)因其跳级融合结构取得的良好分割效果成为了语义分割领域具有里程碑式意义的分割网络。但FCN将下采样阶段获得的裂缝边缘、图案和形状等不同特征进行了相同的处理,这样对分割结果影响较大。而且对于路面裂缝这种比较纤细的目标来说,FCN的连续卷积操作会使局部细节丢失严重。为了改善这一问题,国内外诸多研究学者对FCN进行了改进,诞生出了U-Net和SegNet等网络,这些网络沿用了FCN编解码结构的基本思路。不同的是,二者都对上采样阶段获得的特征图与下采样阶段获得的底层特征进行了融合,增加了定位的准确性和语义信息的完整性,提高了裂缝分割效果。但是,这些基于编解码结构的网络仅对一些裂缝连续性好且与强干扰对比度大等情况检测效果较为理想。因此,在光线变化大、背景干扰性强以及一些纤细裂缝与噪声边界模糊等情况下检测效果依然不佳。由此可见,实现高精度的裂缝检测依然是一个研究重点。With the development of deep learning, various deep neural networks have been used in the field of road defect detection. They can mine deeper features and filter out the influence of complex background to a large extent, becoming the mainstream trend in the field of road defect detection. The typical fully convolutional network (FCN) has become a milestone segmentation network in the field of semantic segmentation due to its good segmentation effect achieved by its skip-level fusion structure. However, FCN treats different features such as crack edges, patterns and shapes obtained in the downsampling stage in the same way, which has a great impact on the segmentation results. Moreover, for relatively thin targets such as road cracks, the continuous convolution operation of FCN will cause serious loss of local details. In order to improve this problem, many researchers at home and abroad have improved FCN and created networks such as U-Net and SegNet. These networks follow the basic idea of the FCN codec structure. The difference is that both of them fuse the feature map obtained in the upsampling stage with the underlying features obtained in the downsampling stage, which increases the accuracy of positioning and the integrity of semantic information, and improves the crack segmentation effect. However, these networks based on codec structures are only ideal for some cracks with good continuity and large contrast with strong interference. Therefore, the detection effect is still poor when the light changes greatly, the background interference is strong, and the boundaries between some fine cracks and noise are blurred. It can be seen that achieving high-precision crack detection is still a research focus.
综上所述,寻求一种路面缺陷检测方法以解决上述问题是非常有必要的。In summary, it is very necessary to seek a road defect detection method to solve the above problems.
发明内容Summary of the invention
本发明的目的在于提供一种基于深度学习的路面缺陷检测方法及系统,通过设置U-MDM,该模块采用4个尺度的上下采样结构,并且结合了膨胀卷积的优点,克服了常用标准卷积提取的特征单一的缺点,并且可以去除冗余信息减少计算量。The purpose of the present invention is to provide a road defect detection method and system based on deep learning. By setting U-MDM, the module adopts a 4-scale up and down sampling structure and combines the advantages of dilated convolution, overcoming the disadvantage of single feature extraction of commonly used standard convolution, and can remove redundant information to reduce the amount of calculation.
为了解决上述技术问题,本发明提供的基于深度学习的路面缺陷检测方法及系统的技术方案具体如下:In order to solve the above technical problems, the technical solutions of the road surface defect detection method and system based on deep learning provided by the present invention are as follows:
第一方面,本发明实施例公开了一种基于深度学习的路面缺陷检测方法,所述方法包括以下步骤:In a first aspect, an embodiment of the present invention discloses a road surface defect detection method based on deep learning, the method comprising the following steps:
图像分割模型构建,所述图像分割模型构建包括U型多尺度扩张网络,所述U型多尺度扩张网络将U型多尺度膨胀卷积模块嵌入到U-Net深度神经网络上采样之前形成了U-MDN,U-MDN使用U-Net特征提取网络作为主干,提取从不同卷积阶段得到的多尺度特征,使用U-MDM进一步提取缺陷深层特征,再将浅层特征和深层特征中得到的多尺度特征进行高效融合,以得到预测结果;Image segmentation model construction, which includes a U-shaped multi-scale dilated network, which embeds a U-shaped multi-scale dilated convolution module into a U-Net deep neural network before upsampling to form a U-MDN. The U-MDN uses the U-Net feature extraction network as the backbone to extract multi-scale features obtained from different convolution stages, and uses the U-MDM to further extract deep features of defects, and then efficiently fuses the multi-scale features obtained from shallow features and deep features to obtain prediction results;
路面缺陷量化评定,所述路面缺陷量化评定根据得到的预测结果对缺陷的量化参数进行量化计算,并且基于量化参数进一步计算路面破损指标,得到相应的路面破损等级。The pavement defect quantitative assessment is to quantify the defect parameters according to the obtained prediction results, and further calculate the pavement damage index based on the quantified parameters to obtain the corresponding pavement damage grade.
在上述任一方案中优选的是,U-Net深度神经网络包含有下采样图像压缩路径和上采样图像扩展路径的对称U型结构。In any of the above solutions, preferably, the U-Net deep neural network includes a symmetrical U-shaped structure including a down-sampling image compression path and an up-sampling image expansion path.
在上述任一方案中优选的是,所述下采样阶段使用了4个Convolution block1和1个Convolution block2,上采样阶段使用了4个Convolution block3,Convolution block1和3中每个卷积块都使用了两次卷积核为3*3的卷积操作和一次2*2的MaxPooling操作,Convolution block2中使用了三次卷积核为3*3的卷积操作。In any of the above schemes, preferably, four Convolution block1s and one Convolution block2 are used in the downsampling stage, four Convolution block3s are used in the upsampling stage, each convolution block in Convolution block1 and 3 uses two convolution operations with a convolution kernel of 3*3 and one MaxPooling operation of 2*2, and three convolution operations with a convolution kernel of 3*3 are used in Convolution block2.
在上述任一方案中优选的是,U-MDN网络由4个Convolution block1、1个Convolution block2、1个U-MDM和4个Convolution block3组成,U-MDN网络输入图像为256*256*3,输出特征为256*256*2的黑白二值图像。In any of the above schemes, preferably, the U-MDN network consists of 4 Convolution block1s, 1 Convolution block2, 1 U-MDM and 4 Convolution block3s, the U-MDN network input image is 256*256*3, and the output feature is a black and white binary image of 256*256*2.
在上述任一方案中优选的是,所述图像分割模型构建包括模型训练,训练过程中,设置超参数Batch size为8,每次训练设置150个epoch,初始学习率lr为1e-4,使用Adam优化器来收敛网络,并使用Adam的动量和自适应学习率来加快收敛速度。In any of the above schemes, preferably, the image segmentation model construction includes model training. During the training process, the hyperparameter Batch size is set to 8, 150 epochs are set for each training, the initial learning rate lr is 1e-4, the Adam optimizer is used to converge the network, and Adam's momentum and adaptive learning rate are used to speed up the convergence speed.
在上述任一方案中优选的是,所述对缺陷的量化参数进行量化计算包括以下步骤:In any of the above solutions, preferably, the quantitative calculation of the quantization parameter of the defect comprises the following steps:
根据裂缝图像像素数量N和实际尺寸D计算出换算比例分割后的裂缝图像的实际裂缝尺寸为D=p·N,将分割后的二值图像进行裂缝轮廓提取,将裂缝区域用内接圆填充。Calculate the conversion ratio based on the number of crack image pixels N and the actual size D The actual crack size of the segmented crack image is D=p·N. The crack contour is extracted from the segmented binary image, and the crack area is filled with an inscribed circle.
在上述任一方案中优选的是,所述对缺陷的量化参数进行量化计算包括以下步骤:h为裂缝区域外轮廓长度,w为裂缝区域外轮廓宽度,将相邻圆连接圆心,计算圆心之间的距离,wi为相邻两个圆的圆心像素横坐标之差,hi为相邻两个圆的圆心像素纵坐标之差,则li为裂缝的近似实际距离,那么n个圆心间的距离就是该整条裂缝的实际长度L,实际宽度R为内接圆直径平均值,则需要计算的量化参数宽度的计算方式为: 长度的计算方式为:外接矩形面积的计算方式为:S=p2·h·w。In any of the above schemes, preferably, the quantitative calculation of the quantization parameter of the defect includes the following steps: h is the length of the outer contour of the crack area, w is the width of the outer contour of the crack area, the centers of adjacent circles are connected, and the distance between the centers is calculated, w i is the difference between the horizontal coordinates of the center pixels of two adjacent circles, hi is the difference between the vertical coordinates of the center pixels of two adjacent circles, then l i is the approximate actual distance of the crack, then the distance between the n center points is the actual length L of the entire crack, the actual width R is the average value of the diameter of the inscribed circle, and the calculation method of the quantization parameter width to be calculated is: The length is calculated as: The area of the circumscribed rectangle is calculated as: S = p 2 ·h·w.
在上述任一方案中优选的是,所述U-MDN将得到的16*16*512特征图分别进行了Pooling size为16*16、8*8、4*4和2*2的MaxPooling操作之后分别得到了1*1*512、2*2*512、4*4*512和8*8*512的特征图,再经过膨胀率为1、2、4和6的卷积操作得到1*1*256、2*2*256、4*4*256和8*8*256的特征图。Preferably, in any of the above schemes, the U-MDN performs MaxPooling operations with Pooling sizes of 16*16, 8*8, 4*4 and 2*2 on the obtained 16*16*512 feature map to obtain feature maps of 1*1*512, 2*2*512, 4*4*512 and 8*8*512 respectively, and then performs convolution operations with expansion rates of 1, 2, 4 and 6 to obtain feature maps of 1*1*256, 2*2*256, 4*4*256 and 8*8*256.
在上述任一方案中优选的是,所述特征图经过反卷积还原尺寸到256个高16像素、宽16像素的特征图,反卷积核大小分别为16*16、8*8、4*4和2*2,将4个16*16*256进行叠加融合得到16*16*1024的特征图,再经过一个卷积操作最终得到U-MDM的输出特征图,尺寸为16*16*512,经过U-MDM之后在进行4个Convolution block3得到了32个高256像素、宽256像素的特征图,经过Sigmoid激活函数得到了256*256*2的路面缺陷检测图像,每幅图像均为黑白二值图像,分为缺陷区域和背景区域两类,其中白色区域代表缺陷,黑色区域代表背景。In any of the above schemes, it is preferred that the feature map is restored to 256 feature maps with a height of 16 pixels and a width of 16 pixels after deconvolution, and the deconvolution kernel sizes are 16*16, 8*8, 4*4 and 2*2 respectively. Four 16*16*256 are superimposed and fused to obtain a feature map of 16*16*1024, and then a convolution operation is performed to finally obtain the output feature map of U-MDM with a size of 16*16*512. After U-MDM, four Convolution block3s are performed to obtain 32 feature maps with a height of 256 pixels and a width of 256 pixels. After the Sigmoid activation function, a 256*256*2 road defect detection image is obtained, and each image is a black and white binary image, which is divided into two categories: defect area and background area, where the white area represents the defect and the black area represents the background.
本发明与现有技术相比,具有如下有益效果:Compared with the prior art, the present invention has the following beneficial effects:
通过增加U-MDM,该模块采用4个尺度的上下采样结构,并且结合了膨胀卷积的优点,克服了常用标准卷积提取的特征单一的缺点,可以去除冗余信息减少计算量;通过使用改进的深度神经网路对路面缺陷进行实时检测,克服了传统人工检测路面方法的缺点,提高了检测效率;对路面缺陷检测结果进行路面状况健康评定,为后续公路养护管理提供了充足的数据支持。By adding U-MDM, the module adopts a 4-scale up and down sampling structure and combines the advantages of dilated convolution, overcoming the disadvantage of single feature extraction of commonly used standard convolution, removing redundant information and reducing the amount of calculation; by using an improved deep neural network to perform real-time detection of pavement defects, it overcomes the shortcomings of traditional manual pavement detection methods and improves detection efficiency; the pavement condition health assessment is performed on the pavement defect detection results, providing sufficient data support for subsequent highway maintenance management.
第二方面,一种基于深度学习的路面缺陷检测系统,包括:In a second aspect, a road surface defect detection system based on deep learning includes:
构建模块,用于图像分割模型构建,所述图像分割模型构建包括U型多尺度扩张网络,所述U型多尺度扩张网络将U型多尺度膨胀卷积模块嵌入到U-Net深度神经网络上采样之前形成了U-MDN,U-MDN使用U-Net特征提取网络作为主干,提取从不同卷积阶段得到的多尺度特征,使用U-MDM进一步提取缺陷深层特征,再将浅层特征和深层特征中得到的多尺度特征进行高效融合,以得到预测结果;A construction module for image segmentation model construction, wherein the image segmentation model construction includes a U-shaped multi-scale dilated network, wherein the U-shaped multi-scale dilated convolution module is embedded into the U-Net deep neural network before upsampling to form a U-MDN, wherein the U-MDN uses the U-Net feature extraction network as a backbone to extract multi-scale features obtained from different convolution stages, and uses the U-MDM to further extract deep features of defects, and then efficiently fuses the multi-scale features obtained from shallow features and deep features to obtain prediction results;
评定模块,用于路面缺陷量化评定,所述路面缺陷量化评定根据得到的预测结果对缺陷的量化参数进行量化计算,并且基于量化参数进一步计算路面破损指标,得到相应的路面破损等级。The evaluation module is used for quantitative evaluation of road surface defects. The quantitative evaluation of road surface defects quantitatively calculates the quantitative parameters of the defects according to the obtained prediction results, and further calculates the road surface damage index based on the quantitative parameters to obtain the corresponding road surface damage grade.
本发明与现有技术相比,具有如下有益效果:通过增加U-MDM,该模块采用4个尺度的上下采样结构,并且结合了膨胀卷积的优点,克服了常用标准卷积提取的特征单一的缺点。并且可以去除冗余信息减少计算量;通过使用改进的深度神经网路对路面缺陷进行实时检测,克服了传统人工检测路面方法的缺点,提高了检测效率;对路面缺陷检测结果进行路面状况健康评定,为后续公路养护管理提供了充足的数据支持。Compared with the prior art, the present invention has the following beneficial effects: by adding U-MDM, the module adopts a 4-scale up-down sampling structure and combines the advantages of dilated convolution, overcoming the disadvantage of single feature extraction of commonly used standard convolution. It can also remove redundant information and reduce the amount of calculation; by using an improved deep neural network to detect road defects in real time, it overcomes the shortcomings of traditional manual road detection methods and improves detection efficiency; the road condition health assessment is performed on the road defect detection results, providing sufficient data support for subsequent highway maintenance management.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
附图用于对本发明的进一步理解,与本发明的实施例一起用于解释本发明,并不构成对本发明的限制。The accompanying drawings are used for further understanding of the present invention and are used to explain the present invention together with the embodiments of the present invention, but do not constitute a limitation of the present invention.
图1是本发明一种基于深度学习的路面缺陷检测方法整体结构框图;FIG1 is a block diagram of the overall structure of a road surface defect detection method based on deep learning according to the present invention;
图2是本发明一种基于深度学习的路面缺陷检测方法卷积块示意图;FIG2 is a schematic diagram of a convolution block of a road surface defect detection method based on deep learning according to the present invention;
图3是本发明一种基于深度学习的路面缺陷检测方法U-MDN结构示意图;FIG3 is a schematic diagram of the structure of a road surface defect detection method U-MDN based on deep learning according to the present invention;
图4是本发明一种基于深度学习的路面缺陷检测方法U-MDN网络结构示意图。FIG4 is a schematic diagram of a U-MDN network structure of a road surface defect detection method based on deep learning according to the present invention.
图5是本发明一种基于深度学习的路面缺陷检测方法计算量化参数示意图。FIG5 is a schematic diagram of calculating quantitative parameters of a road surface defect detection method based on deep learning according to the present invention.
图6是本发明一种基于深度学习的路面缺陷检测系统示意图。FIG6 is a schematic diagram of a road surface defect detection system based on deep learning according to the present invention.
具体实施方式DETAILED DESCRIPTION
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。In order to make the purpose, technical solution and advantages of the present invention more clearly understood, the present invention is further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention and are not intended to limit the present invention.
一种基于深度学习的路面缺陷检测方法,具体内容包括:图像分割模型构建和路面缺陷量化评定,本发明将路面缺陷检测视为二值分类任务,通过改进U-Net对路面缺陷进行特征提取。将缺陷图像中每一个像素分为缺陷和背景两类,以便获得精确的缺陷量化信息。具体来说,本发明设计了U型多尺度膨胀卷积模块(U-Multiscale Dilated Module,U-MDM)。将U-MDM嵌入到U-Net上采样之前形成U型多尺度扩张网络(U-Multiscale DilatedNetwork,U-MDN)。该网络使用U-Net特征提取网络作为主干,提取从不同卷积阶段得到的多尺度特征。然后使用U-MDM进一步提取缺陷深层特征,再将浅层特征和深层特征中得到的多尺度特征进行高效融合,最终得到预测结果。A road surface defect detection method based on deep learning, specifically including: image segmentation model construction and road surface defect quantitative assessment. The present invention regards road surface defect detection as a binary classification task, and extracts features of road surface defects by improving U-Net. Each pixel in the defect image is divided into two categories: defect and background, so as to obtain accurate defect quantification information. Specifically, the present invention designs a U-type multi-scale dilated convolution module (U-Multiscale Dilated Module, U-MDM). U-MDM is embedded into U-Net before upsampling to form a U-type multi-scale dilated network (U-Multiscale Dilated Network, U-MDN). The network uses the U-Net feature extraction network as the backbone to extract multi-scale features obtained from different convolution stages. Then, U-MDM is used to further extract deep features of defects, and then the multi-scale features obtained from shallow features and deep features are efficiently fused to finally obtain prediction results.
特征提取网络下采样阶段共有5个卷积块,将不同分辨率的缺陷图像统一尺寸到3通道,高256像素,宽256像素的RGB图像之后输入到该网络中。经过下采样得到512通道,高16像素,宽16像素的图像。再将16*16*512特征层输入到U-MDM中经过4个尺度的最大池化和膨胀卷积之后进行融合再次得到16*16*512特征层,最后经过U-Net上采样的4个卷积块得到256*256*32特征层,在将其通过Sigmoid函数得到只包含缺陷和背景的尺寸为256*256*2的路面缺陷预测图像。There are 5 convolution blocks in the downsampling stage of the feature extraction network. The defect images of different resolutions are unified into 3-channel, 256-pixel-high, 256-pixel-wide RGB images and then input into the network. After downsampling, an image with 512 channels, 16 pixels in height, and 16 pixels in width is obtained. The 16*16*512 feature layer is then input into U-MDM, and after 4-scale maximum pooling and dilated convolution, it is fused to obtain a 16*16*512 feature layer again. Finally, the 4 convolution blocks upsampled by U-Net obtain a 256*256*32 feature layer, which is then passed through the Sigmoid function to obtain a road defect prediction image of size 256*256*2 containing only defects and background.
将分割模型搭建完成之后,使用数据集并设置超参数对模型进行训练。训练完毕之后使用训练好的模型对缺陷图像进行检测。最终实时生成缺陷掩码图像以进行后续缺陷量化评定。After the segmentation model is built, the model is trained using the data set and hyperparameters are set. After the training is completed, the trained model is used to detect defect images. Finally, a defect mask image is generated in real time for subsequent defect quantitative assessment.
路面缺陷图像分割后的需要对提取出来的缺陷目标进行信息量化,以便后续进行高效的公路养护管理。根据得到的预测结果对各种缺陷进行量化计算,包括各种线性裂缝的长度、宽度和方向,网状裂缝的覆盖面积以及裂纹密集程度,坑洞、凸起和凹陷等缺陷的几何面积以及外接矩形长度和宽度等量化参数。并且基于这些量化参数进一步计算路面破损指标,最终得到相应的路面破损等级。After the pavement defect image is segmented, the extracted defect targets need to be quantified for efficient subsequent highway maintenance management. Various defects are quantified based on the prediction results, including the length, width and direction of various linear cracks, the coverage area of reticular cracks and the density of cracks, the geometric area of defects such as potholes, protrusions and depressions, and quantitative parameters such as the length and width of the circumscribed rectangle. The pavement damage index is further calculated based on these quantitative parameters, and the corresponding pavement damage grade is finally obtained.
根据相关公路状况评定标准,在评价路面状况时,一般采用破损类型和破损程度两个指标对其进行评定。路面破损主要分为横纵向线性裂缝、网状和块状等不规则裂缝、车辙、凹陷凸起和坑槽等几种类型,破损程度根据缝宽可以分为轻度、中度和重度三个等级。由此可见,不同裂缝类型的破损严重等级判定也不同。为统一标准,采用路面状况指数PCI对其进行评估。PCI的值域范围为[0,100],反映了路面破损严重程度,值越小,路面破损状况就越严重。According to the relevant highway condition assessment standards, when evaluating the pavement condition, two indicators, namely damage type and damage degree, are generally used to assess it. Pavement damage is mainly divided into several types, such as horizontal and vertical linear cracks, irregular cracks such as mesh and block cracks, rutting, depressions and convexities, and potholes. The degree of damage can be divided into three levels according to the width of the crack: mild, moderate, and severe. It can be seen that the severity of damage is also different for different crack types. In order to unify the standards, the pavement condition index PCI is used to evaluate it. The range of PCI is [0,100], which reflects the severity of pavement damage. The smaller the value, the more serious the pavement damage.
为了更好地理解上述技术方案,下面将结合说明书附图及具体实施方式对本发明技术方案进行详细说明。In order to better understand the above technical solution, the technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings and specific implementation methods.
实施例:Example:
本发明设计的U-MDN在整体网络结构上主要由U-Net和U-MDM两部分组成。先使用U-Net提取从不同卷积块得到的多尺度信息,然后使用U-MDM从四个不同的尺度进一步提取深层特征,最后利用U-Net上采样将浅层特征与深层特征相融合。最终通过Sigmoid函数得到预测结果。The U-MDN designed by the present invention mainly consists of two parts in the overall network structure: U-Net and U-MDM. U-Net is first used to extract multi-scale information obtained from different convolution blocks, and then U-MDM is used to further extract deep features from four different scales. Finally, U-Net upsampling is used to merge shallow features with deep features. Finally, the prediction result is obtained through the Sigmoid function.
分割网络设计,U-Net深度神经网络是包含有下采样图像压缩路径和上采样图像扩展路径的对称U型结构。这种结构沿用了FCN编解码器结构的基本思路。且两个网络都使用卷积操作代替全连接操作,这样能够大大减少网络模型参数量,提高模型训练速度。不同的是U-Net将重点放在上采样阶段,增加了上采样阶段的feature map数量,再将其与下采样得到的底层特征进行融合,增加了定位的准确性和语义信息的完整性,使得网络更具有泛化能力。由于这种结构中的下采样过程可以捕捉丰富的语义信息,上采样过程可以进行精确定位,所以利用少量数据就可以得到很好的效果。因此,本发明使用U-Net作为特征提取网络的主干,提取从下采样过程中不同卷积块中获得的语义信息。Segmentation network design, U-Net deep neural network is a symmetrical U-shaped structure that includes a down-sampled image compression path and an up-sampled image expansion path. This structure follows the basic idea of the FCN codec structure. And both networks use convolution operations instead of full connection operations, which can greatly reduce the number of network model parameters and improve the model training speed. The difference is that U-Net focuses on the upsampling stage, increases the number of feature maps in the upsampling stage, and then fuses it with the underlying features obtained by downsampling, which increases the accuracy of positioning and the integrity of semantic information, making the network more generalized. Since the downsampling process in this structure can capture rich semantic information and the upsampling process can perform precise positioning, good results can be obtained using a small amount of data. Therefore, the present invention uses U-Net as the backbone of the feature extraction network to extract semantic information obtained from different convolution blocks in the downsampling process.
在U-Net中使用的卷积块见图2,图中(a)和(b)是下采样阶段的卷积块,(c)为上采样阶段的卷积块。下采样阶段使用了4个Convolution block1和1个Convolution block2,上采样阶段使用了4个Convolution block3,Convolution block1和3中每个卷积块都使用了两次卷积核为3*3的卷积操作和一次2*2的MaxPooling操作,而Convolution block2中使用了三次卷积核为3*3的卷积操作。并且在每个卷积块的卷积层和激活函数之间都加入了Batch Normalization(BN),使用BN会使得梯度变大,避免了梯度消失得问题。而梯度变大意味学习收敛速度加快,大大加快了模型训练速度。激活函数使用了ReLU和Sigmoid函数,在网络的最后一层使用的是Sigmoid激活函数,而在其它卷积层用的是ReLu激活函数。ReLu激活函数效果比Sigmoid函数好,收敛速度也比较快,适用于解决深度网络梯度消失的问题。而sigmoid函数能够将计算结果限制在[0,1]范围内,方便利用概率去区分目标与背景,更加适用于图像的二分类任务,常用于网络最后一层。ReLu和Sigmoid函数计算公式分别见(1)和(2)。The convolution blocks used in U-Net are shown in Figure 2. (a) and (b) are convolution blocks in the downsampling stage, and (c) is a convolution block in the upsampling stage. Four convolution blocks 1 and one convolution block 2 are used in the downsampling stage, and four convolution blocks 3 are used in the upsampling stage. Each convolution block in Convolution block 1 and 3 uses two convolution operations with a convolution kernel of 3*3 and one MaxPooling operation with a convolution kernel of 2*2, while three convolution operations with a convolution kernel of 3*3 are used in Convolution block 2. Batch Normalization (BN) is added between the convolution layer and the activation function of each convolution block. Using BN will increase the gradient and avoid the problem of gradient disappearance. The increase in gradient means that the learning convergence speed is accelerated, which greatly speeds up the model training speed. The activation function uses ReLU and Sigmoid functions. The Sigmoid activation function is used in the last layer of the network, while the ReLu activation function is used in other convolution layers. The ReLu activation function has better effects than the Sigmoid function and converges faster. It is suitable for solving the problem of vanishing gradients in deep networks. The Sigmoid function can limit the calculation results to the range of [0,1], making it easier to use probability to distinguish between targets and backgrounds. It is more suitable for binary classification tasks of images and is often used in the last layer of the network. The calculation formulas for the ReLu and Sigmoid functions are shown in (1) and (2) respectively.
基于编解码器结构的神经网络虽然可以提取裂缝的多尺度信息,但是这些多尺度信息不足以将裂缝从一些复杂性强的背景中检测出来。因此,受到其他研究者使用的多尺度膨胀模块(MDM)的启发,并考虑到U-Net只用了一种标准卷积滤波器提取裂缝特征,这样只能获得裂缝的一种上下文信息。而对于宽度、长度和方向等具有不同拓扑结构的复杂裂缝图像来说,只使用一种卷积层不利于多种有效特征的提取。而且连续的卷积操作对辨识度高的目标识别效果较好,但对于一些辨识度低的目标很不友好。而膨胀卷积可以在不损失信息的情况下增大感受野进而有效地提取更加丰富的语义信息,增强对分辨力较弱目标区域的识别能力。因此,结合膨胀卷积的特点设计了U型多尺度膨胀卷积模块(U-MDM)。Although neural networks based on codec structures can extract multi-scale information of cracks, this multi-scale information is not enough to detect cracks from some complex backgrounds. Therefore, inspired by the multi-scale dilation module (MDM) used by other researchers, and considering that U-Net only uses one standard convolution filter to extract crack features, only one contextual information of the crack can be obtained. For complex crack images with different topological structures such as width, length and direction, using only one convolution layer is not conducive to the extraction of multiple effective features. Moreover, continuous convolution operations have a good effect on the recognition of targets with high recognition, but are not friendly to some targets with low recognition. Dilated convolution can increase the receptive field without losing information, thereby effectively extracting richer semantic information and enhancing the recognition ability of target areas with weak resolution. Therefore, the U-type multi-scale dilated convolution module (U-MDM) is designed in combination with the characteristics of dilated convolution.
U-MDM网络结构如图3,该模块融合了四种不同尺度的深度特征信息。具体来说,U-MDM并没有直接将U-Net下采样之后获得的特征层进行融合,而是先经过了Pooling size为16、8、4、2四种不同尺度的最大池化操作,这样可以去除冗余信息并减少计算量。之后再经过卷积块,与U-Net中的卷积块不同的是该卷积块中卷积操作的膨胀率为1、2、4和6,卷积核均为3*3,且在卷积之后加入了BN和ReLu。最后经过上采样还原尺寸输入到U-Net上采样结构中进行融合。这样就形成了一个U型结构有助于进一步提取裂缝更深层次的特征。The U-MDM network structure is shown in Figure 3. This module integrates deep feature information of four different scales. Specifically, U-MDM does not directly fuse the feature layer obtained after U-Net downsampling, but first undergoes a maximum pooling operation with four different scales of Pooling size 16, 8, 4, and 2, which can remove redundant information and reduce the amount of calculation. It then passes through a convolution block. Unlike the convolution block in U-Net, the expansion rates of the convolution operation in this convolution block are 1, 2, 4, and 6, and the convolution kernels are all 3*3, and BN and ReLu are added after the convolution. Finally, after upsampling and restoring the size, it is input into the U-Net upsampling structure for fusion. This forms a U-shaped structure that helps to further extract deeper features of the crack.
U-MDN的整体结构示意图如图4所示,该网络由4个Convolution block1、1个Convolution block2、1个U-MDM和4个Convolution block3组成。该网络输入图像为256*256*3(分别为宽256像素,高256像素,3个通道,后续均为宽*高*通道数),输出特征为256*256*2的黑白二值图像。具体来说,输入的256*256*3图像连续经过了4个Convolutionblock1得到了256个高16像素、宽16像素的特征图。再经过1个Convolution block2得到了512个高16像素、宽16像素的特征图。再经过U-MDM之后仍然得到了512个高16像素、宽16像素的特征图。不同的是,该特征图融合了4种不同尺度的深层特征,首先,U-MDM将得到的16*16*512特征图分别进行了Pooling size为16*16、8*8、4*4和2*2的MaxPooling操作之后分别得到了1*1*512、2*2*512、4*4*512和8*8*512的特征图,再经过了膨胀率为1、2、4和6的卷积操作得到了1*1*256、2*2*256、4*4*256和8*8*256的特征图,之后又经过了反卷积还原尺寸到256个高16像素、宽16像素的特征图,反卷积核大小分别为16*16、8*8、4*4和2*2。最后将4个16*16*256进行叠加融合得到16*16*1024的特征图,再经过一个卷积操作最终得到U-MDM的输出特征图,尺寸为16*16*512。经过了U-MDM之后在进行4个Convolution block3得到了32个高256像素、宽256像素的特征图,最终经过Sigmoid激活函数得到了256*256*2的路面缺陷检测图像,每幅图像均为黑白二值图像,分为缺陷区域和背景区域两类,其中白色区域代表缺陷,黑色区域代表背景。The overall structure diagram of U-MDN is shown in Figure 4. The network consists of 4 Convolution blocks 1, 1 Convolution block 2, 1 U-MDM and 4 Convolution blocks 3. The network input image is 256*256*3 (256 pixels wide, 256 pixels high, 3 channels, and width*height*number of channels in the following), and the output feature is a 256*256*2 black and white binary image. Specifically, the input 256*256*3 image passes through 4 Convolution blocks 1 in succession to obtain 256 feature maps with a height of 16 pixels and a width of 16 pixels. After passing through 1 Convolution block 2, 512 feature maps with a height of 16 pixels and a width of 16 pixels are obtained. After passing through U-MDM, 512 feature maps with a height of 16 pixels and a width of 16 pixels are still obtained. The difference is that this feature map integrates four deep features of different scales. First, U-MDM performs MaxPooling operations with Pooling sizes of 16*16*512, 8*8, 4*4 and 2*2 on the 16*16*512 feature map to obtain 1*1*512, 2*2*512, 4*4*512 and 8*8*512 feature maps respectively. Then, it undergoes convolution operations with expansion rates of 1, 2, 4 and 6 to obtain feature maps of 1*1*256, 2*2*256, 4*4*256 and 8*8*256. After that, it undergoes deconvolution to restore the size to 256 feature maps with a height of 16 pixels and a width of 16 pixels. The deconvolution kernel sizes are 16*16, 8*8, 4*4 and 2*2 respectively. Finally, the four 16*16*256 images are superimposed and fused to obtain a 16*16*1024 feature map, and then a convolution operation is performed to finally obtain the output feature map of U-MDM, with a size of 16*16*512. After U-MDM, four Convolution blocks3 are performed to obtain 32 feature maps with a height of 256 pixels and a width of 256 pixels. Finally, a 256*256*2 road defect detection image is obtained through the Sigmoid activation function. Each image is a black and white binary image, which is divided into two categories: defect area and background area. The white area represents the defect and the black area represents the background.
模型训练,训练过程中,设置超参数Batch size为8,每次训练设置150个epoch,初始学习率lr为1e-4,使用Adam优化器来收敛网络,并使用Adam的动量和自适应学习率来加快收敛速度。Adam代替了经典随机梯度下降法(Stochastic Gradient Descent.SGD),可以更有效更新网络权重。During model training, the hyperparameter Batch size is set to 8, 150 epochs are set for each training, the initial learning rate lr is 1e-4, and the Adam optimizer is used to converge the network. Adam's momentum and adaptive learning rate are used to speed up the convergence. Adam replaces the classic stochastic gradient descent (SGD) method, which can update the network weights more effectively.
在深度神经网络中,常用到交叉熵作为处理分类问题的损失函数。因此,训练过程使用了神经网络模型应用广泛的交叉熵损失函数计算损失值(Loss)以描述模型在训练过程中预测结果与真实标签的误差值大小。研究中使用的损失函数是交叉熵中的一种Binarycrossentropy(见式3),这种损失函数常用来处理二分类问题,而且在使用过程中会涉及到计算类别概率,所以通常和式1中的sigmoid函数一起使用。In deep neural networks, cross entropy is often used as a loss function for handling classification problems. Therefore, the training process uses the cross entropy loss function, which is widely used in neural network models, to calculate the loss value (Loss) to describe the error between the model's prediction results and the true label during the training process. The loss function used in the study is a type of cross entropy, Binarycrossentropy (see Formula 3). This loss function is often used to handle binary classification problems, and it involves calculating class probabilities during use, so it is usually used together with the sigmoid function in Formula 1.
其中,N为输出尺寸大小,n∈[1,N],yn∈{0,1}为真实标签,为预测概率。Where N is the output size, n∈[1,N], y n ∈{0,1} is the true label, is the predicted probability.
路面缺陷量化评定,在路面缺陷检测任务中,对不同缺陷进行几何参数计算是为了给后续的路面修补工作提供数据支持。路面缺陷量化分别对单向裂缝的长度和宽度等参数和网状裂缝的块度和面积等参数进行计算。针对裂缝需要计算的特征参数有宽度、长度、外接矩形面积。Quantitative assessment of pavement defects. In the task of pavement defect detection, the geometric parameters of different defects are calculated to provide data support for subsequent pavement repair work. Pavement defect quantification calculates parameters such as the length and width of unidirectional cracks and the size and area of reticular cracks. The characteristic parameters that need to be calculated for cracks are width, length, and circumscribed rectangular area.
首先,根据裂缝图像像素数量N和实际尺寸D计算出换算比例则分割后的裂缝图像的实际裂缝尺寸为D=p·N。其次,将分割后的二值图像进行裂缝轮廓提取,将裂缝区域用内接圆填充,请参看图5,h为裂缝区域外轮廓长度,w为裂缝区域外轮廓宽度。将相邻圆连接圆心,计算圆心之间的距离,wi为相邻两个圆的圆心像素横坐标之差,hi为相邻两个圆的圆心像素纵坐标之差,则li为裂缝的近似实际距离。那么n个圆心间的距离就是该整条裂缝的实际长度L,实际宽度R为内接圆直径平均值。需要计算的量化参数以及实际意义如下:First, the conversion ratio is calculated based on the number of crack image pixels N and the actual size D Then the actual crack size of the segmented crack image is D=p·N. Secondly, extract the crack contour of the segmented binary image and fill the crack area with an inscribed circle. Please refer to Figure 5, h is the length of the outer contour of the crack area, and w is the width of the outer contour of the crack area. Connect the centers of adjacent circles and calculate the distance between the centers. w i is the difference in the horizontal coordinates of the center pixels of two adjacent circles, and h i is the difference in the vertical coordinates of the center pixels of two adjacent circles. Then l i is the approximate actual distance of the crack. Then the distance between the n center points is the actual length L of the entire crack, and the actual width R is the average value of the inscribed circle diameter. The quantitative parameters that need to be calculated and their practical significance are as follows:
(1)宽度(1) Width
其物理意义是裂缝边界某点垂线与裂缝交点的距离,反映了裂缝开裂程度。Its physical meaning is the distance between the vertical line at a point on the crack boundary and the intersection of the crack, which reflects the degree of crack opening.
(2)长度(2) Length
其物理意义是细化后裂缝骨架长度,反映了裂缝延伸程度。Its physical meaning is the length of the crack skeleton after refinement, which reflects the extent of crack extension.
(3)外接矩形面积(3) Area of the circumscribed rectangle
S=p2·h·w (7)S=p 2 ·h·w (7)
其物理意义是各种形状裂缝外接矩形面积,反映了裂缝覆盖程度。Its physical meaning is the circumscribed rectangular area of cracks of various shapes, which reflects the degree of crack coverage.
在路面缺陷中,一般采用破损类型和破损程度两个指标对其进行评定,根据各种量化参数的计算公式得到了量化信息之后,就可以根据分类标准得到各种缺陷的破损程度。根据相关公路状况评定指标,列出了路面常见缺陷及其评定标准。如表2。Among the road surface defects, two indicators, damage type and damage degree, are generally used to evaluate them. After obtaining quantitative information based on the calculation formulas of various quantitative parameters, the damage degree of various defects can be obtained according to the classification standards. According to the relevant highway condition evaluation indicators, common road surface defects and their evaluation standards are listed. See Table 2.
表1路面缺陷类型及破损分级Table 1 Pavement defect types and damage classification
由此可见,不同裂缝类型的破损严重等级判定也不同。为统一标准,采用路面状况指数(PCI)对其进行评估。PCI值反映了路面破损严重程度,值越小,路面破损状况越严重,各项指标及维修意见如表3。It can be seen that the severity of damage of different crack types is also different. In order to unify the standard, the pavement condition index (PCI) is used to evaluate it. The PCI value reflects the severity of pavement damage. The smaller the value, the more serious the pavement damage. The various indicators and maintenance suggestions are shown in Table 3.
表2路面破损状况评价指标及养护意见Table 2 Evaluation indexes of pavement damage and maintenance suggestions
PCI的值域范围为[0,100],采用公式4和5来计算。The value range of PCI is [0,100] and is calculated using formulas 4 and 5.
其中,DR为路面破损率(%);Ai为第i类路面破损的累计面积(m2);A为考察路段的总面积(m2);ωi为第i类路面损坏的破损权重值;a0为沥青路面取值15.00,混凝土路面取值10.66;ai为沥青路面取值0.412,混凝土路面取值0.461;i0为包含破损程度的破坏类型总数。最终根据PCI值得出路面破损状况等级。Among them, DR is the pavement damage rate (%); Ai is the cumulative area of the i-th type of pavement damage (m 2 ); A is the total area of the investigated road section (m 2 ); ωi is the damage weight value of the i-th type of pavement damage; a0 is 15.00 for asphalt pavement and 10.66 for concrete pavement; ai is 0.412 for asphalt pavement and 0.461 for concrete pavement; i0 is the total number of damage types including the damage degree. Finally, the pavement damage condition grade is obtained according to the PCI value.
第二方面,一种基于深度学习的路面缺陷检测系统,包括:In a second aspect, a road surface defect detection system based on deep learning includes:
构建模块,用于图像分割模型构建,所述图像分割模型构建包括U型多尺度扩张网络,所述U型多尺度扩张网络将U型多尺度膨胀卷积模块嵌入到U-Net深度神经网络上采样之前形成了U-MDN,U-MDN使用U-Net特征提取网络作为主干,提取从不同卷积阶段得到的多尺度特征,使用U-MDM进一步提取缺陷深层特征,再将浅层特征和深层特征中得到的多尺度特征进行高效融合,以得到预测结果;A construction module for image segmentation model construction, wherein the image segmentation model construction includes a U-shaped multi-scale dilated network, wherein the U-shaped multi-scale dilated convolution module is embedded into the U-Net deep neural network before upsampling to form a U-MDN, wherein the U-MDN uses the U-Net feature extraction network as a backbone to extract multi-scale features obtained from different convolution stages, and uses the U-MDM to further extract deep features of defects, and then efficiently fuses the multi-scale features obtained from shallow features and deep features to obtain prediction results;
评定模块,用于路面缺陷量化评定,所述路面缺陷量化评定根据得到的预测结果对缺陷的量化参数进行量化计算,并且基于量化参数进一步计算路面破损指标,得到相应的路面破损等级。The evaluation module is used for quantitative evaluation of road surface defects. The quantitative evaluation of road surface defects quantitatively calculates the quantitative parameters of the defects according to the obtained prediction results, and further calculates the road surface damage index based on the quantitative parameters to obtain the corresponding road surface damage grade.
以上仅为本发明的优选实施例而已,并不用于限制本发明,尽管参照前述实施例对本发明进行了详细的说明,对于本领域的技术人员来说,其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above are only preferred embodiments of the present invention and are not intended to limit the present invention. Although the present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art can still modify the technical solutions described in the aforementioned embodiments or replace some of the technical features therein by equivalents. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included in the protection scope of the present invention.
Claims (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111182085.6A CN114066808B (en) | 2021-10-11 | 2021-10-11 | Pavement defect detection method and system based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111182085.6A CN114066808B (en) | 2021-10-11 | 2021-10-11 | Pavement defect detection method and system based on deep learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114066808A CN114066808A (en) | 2022-02-18 |
CN114066808B true CN114066808B (en) | 2024-10-22 |
Family
ID=80234498
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111182085.6A Active CN114066808B (en) | 2021-10-11 | 2021-10-11 | Pavement defect detection method and system based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114066808B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114581415A (en) * | 2022-03-08 | 2022-06-03 | 成都数之联科技股份有限公司 | Method and device for detecting defects of PCB, computer equipment and storage medium |
CN114820493B (en) * | 2022-04-15 | 2023-08-22 | 西南交通大学 | Method for detecting split of composite material orifice caused by hole making |
CN115012281B (en) * | 2022-05-30 | 2023-09-05 | 海南大学 | Road surface quality detection method and device |
CN115228690B (en) * | 2022-08-08 | 2023-08-08 | 深圳市深赛尔股份有限公司 | Paint surface repairing device and method based on environment-friendly water-based coil steel coating |
CN116758507B (en) * | 2023-07-03 | 2023-12-19 | 中铁建设集团有限公司 | Pavement quality analysis method, device and program based on disease image acquisition and segmentation |
CN118918551A (en) * | 2024-07-16 | 2024-11-08 | 成都红云鼎科技有限公司 | Road defect detection method based on deep learning |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013090830A1 (en) * | 2011-12-16 | 2013-06-20 | University Of Southern California | Autonomous pavement condition assessment |
JP5936239B2 (en) * | 2014-08-19 | 2016-06-22 | インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation | Road surface degradation degree estimation method and algorithm (crack detection method using Gabor filter output image of multi-resolution image) |
-
2021
- 2021-10-11 CN CN202111182085.6A patent/CN114066808B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN114066808A (en) | 2022-02-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114066808B (en) | Pavement defect detection method and system based on deep learning | |
Wang et al. | RENet: Rectangular convolution pyramid and edge enhancement network for salient object detection of pavement cracks | |
CN112215819B (en) | Airport pavement crack detection method based on depth feature fusion | |
CN113240623B (en) | Pavement disease detection method and device | |
CN114998852A (en) | Intelligent detection method for road pavement diseases based on deep learning | |
CN110569730A (en) | An automatic identification method of pavement cracks based on U-net neural network model | |
CN115661032A (en) | Intelligent pavement disease detection method suitable for complex background | |
CN112560895A (en) | Bridge crack detection method based on improved PSPNet network | |
CN111986164A (en) | Road crack detection method based on multi-source Unet + Attention network migration | |
CN114049538B (en) | Airport crack image countermeasure generation method based on UDWGAN ++ network | |
Wang et al. | Advanced crack detection and quantification strategy based on CLAHE enhanced DeepLabv3+ | |
CN116503336A (en) | Pavement crack detection method based on deep learning | |
CN116109616A (en) | Pavement crack detection and small-surface element fitting detection method based on YOLOv5 | |
CN115713488A (en) | Bridge apparent disease pixel level identification method and system based on instance segmentation | |
CN116052110B (en) | Intelligent positioning method and system for pavement marking defects | |
CN115393587A (en) | A detection method for highway asphalt pavement damage based on fusion convolutional neural network | |
CN117670855A (en) | RoadU-Net-based intelligent recognition and classification method for asphalt pavement diseases | |
Pang et al. | Multi-scale feature fusion model for bridge appearance defect detection | |
CN116863134A (en) | Method and system for detecting and dividing length and width of tunnel lining crack | |
CN114882205B (en) | Attention mechanism-based target detection method | |
Yin et al. | Promoting Automatic Detection of Road Damage: A High-Resolution Dataset, a New Approach, and a New Evaluation Criterion | |
Gooda et al. | Automatic detection of road cracks using efficientnet with residual u-net-based segmentation and yolov5-based detection | |
CN115170479A (en) | Automatic extraction method for asphalt pavement repairing diseases | |
CN118196028A (en) | An improved YOLOv8 method for extracting rural cement pavement defects | |
CN113643300A (en) | Pavement crack pixel level detection method based on Seg-CapsNet algorithm |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |