CN102103744A

CN102103744A - Image segmentation method based on minimum spanning trees and statistical learning theory

Info

Publication number: CN102103744A
Application number: CN 201110031892
Authority: CN
Inventors: 崔卫红; 潘斌
Original assignee: Wuhan University WHU
Current assignee: Wuhan University WHU
Priority date: 2011-01-28
Filing date: 2011-01-28
Publication date: 2011-06-22

Abstract

The present invention proposes an image segmentation method based on the minimum spanning tree and statistical learning theory. On the basis of constructing the image graph model, a bottom-up (merging) strategy is adopted, and the minimum spanning tree algorithm is designed based on statistical learning according to the present invention. Theoretical merging criteria to construct procedures for generating multiple minimum spanning trees. While satisfying the global optimum, the segmentation method can effectively utilize the regional statistical characteristics, and is suitable for various high-resolution image segmentation; it has good anti-noise performance, and can also obtain better segmentation results for textured areas. Get good region boundaries.

Description

Image Segmentation Method Based on Minimum Spanning Tree and Statistical Learning Theory

技术领域technical field

本方法属于图像处理与模式识别技术领域，特别是涉及一种新的基于最小生成树最优化理论与统计学习理论的面向对象的图像分割方法。The method belongs to the technical field of image processing and pattern recognition, and in particular relates to a new object-oriented image segmentation method based on minimum spanning tree optimization theory and statistical learning theory.

背景技术Background technique

高空间分辨率遥感影像为我们提供了地面景观的高精度空间几何信息、丰富的纹理信息以及多光谱信息，使得传统的基于像素的遥感影像分类方法已不适用，从而使得高分辨率遥感影像处理面临影像所提供的细节的挑战。为此，Baatz和Schape于1999年在[1]中指出：重要的语义解释更需要用有意义的影像中对象及对象之间的相互关系而不是用一个个像素来表示，因此，提出了面向对象的高分辨率遥感影像目标识别与分类方法，即首先对影像进行分割生成对象区域，并用分层网络来描述对象，再以对象为单位进行目标识别。Blaschke和Strobl在[1]发出“像素出了什么问题？”的疑问，指出传统的基于像素的多维特征空间分类没有利用空间信息，特别是对于相邻像素可能属于同一土地覆盖类型的高分辨率影像，以分割为基础的分类，在很多情况下优于传统每个像素的方法。基于对象的光谱、形状、纹理、空间关系以及人的知识进一步推理的新分类器证明在高空间分辨率领域非常有用，它符合人类识别目标的规律，提高了分类精度和细节。基于对象的研究领域的迅速发展使得地理信息科学领域又产生了一个基于对象的影像分析(Object-Based Image Analysis，OBIA)子学科，它专门研究自动将遥感影像分割成有意义的对象，通过空间、光谱和时间尺度来评估它们的特征，以产生GIS数据格式的新的地理信息[2]。面向对象的图像分割是获取对象的主要方法之一，面向对象的目标识别技术是通过对对象的光谱、几何、纹理、空间邻域关系等信息组合来识别目标，从理论和实际应用中发现，对象分割的好坏直接影响影像分类识别的效果和精度，因为它直接关系到能否准确、有效地提取图像上中目标的几何信息和结构信息，因此，面向对象高分辨率遥感影像分割成为遥感图像处理中的关键和基础，面向对象分割方法的研究也成为高空间分辨率热点和难点之一。High-spatial-resolution remote sensing images provide us with high-precision spatial geometric information, rich texture information, and multi-spectral information of the ground landscape, making traditional pixel-based remote sensing image classification methods inapplicable, thus making high-resolution remote sensing image processing more difficult. Challenged with the detail provided by imagery. For this reason, Baatz and Schape pointed out in [1] in 1999 that important semantic interpretation needs to be represented by meaningful objects and the relationship between objects in images rather than by pixels. The target recognition and classification method of high-resolution remote sensing images of objects, that is, firstly segment the images to generate object regions, and use a layered network to describe the objects, and then perform target recognition with objects as units. Blaschke and Strobl in [1] raised the question of "what's wrong with the pixels?", pointing out that the traditional pixel-based multidimensional feature space classification does not make use of spatial information, especially for high-resolution where adjacent pixels may belong to the same land cover type For imagery, segmentation-based classification outperforms traditional per-pixel approaches in many cases. New classifiers based on the object's spectrum, shape, texture, spatial relationship, and further reasoning on human knowledge proved to be very useful in the high-spatial-resolution domain, which conforms to the laws of human recognition of objects, improving classification accuracy and detail. The rapid development of the object-based research field has led to the emergence of an object-based image analysis (Object-Based Image Analysis, OBIA) sub-discipline in the field of geographic information science, which specializes in the automatic segmentation of remote sensing images into meaningful objects. , spectral and temporal scales to evaluate their characteristics to generate new geographic information in GIS data format [2]. Object-oriented image segmentation is one of the main methods to obtain objects. Object-oriented target recognition technology is to identify targets by combining information such as spectrum, geometry, texture, and spatial neighborhood relations of objects. It is found from theory and practical applications that The quality of object segmentation directly affects the effect and accuracy of image classification and recognition, because it is directly related to whether the geometric information and structural information of objects in the image can be accurately and effectively extracted. Therefore, object-oriented high-resolution remote sensing image segmentation has become a remote sensing The key and foundation of image processing, the research of object-oriented segmentation method has also become one of the hotspots and difficulties of high spatial resolution.

图像分割已经是图像处理、计算机视觉的一个经典课题，从早期的单目标与背景的分割到多目标分割；从像素级分割到对象级分割，从不确定目标分割到确定目标分割，研究人员根据需求提出了大量的分割算法。图像分割的基本策略是基于像素灰度值的两个基本特性：不连续性和相似性。区域内部的像素一般具有某种相似性，而在区域之间的边界上一般具有某种不连续性。所以分割算法可据此分为利用区域间特性不连续的基于边界的算法和利用区域内相似性的基于区域的算法。Image segmentation is already a classic topic in image processing and computer vision, from the early segmentation of single target and background to multi-target segmentation; from pixel-level segmentation to object-level segmentation, from uncertain target segmentation to definite target segmentation, researchers based on A large number of segmentation algorithms are required. The basic strategy of image segmentation is based on two basic properties of pixel gray values: discontinuity and similarity. Pixels inside regions generally have some similarity, while there generally are some discontinuities at the boundaries between regions. Therefore, segmentation algorithms can be divided into boundary-based algorithms that utilize discontinuity between regions and region-based algorithms that utilize intra-region similarity.

随着图像分割技术的发展，人们对图像分割的研究已经从最初的简单的阈值分割、边缘检测和区域提取转向将新的概念、新的方法引入图像分割领域，非常重视多种分割算法的有效结合。通过综合使用两种或两种以上的方法，可以部分克服单独的图像分割算法难以对一般图像取得令人满意的分割效果的问题，而采取什么样的结合方式才能体现各种方法的优点，弥补各自的不足，取得好的预期效果，仍是人们关注的主要问题之一。二十世纪八十年代以后，越来越多的学者开始将模糊理论、马尔科夫模型、遗传算法理论、分形理论和小波理论等研究成果运用于图像分割的研究，并取得了很大进展。但由于图像类型的复杂多样性，虽然人们已经进行了广泛深入的研究，但目前尚无通用的图像分割理论提出，现有算法都各有局限性。因此，探索新的分割理论和分割算法对于图像处理、分析具有重大意义With the development of image segmentation technology, people's research on image segmentation has changed from the initial simple threshold segmentation, edge detection and region extraction to the introduction of new concepts and new methods into the field of image segmentation, and attaches great importance to the effectiveness of multiple segmentation algorithms. combined. Through the comprehensive use of two or more methods, it is possible to partially overcome the problem that a single image segmentation algorithm is difficult to achieve satisfactory segmentation results for general images, and what kind of combination method can reflect the advantages of various methods and make up for it. Their respective deficiencies and achieving good expected results are still one of the main issues that people pay attention to. After the 1980s, more and more scholars began to apply fuzzy theory, Markov model, genetic algorithm theory, fractal theory and wavelet theory to the research of image segmentation, and made great progress. However, due to the complexity and diversity of image types, although people have conducted extensive and in-depth research, there is no general image segmentation theory proposed at present, and the existing algorithms have their own limitations. Therefore, exploring new segmentation theories and segmentation algorithms is of great significance for image processing and analysis

http://www.tu-dresden.de/ioer/statisch/segmentation-evaluation/index.html是一个专门用于遥感影像分割软件评价的网站，网站中简单介绍了各种面向对象的高分辨率遥感影像分割软件方法及性能，展示了各软件对高分辨率城市和郊区影像的分割结果。目前展示的主要有如下软件：BerkeleyImgSeg、Definens Developer、CAEASR、Data Dissection Tool、Edge Detection and Image SegmentationON(EDISON)System、ENVI Feature Extraction、Extended Watershed EWS(Multi-channel watershed transformation)、InfoPack、MinimumEntropy Approach、Image segmentation for Erdas Imagine、Imagine WS for Erdas Imagine、PABAT 0.32、RHSEG、SCRM、SegSAR、HaIcon Seg，每种软件的所采用的分割方法介绍如下：http://www.tu-dresden.de/ioer/statisch/segmentation-evaluation/index.html is a website dedicated to the evaluation of remote sensing image segmentation software. The website briefly introduces various object-oriented high-resolution remote sensing Image segmentation software methods and performance, showing the segmentation results of each software on high-resolution urban and suburban images. At present, the main software displayed are as follows: BerkeleyImgSeg, Definens Developer, CAEASR, Data Dissection Tool, Edge Detection and Image SegmentationON (EDISON) System, ENVI Feature Extraction, Extended Watershed EWS (Multi-channel watershed transformation), InfoPack, MinimumEntropy Approach, Image Segmentation for Erdas Imagine, Imagine WS for Erdas Imagine, PABAT 0.32, RHSEG, SCRM, SegSAR, HaIcon Seg, the segmentation method used by each software is introduced as follows:

Definens Developer和BerkeleyImgSeg是遥感领域目前推出的两款面向对象分割与分类软件，这两个分割软件的思想相同，其分割思想都来源于文献[3]，都采用区域增长与合并策略，其合并准则中考虑了多波段光谱特性的权重以及区域形状特性，实现多尺度、多层次的影像分割，上层对象是其下一层对象的合并，对图像进行分层描述，每层分割结果都统计有其光谱、纹理、形状、对象的拓扑特征、上下层对象之间关系的属性信息，此统计结果可以用于进一步影像分类与分析。Definens Developer and BerkeleyImgSeg are two object-oriented segmentation and classification software currently launched in the field of remote sensing. The ideas of these two segmentation software are the same, and their segmentation ideas are derived from literature [3]. Considering the weight of multi-band spectral characteristics and regional shape characteristics, it realizes multi-scale and multi-level image segmentation. The upper layer object is the combination of its lower layer objects, and the image is described layer by layer. The segmentation results of each layer have their own statistics. Spectrum, texture, shape, topological features of objects, and attribute information of the relationship between upper and lower objects. The statistical results can be used for further image classification and analysis.

CAEASR和InfoPack两个软件是依据文献[4]，假设影像中具有r个服从Gamma分布的区域，通过定义代价函数，采用模拟退火(Simulated Annealing，SA)方法完成图像分割。模拟退火方法是S.Kirkpatrick，C.D.Gelatt和M.P.Vecchi在1983年所发明的一种通用概率算法，用来在一个大的搜寻空间内找寻命题的最优解是一种通用的优化算法，目前已广泛应用于最优控制、机器学习、神经网络等优化问题，算法先以搜寻空间内一个任意点作起始，每一步先选择一个“邻居”，然后再计算从现有位置到达“邻居”的概率。CAEASR and InfoPack are based on the literature [4], assuming that there are r regions in the image that obey the Gamma distribution, by defining a cost function, and using the Simulated Annealing (SA) method to complete image segmentation. The simulated annealing method is a general probability algorithm invented by S.Kirkpatrick, C.D.Gelatt and M.P.Vecchi in 1983. It is a general optimization algorithm used to find the optimal solution of a proposition in a large search space. Widely used in optimal control, machine learning, neural network and other optimization problems, the algorithm starts with an arbitrary point in the search space, selects a "neighbor" at each step, and then calculates the distance from the current position to the "neighbor". probability.

Data Dissection Tool采用[5]中提出的超顺磁性聚类算法(Super ParamagneticClusting，SPC)实现图像分割和分析。超顺磁聚类方法是一种全新的思想，它将物理学的思想引人到信息领域，把非均匀Potts模型的热力学聚集运动看作是数据聚类，即在某个温度范围内，数据处于超顺磁相位，然后利用数据点之间的相关性来聚类。该算法对各种参数的依赖性不强，主要用于聚类、影像分析，统计物理。Data Dissection Tool uses the Super Paramagnetic Clustering algorithm (Super Paramagnetic Clustering, SPC) proposed in [5] to realize image segmentation and analysis. The superparamagnetic clustering method is a brand-new idea, which introduces the idea of physics into the information field, and regards the thermodynamic aggregation motion of the non-uniform Potts model as data clustering, that is, within a certain temperature range, the data In the superparamagnetic phase, the correlation between data points is then used to cluster. The algorithm is not strongly dependent on various parameters, and is mainly used for clustering, image analysis, and statistical physics.

Edge Detection and Image SegmentationON(EDISON)System，是基于文献[6]中提出的Mean shift算法完成对彩色影像的分割，Mean Shift这个概念最早由Fukunaga等人在[7]提出的关于概率密度梯度函数的估计，是一种非参数特征空间分析技术，该算法是一个迭代的过程，即先算出当前点的偏移均值，移动该点到其偏移均值，然后以此为新的起始点，继续移动，直到满足一定的条件结束。Mean Shift算法本质上是一个自适应的梯度上升搜索峰值的方法，Mean Shift主要用在聚类、模态的检测和最优化三个方面，已广泛应用于计算机视觉和图像处理的聚类。Edge Detection and Image SegmentationON (EDISON) System is based on the Mean shift algorithm proposed in the literature [6] to complete the segmentation of color images. The concept of Mean Shift was first proposed by Fukunaga et al. in [7] about the probability density gradient function Estimation is a non-parametric feature space analysis technique. The algorithm is an iterative process, that is, first calculate the offset mean of the current point, move the point to its offset mean, and then use this as a new starting point to continue moving , until certain conditions are met. The Mean Shift algorithm is essentially an adaptive gradient ascending search peak method. Mean Shift is mainly used in three aspects of clustering, mode detection and optimization, and has been widely used in computer vision and image processing clustering.

ENVI Feature Extraction在文献[8]和Mumford-Shah模型基础上引入λ参数控制分割尺度通过区域增长方式实现对影像的快速多尺度分割[9]。ENVI Feature Extraction introduces the λ parameter to control the segmentation scale based on the literature [8] and the Mumford-Shah model, and realizes rapid multi-scale segmentation of images by means of region growth [9].

EWS基于文献[10]提出的多通道扩展分水岭变换的分割方法。Minimum Entropy Approach采用三角测量法(Triangulation)、Image segmentation for Erdas Imagine、Imagine WS forErdas Imagine、PABAT32均采用区域增长方法，PABAT32根据文献[11]在分割中考虑了遥感影像不确定性；RHSEG采用层次区域增长(Hierarchical region growing，[12])；SCRM采用分水岭与区域增长相结合(Watershed and region growing，[13])、SegSAR、HaIcon Seg结合边缘与区域的分割(Hybrid edge/region oriented)。EWS is based on the multi-channel extended watershed transform segmentation method proposed in [10]. Minimum Entropy Approach uses Triangulation, Image segmentation for Erdas Imagine, Imagine WS for Erdas Imagine, and PABAT32 all use region growth methods. PABAT32 considers the uncertainty of remote sensing images in segmentation according to literature [11]; RHSEG uses hierarchical regions Growth (Hierarchical region growing, [12]); SCRM uses a combination of watershed and region growing (Watershed and region growing, [13]), SegSAR, HaIcon Seg combines edge and region segmentation (Hybrid edge/region oriented).

总结这些方法，可以发现面向对象的影像分割算法主要集中在区域增长和分水岭，同时引入模拟退火、Mean Shift、超顺磁性聚类算法、Mumford-Shah、综合区域与边缘特性的多尺度分割。这说明了区域增长、分裂与合并、层次聚类以及分水岭等分割算法结合其他准则是当前实现面向对象的高分辨率遥感影像多尺度分割采用的主要方式，大部分文献的研究重点是将新的理论加入到分割准则设置与优化实现多尺度影像分割。Summarizing these methods, it can be found that object-oriented image segmentation algorithms mainly focus on region growth and watershed, while introducing simulated annealing, Mean Shift, superparamagnetic clustering algorithm, Mumford-Shah, and multi-scale segmentation that integrates region and edge characteristics. This shows that segmentation algorithms such as region growth, splitting and merging, hierarchical clustering, and watershed combined with other criteria are the main methods currently used to achieve object-oriented multi-scale segmentation of high-resolution remote sensing images. Most of the literature focuses on the new Theory is added to the setting and optimization of segmentation criteria to achieve multi-scale image segmentation.

在基于区域增长的分割算法中，种子点的选择和合并准则的设置是影响分割效果的关键步骤，种子点应该选在同质区域的中心为最好，而选择和获取同质中心作为种子点是很困难的，如何充分利用图像的多光谱、形状、纹理等特性也是该方法需要在分割准则中考虑的。分水岭分割算法也是一种广泛应用的面向对象的图像分割算法，但其不足之处是容易出现过分割现象。因此，很多学者研究并提出解决这一问题的方法，如对梯度图像进行预处理以减少噪声，对过分割区域进行合并以及对输入图像的梯度取阈值等方法。In the segmentation algorithm based on region growth, the selection of seed points and the setting of merging criteria are the key steps that affect the segmentation effect. The seed point should be selected in the center of the homogeneous area as the best, and the selection and acquisition of the homogeneous center as the seed point It is very difficult, and how to make full use of the multi-spectral, shape, texture and other characteristics of the image is also the method that needs to be considered in the segmentation criteria. The watershed segmentation algorithm is also a widely used object-oriented image segmentation algorithm, but its disadvantage is that it is prone to over-segmentation. Therefore, many scholars have studied and proposed methods to solve this problem, such as preprocessing the gradient image to reduce noise, merging over-segmented regions, and thresholding the gradient of the input image.

背景技术引用文献：Background technology cited documents:

1.Blaschke，T.andJ.Strobl，What′s wrong with pixels？Some recent developments interfacing remote sensing andGIS.GIS-Zeitschrift für Geoinformationssysteme，2001.14(6)：p.12-17.1. Blaschke, T. and J. Strobl, What's wrong with pixels? Some recent developments interfacing remote sensing and GIS. GIS-Zeitschrift für Geoinformationssysteme, 2001.14(6): p.12-17.

2.Blaschke，T.，S.Lang，and G.J.Hay，eds.Object-Based Image Analysis：Spatial Concepts forKnowledge-Driven Remote Sensing Applications.Lecture Notes in Geoinformation and Cartography，ed.W.Cartwright，G.Gartner，L.Meng，et al.2008，Springer-Verlag Berlin Heide1berg.2. Blaschke, T., S. Lang, and G.J. Hay, eds. Object-Based Image Analysis: Spatial Concepts for Knowledge-Driven Remote Sensing Applications. Lecture Notes in Geoinformation and Cartography, ed. W. Cartwright, G. Gartner, L .Meng, et al.2008, Springer-Verlag Berlin Heidelberg.

3.Baatz，M.andA.

Multiresolution Segmentation：an optimization approach for high quality multi-scaleimage segmentation in Strobl，J.，Blaschke，T.，Griesebner，G.(eds)，Angewandte GeographischeInformationsverarbeitung XII.2000.Wichmann，Heidelberg.3. Baatz, M. and A.

Multiresolution Segmentation: an optimization approach for high quality multi-scale image segmentation in Strobl, J., Blaschke, T., Griesebner, G. (eds), Angewandte Geographische Informationsverarbeitung XII.2000.Wichmann, Heidelberg.

4.Cook，R.，I.McConnell，D.Stewart，and C.J.Oliver.Segmentation and simulated annealing.in MicrowaveSensing and Synthetic Aperture Radar 1996.Taormina，Italy SPIE.4. Cook, R., I. McConnell, D. Stewart, and C. J. Oliver. Segmentation and simulated annealing. in Microwave Sensing and Synthetic Aperture Radar 1996. Taormina, Italy SPIE.

5.Ferbe r，C.v.and F.

Cluster update algorithm and recognition.Physical Review E，2000.62(2)：p.Rl461-R1464.5. Ferber, Cvand F.

Cluster update algorithm and recognition. Physical Review E, 2000.62(2): p.Rl461-R1464.

6.Comaniciu，D.and P.Meer，Mean shift：a robust approach toward feature space analysis.Pattern Analysis andMachine Intelligence，IEEE Transactions on，2002.24(5)：p.603-619.6. Comaniciu, D. and P. Meer, Mean shift: a robust approach toward feature space analysis. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2002.24(5): p.603-619.

7.Fukunaga，K.and L.D.Hostetle r，The estimation of the gradient of a density function，with applications inpattern recognition.Information Theory，IEEE Transactions on，1975.21(1)：p.32-40.7. Fukunaga, K. and L.D. Hostetler, The estimation of the gradient of a density function, with applications in pattern recognition. Information Theory, IEEE Transactions on, 1975.21(1): p.32-40.

8.Koepfler，G.，C.Lopez，and J.M.Morel，A multiscale algorithm for image segmentation by variational method.SIAM Journal on Numerical Analysis 1994.31(1)：p.282-299.8. Koepfler, G., C.Lopez, and J.M.Morel, A multiscale algorithm for image segmentation by variational method. SIAM Journal on Numerical Analysis 1994.31(1): p.282-299.

9.Robinson，D.J.，N.J.Redding，and D.J.Crisp.Implementation of a fast algorithm for segmenting SAR imagery.[Technical Report(Defence Science and Technology Organisation(Australia))]2002-01.9.Robinson, D.J., N.J.Redding, and D.J.Crisp.Implementation of a fast algorithm for segmenting SAR imagery.[Technical Report(Defence Science and Technology Organization(Australia))]2002-01.

10.Li，P.and X.Xiao，Multispectral image segmentation by a multichannel watershed-based approach.International Journal of Remote Sensing，2007.28(19)：p.4429-4452.10. Li, P. and X. Xiao, Multispectral image segmentation by a multichannel watershed-based approach. International Journal of Remote Sensing, 2007.28(19): p.4429-4452.

11.Lucieer，A.，Uncertainties in Segmentation and their Visualisation.October 2004，Utrecht University andInternational Institute for Geo-Information Science and Earth Observation(ITC).11. Lucieer, A., Uncertainties in Segmentation and their Visualization. October 2004, Utrecht University and International Institute for Geo-Information Science and Earth Observation (ITC).

12.Tilton，J.C.Analysis of hierarchically related image segmentations.in Proc IEEE Workshop on Advances inTechniques for Analysis of Remotely Sensed Data.2003.12. Tilton, J.C.Analysis of hierarchically related image segmentations.in Proc IEEE Workshop on Advances in Techniques for Analysis of Remotely Sensed Data.2003.

13.Castilla，G.，G.J.Hay，and J.R.Ruiz-Gallardo，Size-constrained region merging(SCRM)：an automateddelineation tool for assisted photo interpretation.Photogrammetric Engineering and Remote Sensing，2008.74(4)：p.409-419.13.Castilla, G., G.J.Hay, and J.R.Ruiz-Gallardo, Size-constrained region merging (SCRM): an automated delineation tool for assisted photo interpretation. Photogrammetric Engineering and Remote Sensing, 2008.74(4): p.409-419.

发明内容Contents of the invention

针对上述困难与问题，本发明采用一种基于最小生成树理论的图像分割并提出基于统计学习理论设计的分割准则，实现了面向对象的图像分割，避免了区域增长中的种子点选择问题，并且有效解决了过分割问题。In view of the above-mentioned difficulties and problems, the present invention adopts a kind of image segmentation based on minimum spanning tree theory and proposes a segmentation criterion based on statistical learning theory design, realizes object-oriented image segmentation, avoids the problem of seed point selection in region growth, and It effectively solves the over-segmentation problem.

本发明所提供基于最小生成树和统计学习理论的图像分割方法，包括以下步骤：The image segmentation method based on minimum spanning tree and statistical learning theory provided by the present invention comprises the following steps:

步骤一、将图像用简单图模型来表示，简单图模型的每个顶点对应图像的一个像素，每两个邻接顶点之间用边连接，每条边的边权值为该边连接的两个顶点所对应两像素之间的差异；Step 1. The image is represented by a simple graph model. Each vertex of the simple graph model corresponds to a pixel of the image. Every two adjacent vertices are connected by an edge. The edge weight of each edge is the two pixels connected by the edge. The difference between the two pixels corresponding to the vertex;

步骤二、基于统计学习理论设定区域合并准则，并设置分割参数；Step 2. Set the region merging criterion based on the statistical learning theory, and set the segmentation parameters;

步骤三、对步骤一所得简单图模型进行基于最小生成树的影像分割，分割时从简单图模型中的最小权值边开始按区域合并准则进行合并直到最大权值边，合并生成的每个最小生成树代表一个连通的区域。Step 3: Carry out image segmentation based on the minimum spanning tree for the simple graph model obtained in step 1. When segmenting, start from the edge with the smallest weight in the simple graph model and merge it according to the region merging criterion until the edge with the largest weight. A spanning tree represents a connected region.

而且，步骤一中边权值计算方式为，两个顶点所对应两像素之间的差异采用波段加权平方距离和来计算。Moreover, the calculation method of the edge weight in step 1 is that the difference between the two pixels corresponding to the two vertices is calculated by using the band weighted squared distance sum.

而且，步骤二中所述基于统计学习理论设定区域合并准则具体实现方式为，将图像分割过程作为一个学习过程，学习过程是将图像分割看做是一个回归估计问题；根据统计学习理论中的基于经验风险最小化的β一致稳定学习算法设定区域合并准则，将经验风险最小化问题转化为一个概率问题，统计学习中的损失函数与步骤一中边权值计算方式相对应。Moreover, the specific implementation method of setting the region merging criterion based on the statistical learning theory in step 2 is to use the image segmentation process as a learning process, and the learning process is to regard the image segmentation as a regression estimation problem; according to the statistical learning theory The β-uniformly stable learning algorithm based on empirical risk minimization sets the regional merging criterion, and transforms the empirical risk minimization problem into a probability problem. The loss function in statistical learning corresponds to the edge weight calculation method in step 1.

而且，步骤二的分割参数包括分割尺度参数，通过设置分割尺度参数的数值影响步骤三影像分割所得区域的大小。分割尺度参数根据区域所允许的最大边权值设置。Moreover, the segmentation parameter in step 2 includes a segmentation scale parameter, and the size of the region obtained by image segmentation in step 3 is affected by setting the value of the segmentation scale parameter. The segmentation scale parameter is set according to the maximum edge weight allowed by the area.

而且，步骤二的分割参数包括最小区域面积参数，通过设置最小区域面积参数的数值防止步骤三影像分割时生成过小区域。Moreover, the segmentation parameters in step 2 include the minimum area parameter, and by setting the value of the minimum area parameter, it is prevented from generating an too small area during image segmentation in step 3.

而且，步骤三所述基于最小生成树的图像分割，采用Kruskal最小生成树算法实现。Moreover, the image segmentation based on the minimum spanning tree described in step 3 is realized by using the Kruskal minimum spanning tree algorithm.

本发明将图像用图模型来表示，图像分割中寻找同质区问题转化为最小生成树问题，同时将图像分割过程看作是一个学习过程；采用Kruskal最小生成树算法，并结合其算法特点，将边权函数与损失函数相对应，将统计学习理论融于最小生成树图像分割。该分割方法在满足全局最优的同时，能有效利用区域统计特性，适合于各类高分辨率影像分割；具有较好的抗噪声性能，对纹理区域也能得到较好的分割效果，同时能获得良好的区域边界。In the present invention, the image is represented by a graph model, and the problem of finding homogeneous regions in image segmentation is transformed into a minimum spanning tree problem, and the image segmentation process is regarded as a learning process; the Kruskal minimum spanning tree algorithm is adopted, combined with its algorithm characteristics, The edge weight function corresponds to the loss function, and the statistical learning theory is integrated into the minimum spanning tree image segmentation. While satisfying the global optimum, the segmentation method can effectively utilize the regional statistical characteristics, and is suitable for various high-resolution image segmentation; it has good anti-noise performance, and can also obtain better segmentation results for textured areas. Get good region boundaries.

附图说明Description of drawings

图1为对简单图模型表示基于自顶向下最小生成树分割结果的示意图，其中图1a为一个5*5影像的灰度值，图1b为与图1a对应的图模型表示，图1c为图1b的最小生成树，图1d为对图1c的分割结果。Figure 1 is a schematic diagram of a simple graph model based on top-down minimum spanning tree segmentation results, where Figure 1a is the gray value of a 5*5 image, Figure 1b is the representation of the graphical model corresponding to Figure 1a, and Figure 1c is The minimum spanning tree in Figure 1b, and Figure 1d is the segmentation result of Figure 1c.

图2为阈值影响分割结果的示意图，其中图2a为阈值取较大值时得到的分割结果，图2b为阈值取较小值时得到的分割结果。Fig. 2 is a schematic diagram of the threshold affecting the segmentation result, wherein Fig. 2a is the segmentation result obtained when the threshold value is larger, and Fig. 2b is the segmentation result obtained when the threshold value is smaller.

图3为本发明实施例的阈值变化规律示意图。FIG. 3 is a schematic diagram of threshold variation rules according to an embodiment of the present invention.

具体实施方式Detailed ways

结合附图和本发明的实施例，对本发明技术方案进行详细说明。实施例的实现过程如下：The technical solution of the present invention will be described in detail in conjunction with the accompanying drawings and the embodiments of the present invention. The implementation process of the embodiment is as follows:

步骤一、将图像用简单图模型来表示，简单图模型的每个顶点对应图像的一个像素，每两个邻接顶点之间用边连接，每条边的边权值为该边连接的两个顶点所对应两像素之间的差异。Step 1. The image is represented by a simple graph model. Each vertex of the simple graph model corresponds to a pixel of the image. Every two adjacent vertices are connected by an edge. The edge weight of each edge is the two pixels connected by the edge. The difference between the two pixels corresponding to the vertex.

简单图模型是现有金字塔模型中的一种。简单图模型中边权值为每条边的权重，实施例采用波段加权平方距离和来计算权重。具体计算方式为现有技术，本发明不予赘述。The simple graph model is one of the existing pyramid models. The edge weight in the simple graph model is the weight of each edge, and the embodiment adopts the band weighted square distance sum to calculate the weight. The specific calculation method is the prior art, and will not be described in detail in the present invention.

步骤二、基于统计学习理论设定区域合并准则，并设置分割参数。Step 2: Set the region merging criterion based on the statistical learning theory, and set the segmentation parameters.

实施例基于统计学习理论设定区域合并准则具体实现方式为，将图像分割过程作为一个学习过程，学习过程是将图像分割看做是一个回归估计问题；根据统计学习理论中的基于经验风险最小化的β一致稳定学习算法设定区域合并准则，将经验风险最小化问题转化为一个概率问题，统计学习中的损失函数与步骤一中边权值计算方式相对应。The embodiment sets the region merging criterion based on the statistical learning theory. The specific implementation method is to use the image segmentation process as a learning process, and the learning process is to regard the image segmentation as a regression estimation problem; according to the empirical risk minimization in the statistical learning theory The β-consistently stable learning algorithm sets the regional merging criterion, and transforms the empirical risk minimization problem into a probability problem. The loss function in statistical learning corresponds to the edge weight calculation method in step 1.

根据Bousquet和E1isseeff提出的改变一个元素的b一致稳定性算法及其根据McDiarmid不等式推导出的泛化误差界，根据统计学习理论思想，经过推导，设计了如下准则：According to Bousquet and Elisseeff's b-consistent stability algorithm for changing an element and the generalization error bound derived from McDiarmid's inequality, according to the idea of statistical learning theory, after derivation, the following criteria are designed:

其中，S₁和S₂代表当前两个区域，n₁和n₂表示区域S₁和S₂中的像素数(顶点数)，n表示当前边，w_n是按边权值从小到大合并过程中的当前边，它连接S₁和S₂表示的两个区域，&&表示同时满足。如果P(S₁，S₂)为TRUE则合并区域S₁和S₂，若P(S₁，S₂)为FALSE则不合并。Among them, S ₁ and S ₂ represent the current two regions, n ₁ and n ₂ represent the number of pixels (number of vertices) in regions S ₁ and S ₂ , n represents the current edge, and w _n is merged according to the edge weight from small to large The current edge in the process, which connects the two regions denoted by S ₁ and S ₂ , && means simultaneous satisfaction. If P(S ₁ , S ₂ ) is TRUE then merge regions S ₁ and S ₂ , if P(S ₁ , S ₂ ) is FALSE then do not merge.

该准则由三个参数(n_i，δ，M)来控制阈值，而且正好是控制分割尺度的参数。如附图3所示，取δ＝0.1(图中标记为delta＝0.1)，横坐标为n_i(区域大小)，纵坐标为th(这个阈值通过

计算)。沿纵轴方向，是M由小到大取不同值30、40、50、80、100、120时的变化曲线；沿横轴方向，是面积增大时的阈值变化趋势。n_i是区域i的像素数，其值随合并过程动态变化；概率δ是在0和1之间的一个小值，它的变化对阈值影响不大，具有微调作用，根据概率理论和前面的分析，我们通常设其值为0.1；M是损失上界，它对应于边权的上确界，我们可以根据最大边权值(区域内最大差异)来选择它的值。The criterion is controlled by three parameters (n _i , δ, M) to control the threshold, and it is precisely the parameter that controls the segmentation scale. As shown in Figure 3, take δ=0.1 (marked as delta=0.1 in the figure), the abscissa is _ni (region size), and the ordinate is th (this threshold passes

calculate). Along the vertical axis, it is the change curve when M takes different values of 30, 40, 50, 80, 100, and 120 from small to large; along the horizontal axis, it is the threshold value change trend when the area increases. n _i is the number of pixels in area i, and its value changes dynamically with the merging process; the probability δ is a small value between 0 and 1, and its change has little effect on the threshold, which has a fine-tuning effect. According to the probability theory and the previous Analysis, we usually set its value to 0.1; M is the upper bound of the loss, which corresponds to the supremum of the edge weight, and we can choose its value according to the maximum edge weight (the largest difference in the region).

本实施例中，将δ固定取为0.1，n_i随合并过程自动变化，因此，仅取M作为控制分割尺度的参数，通过设置不同的分割尺度参数，可以得到影像在不同尺度下的分割结果。大M值得到的阈值较大，允许的泛化误差大，即允许区域内部变化大，从而得到大尺度的分割结果。In this embodiment, δ is fixed at 0.1, and _ni changes automatically with the merging process. Therefore, only M is used as a parameter to control the segmentation scale. By setting different segmentation scale parameters, the segmentation results of images at different scales can be obtained . The threshold value obtained by the large M value is large, and the generalization error allowed is large, that is, the internal variation of the region is allowed to be large, so as to obtain a large-scale segmentation result.

具体实施时，也可以采用其他基于统计学习理论的区域合并准则，例如基于统计学习理论的由区域所有像素特性的设计的准则等。During specific implementation, other region merging criteria based on statistical learning theory may also be used, such as a design criterion based on statistical learning theory based on the characteristics of all pixels in a region.

分割参数设置中，为了避免分割时生成过小区域，本发明提出分割参数中包含一个用于控制最小区域面积大小的参数。当分割时出现某个区域面积小于最小区域面积大小参数时，可以进行的处理为：重新按边权值从小到大顺序开始，若当前边所连接的两个区域不属于同一个区域，并且该边所连接的两个区域中的某个区域面积小于指定最小区域面积，则将这两个区域合并。通过重新从简单图模型中的最小权值边开始按区域合并准则进行增长直到最大权值边，这样保证了合并区域之间相似性最大。In the setting of segmentation parameters, in order to avoid generating too small areas during segmentation, the present invention proposes that the segmentation parameters include a parameter for controlling the size of the minimum area. When the area of a certain area is smaller than the minimum area size parameter during segmentation, the processing that can be performed is: start again in the order of edge weights from small to large, if the two areas connected by the current edge do not belong to the same area, and the If one of the two regions connected by an edge is smaller than the specified minimum region area, the two regions are merged. By re-starting from the minimum weight edge in the simple graph model and growing according to the region merging criterion until the maximum weight edge, this ensures the maximum similarity between the merged regions.

最小生成树问题描述为寻找边权和最小的生成树，对于图像分割来说，寻找差异最小的连通区域问题就可以转化为最小生成树问题。因此，在步骤一实现了将图像表示的像素之间关系用简单图模型表示后，可以基于最小生成树对步骤一所得简单图模型进行影像分割，参见图1，其中图1a为一个5*5影像的灰度值，图1b为与图1a对应的图模型表示，图1c为图1b的最小生成树，图1d为对图1c的分割结果。在步骤二设置的分割尺度参数影响分割结果，参见图2，图2a为阈值取较大值时得到的分割结果，图2b为阈值取较小值时得到的分割结果。基于最小生成树的图像分割主要有自顶向下分裂最小生成树与自底向上合并生成多个最小生成树两种策略。实施例采用Kruska1最小生成树算法以及自底向上的合并策略实现基于最小生成树的影像分割，具体算法和策略为现有技术，本发明不予赘述。The minimum spanning tree problem is described as finding the minimum spanning tree with edge weights. For image segmentation, the problem of finding the connected region with the smallest difference can be transformed into the minimum spanning tree problem. Therefore, after step 1 realizes that the relationship between the pixels represented by the image is represented by a simple graph model, the image segmentation of the simple graph model obtained in step 1 can be performed based on the minimum spanning tree, see Figure 1, where Figure 1a is a 5*5 The gray value of the image, Figure 1b is the graphical model representation corresponding to Figure 1a, Figure 1c is the minimum spanning tree of Figure 1b, and Figure 1d is the segmentation result of Figure 1c. The segmentation scale parameter set in step 2 affects the segmentation result, see Figure 2, Figure 2a shows the segmentation result obtained when the threshold value is larger, and Figure 2b shows the segmentation result obtained when the threshold value is smaller. The image segmentation based on the minimum spanning tree mainly has two strategies of splitting the minimum spanning tree from the top and merging from the bottom to generate multiple minimum spanning trees. The embodiment adopts the Kruska1 minimum spanning tree algorithm and bottom-up merging strategy to realize the image segmentation based on the minimum spanning tree. The specific algorithm and strategy are prior art, and will not be described in detail in the present invention.

Claims

1. based on the image segmentation method of minimum spanning tree and statistical learning theory, it is characterized in that: comprise the following steps,

Step 1. The image is represented by a simple graph model. Each vertex of the simple graph model corresponds to a pixel of the image. Every two adjacent vertices are connected by an edge. The edge weight of each edge is the two pixels connected by the edge. The difference between the two pixels corresponding to the vertex;

Step 2. Set the region merging criterion based on the statistical learning theory, and set the segmentation parameters;

Step 3: Carry out image segmentation based on the minimum spanning tree for the simple graph model obtained in step 1. When segmenting, start from the edge with the smallest weight in the simple graph model and merge it according to the region merging criterion until the edge with the largest weight. A spanning tree represents a connected region.

2. The image segmentation method according to claim 1, characterized in that: in the step 1, the edge weight is calculated in such a way that the difference between the two pixels corresponding to the two vertices is calculated by using the band weighted squared distance sum.

3. The image segmentation method according to claim 2, characterized in that: in the step 2, based on the statistical learning theory, the specific implementation of the regional merging criterion is set as, the image segmentation process is used as a learning process, and the learning process is to use the image Segmentation is regarded as a regression estimation problem; according to the uniform and stable learning algorithm based on empirical risk minimization in statistical learning theory, the regional merging criterion is set, and the empirical risk minimization problem is transformed into a probability problem. The loss function in statistical learning is the same as The calculation method of the edge weight in step 1 is corresponding.

4. The image segmentation method according to claim 1, characterized in that: the segmentation parameter in step 2 includes a segmentation scale parameter, and the size of the region obtained by image segmentation in step 3 is affected by setting the value of the segmentation scale parameter.

5. The image segmentation method according to claim 4, characterized in that: the segmentation scale parameter is set according to the maximum edge weight allowed by the area.

6. The image segmentation method according to claim 1, characterized in that: the segmentation parameter in step 2 includes a minimum area parameter, by setting the value of the minimum area parameter to prevent generation of too small area during image segmentation in step 3.

7. The image segmentation method according to claim 1, characterized in that: the image segmentation based on the minimum spanning tree described in step 3 is realized by using the Kruskal minimum spanning tree algorithm.