CN110335285B

CN110335285B - SAR image target marking method, system and device based on sparse representation

Info

Publication number: CN110335285B
Application number: CN201910611220.0A
Authority: CN
Inventors: 张文生; 杨阳; 黄妍; 杨雪冰
Original assignee: Institute of Automation of Chinese Academy of Science
Current assignee: Institute of Automation of Chinese Academy of Science
Priority date: 2019-07-08
Filing date: 2019-07-08
Publication date: 2022-04-26
Anticipated expiration: 2039-07-08
Also published as: CN110335285A

Abstract

The invention belongs to the field of security remote sensing detection of marine ships, airport airplanes and local automobiles, and particularly relates to a sparse representation-based SAR image target marking method, system and device, aiming at solving the problems of low target detection and marking efficiency and accuracy of the existing SAR image. The method comprises the steps of obtaining an SAR image, performing morphological processing after threshold segmentation of the image, and extracting an interested region; adopting coarse filtration, and taking the region of interest after the coarse filtration as a first region image; extracting multi-scale feature vectors of the first region image, and respectively obtaining sparse representations corresponding to the feature vectors; based on sparse representation of different scales, obtaining corresponding classification results through SVM classification models respectively, and obtaining the category of the first region image by adopting a preset decision method; the first region image is labeled in the SAR image based on the category and location information of the first region image. The invention can efficiently and accurately detect and mark the target.

Description

SAR image target labeling method, system and device based on sparse representation

技术领域technical field

本发明属于海洋舰船、机场飞机和阵地汽车安防遥感检测领域，具体涉及一种基于稀疏表示的SAR图像目标标记方法、系统、装置。The invention belongs to the field of remote sensing detection of marine ships, airport aircraft and position automobiles, and in particular relates to a sparse representation-based SAR image target marking method, system and device.

背景技术Background technique

目前安防行业内较多使用的是合成孔径雷达(SAR)，SAR成像系统随着搭载平台以一定的速度进行运动，然后以预定的速度、时间间隔向目标区域发射电磁波脉冲，并记录回波所携带的强度和相位信息，经过多次观测记录合成目标图像。由于遥感图像远距离和机载移动成像的特点，遥感图像场景中的目标通常较小，而且遥感图像本身具有严重的相干斑噪声和丰富的纹理信息，目标检测难度大，检测效率和准确率较差。At present, synthetic aperture radar (SAR) is widely used in the security industry. The SAR imaging system moves at a certain speed with the carrying platform, and then transmits electromagnetic wave pulses to the target area at a predetermined speed and time interval, and records the echoes. Carrying the intensity and phase information, the synthetic target image is recorded after multiple observations. Due to the characteristics of remote sensing images and airborne mobile imaging, the targets in remote sensing image scenes are usually small, and the remote sensing images themselves have serious speckle noise and rich texture information, which makes target detection difficult, and the detection efficiency and accuracy are relatively high. Difference.

目前为止，SAR图像目标检测及标记的方法主要有基于图像模板匹配的方法和基于特征模板匹配的方法。其中,基于图像模板匹配的方法需要对图像进行方位角估计，建立与测试样本相匹配的模板，该方法简单易行，但是需要占用很大的内存，计算复杂度很高。基于特征模板匹配的方法，首先采用PCA、ICA等算法提取特征，然后用支持向量机进行分类鉴别，但是这种方法仍存在特征维数过大，时间消耗长的问题，严重影响目标的识别率。随着我国安防需求快速发展，目标检测及标记精度要求不断提高，目标检测及标记工作日益增长。传统的SAR图像目标检测及标记方法无法满足当前快速发展的安防行业，包括海洋舰船、机场飞机和阵地汽车目标的检测及标记需求。So far, the methods of SAR image target detection and marking mainly include the method based on image template matching and the method based on feature template matching. Among them, the method based on image template matching needs to estimate the azimuth angle of the image and establish a template that matches the test sample. This method is simple and easy to implement, but it needs a lot of memory and has high computational complexity. The method based on feature template matching first uses algorithms such as PCA and ICA to extract features, and then uses support vector machine for classification and identification. However, this method still has the problem of too large feature dimension and long time consumption, which seriously affects the recognition rate of the target. . With the rapid development of my country's security needs, the accuracy requirements of target detection and marking are continuously improved, and the work of target detection and marking is increasing day by day. Traditional SAR image target detection and marking methods cannot meet the current rapidly developing security industry, including the detection and marking requirements of marine ships, airport aircraft and position vehicles.

如何提高SAR图像处理系统的自动目标检测及标记能力，准确发现并识别各类重要战略目标，实现SAR图像由数据向情报信息的快速转化成为新的课题。How to improve the automatic target detection and marking ability of the SAR image processing system, accurately discover and identify various important strategic targets, and realize the rapid transformation of SAR images from data to intelligence information has become a new topic.

发明内容SUMMARY OF THE INVENTION

为了解决现有技术中的上述问题，即为了解决现有SAR图像目标检测及标记效率、准确率低的问题，本发明第一方面，提出了一种基于稀疏表示的SAR图像目标标记的方法，该方法包括：In order to solve the above problems in the prior art, that is, in order to solve the problems of low efficiency and accuracy of the existing SAR image target detection and labeling, the first aspect of the present invention proposes a method for SAR image target labeling based on sparse representation, The method includes:

步骤S10，获取待标记的SAR图像，对所述SAR图像阈值分割后进行形态学处理，并提取感兴趣区域；Step S10, acquiring the SAR image to be marked, performing morphological processing after threshold segmentation of the SAR image, and extracting a region of interest;

步骤S20，基于提取的感兴趣区域，采用粗过滤，将所述粗过滤后的感兴趣区域作为第一区域图像，并获取其在所述SAR图像中的位置信息；Step S20, using coarse filtering based on the extracted region of interest, using the coarsely filtered region of interest as the first region image, and acquiring its position information in the SAR image;

步骤S30，提取所述第一区域图像的多尺度的特征向量，分别获取不同尺度的特征向量对应的稀疏表示；Step S30, extracting the multi-scale feature vectors of the first area image, respectively obtaining sparse representations corresponding to the feature vectors of different scales;

步骤S40，基于不同尺度的稀疏表示，通过SVM分类模型分别得到其对应的分类结果，并采用预设的决策方法，获取所述第一区域图像的类别；Step S40, based on the sparse representation of different scales, obtain the corresponding classification results through the SVM classification model, and adopt a preset decision method to obtain the category of the first area image;

步骤S50，根据第一区域图像的类别和位置信息，在所述待标记的SAR图像中对所述第一区域图像进行标记；Step S50, marking the first region image in the to-be-marked SAR image according to the category and position information of the first region image;

其中，in,

所述SVM分类模型在训练过程中，以SAR训练集样本图像的奇异值分解得到的特征值作为属性。During the training process of the SVM classification model, the eigenvalues obtained by the singular value decomposition of the sample images of the SAR training set are used as attributes.

在一些优选的实施方式中，骤S30中“多尺度提取所述第一区域图像的特征向量”，其方法为：基于所述第一区域图像构造预设尺度的差分金字塔图像集，分别提取该图像集中不同尺度的图像的特征向量。In some preferred embodiments, in step S30, "multi-scale extraction of the feature vector of the first area image", the method is: constructing a preset scale difference pyramid image set based on the first area image, extracting the Feature vectors of images of different scales in the image set.

在一些优选的实施方式中，步骤S30中“分别获取不同尺度的特征向量对应的稀疏表示”，其方法为：通过稀疏表示模型获取；所述稀疏表示模型其构建方法为：In some preferred embodiments, in step S30 "respectively obtain sparse representations corresponding to feature vectors of different scales", the method is: obtaining through a sparse representation model; the construction method of the sparse representation model is:

随机抽取训练集部分原子构建字典；Randomly extract some atoms of the training set to construct a dictionary;

所述字典根据预设的逼近误差值得到过完备字典，并构建过完备字典矩阵；The dictionary obtains an over-complete dictionary according to a preset approximation error value, and constructs an over-complete dictionary matrix;

基于MP算法对所述的过完备字典矩阵分解获取最优稀疏解；The optimal sparse solution is obtained by decomposing the overcomplete dictionary matrix based on the MP algorithm;

根据所述最优稀疏解构建误差矩阵，采用奇异值分解方法对所述误差矩阵进行分解，根据该分解结果更新字典。An error matrix is constructed according to the optimal sparse solution, the error matrix is decomposed by a singular value decomposition method, and a dictionary is updated according to the decomposition result.

在一些优选的实施方式中，“采用奇异值分解方法对所述误差矩阵进行分解”，其方法为：对所述误差矩阵进行非零系数位置标定形成新的误差矩阵，采用奇异值分解方法对所述新的误差矩阵进行分解；所述奇异值分解方法基于正则项进行约束优化。In some preferred embodiments, "decomposing the error matrix by using the singular value decomposition method", the method is: performing the position calibration of the non-zero coefficients on the error matrix to form a new error matrix, and using the singular value decomposition method to decompose the error matrix. The new error matrix is decomposed; the singular value decomposition method is constrained to optimize based on the regular term.

在一些优选的实施方式中，步骤S20“基于提取的感兴趣区域，采用粗过滤，将所述粗过滤后的感兴趣区域作为第一区域图像”，所述粗过滤包括几何特征过滤、空间位置过滤、灰度特征过滤。In some preferred embodiments, step S20 "uses coarse filtering based on the extracted region of interest, and uses the coarsely filtered region of interest as the first region image", where the coarse filtering includes geometric feature filtering, spatial location Filtering, grayscale feature filtering.

在一些优选的实施方式中，所述差分金字塔图像集中相邻尺度的图像之间的尺度比例为2，同尺度各相邻层的图像之间的平滑系数不同。In some preferred embodiments, the scale ratio between the images of adjacent scales in the differential pyramid image set is 2, and the smoothing coefficients between the images of the adjacent layers of the same scale are different.

在一些优选的实施方式中，所述使用训练集部分原子构建字典，部分原子的数目为训练集原子的50％～80％。In some preferred embodiments, the dictionary is constructed by using partial atoms of the training set, and the number of partial atoms is 50%-80% of the atoms of the training set.

本发明的第二方面，提出了一种基于稀疏表示的SAR图像目标标记的系统，该系统包括提取模块、过滤模块、稀疏表示模块、分类识别模块、标记模块；In a second aspect of the present invention, a system for SAR image target marking based on sparse representation is proposed, the system includes an extraction module, a filtering module, a sparse representation module, a classification and identification module, and a marking module;

所述的提取模块，配置为获取待标记的SAR图像，对所述SAR图像阈值分割后进行形态学处理，并提取感兴趣区域；The extraction module is configured to obtain the SAR image to be marked, perform morphological processing after threshold segmentation of the SAR image, and extract the region of interest;

所述的过滤模块，配置为基于提取的感兴趣区域，采用粗过滤，将所述粗过滤后的感兴趣区域作为第一区域图像，并获取其在所述SAR图像中的位置信息；The filtering module is configured to use coarse filtering based on the extracted region of interest, take the coarsely filtered region of interest as the first region image, and obtain its position information in the SAR image;

所述的稀疏表示模块，配置为提取所述第一区域图像的多尺度的特征向量，分别获取不同尺度的特征向量对应的稀疏表示；The sparse representation module is configured to extract multi-scale feature vectors of the first region image, and obtain sparse representations corresponding to the feature vectors of different scales respectively;

所述的分类识别模块，配置为基于不同尺度的稀疏表示，通过SVM分类模型分别得到其对应的分类结果，并采用预设的决策方法，获取所述第一区域图像的类别；The classification and recognition module is configured to be based on sparse representation of different scales, obtain its corresponding classification results through the SVM classification model, and use a preset decision method to obtain the category of the first area image;

所述的标记模块，配置为根据第一区域图像的类别和位置信息，在所述待标记的SAR图像中对所述第一区域图像进行标记；The marking module is configured to mark the first region image in the to-be-marked SAR image according to the category and position information of the first region image;

其中，in,

本发明的第三方面，提出了一种存储装置，其中存储有多条程序，所述程序应用由处理器加载并执行以实现上述的基于稀疏表示的SAR图像目标标记方法。In a third aspect of the present invention, a storage device is provided, in which a plurality of programs are stored, and the program applications are loaded and executed by a processor to implement the above-mentioned sparse representation-based SAR image target marking method.

本发明的第四方面，提出了一种处理设置，包括处理器、存储装置；处理器，适用于执行各条程序；存储装置，适用于存储多条程序；所述程序适用于由处理器加载并执行以实现上述的基于稀疏表示的SAR图像目标标记方法。In a fourth aspect of the present invention, a processing setup is proposed, including a processor and a storage device; the processor is adapted to execute various programs; the storage device is adapted to store multiple programs; the programs are adapted to be loaded by the processor And execute to realize the above-mentioned sparse representation-based SAR image target labeling method.

本发明的有益效果：Beneficial effects of the present invention:

本发明可以高效、准确的进行目标检测及标记。本发明基于稀疏表示对复杂环境下不同目标进行了精确的检测；并且基于非平衡采样的思路，可以使占少数的目标图像和占多数的背景图像数目平衡；进一步的基于目标检测后返回鉴别结果进行标记并给出定位器在世界坐标系下的精确值。有效解决了现有的海洋舰船、机场飞机和阵地汽车检测方法存在检测及标记效率、准确率较差的技术问题，进而实现了高效准确的检测及标记技术效果。The present invention can efficiently and accurately perform target detection and marking. The invention accurately detects different targets in complex environments based on sparse representation; and based on the idea of unbalanced sampling, the number of target images that account for a minority and the number of background images that account for the majority can be balanced; further identification results are returned after target detection. Marks and gives the exact value of the locator in world coordinates. It effectively solves the technical problems of poor detection and marking efficiency and accuracy in the existing detection methods for marine ships, airport aircraft and position vehicles, thereby achieving efficient and accurate detection and marking technical effects.

附图说明Description of drawings

通过阅读参照以下附图所做的对非限制性实施例所做的详细描述，本申请的其他特征、目的和优点将会变得更明显。Other features, objects and advantages of the present application will become more apparent upon reading the detailed description of non-limiting embodiments taken with reference to the following drawings.

图1是本发明一种实施例的基于稀疏表示的SAR图像目标标记方法的流程示意图；1 is a schematic flowchart of a sparse representation-based SAR image target labeling method according to an embodiment of the present invention;

图2是本发明一种实施例的基于稀疏表示的SAR图像目标标记方法的框架示意图。FIG. 2 is a schematic diagram of a framework of a sparse representation-based SAR image target labeling method according to an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚，下面将结合附图对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。In order to make the objectives, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings. Obviously, the described embodiments are part of the embodiments of the present invention, not All examples. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

下面结合附图和实施例对本申请作进一步的详细说明。可以理解的是，此处所描述的具体实施例仅用于解释相关发明，而非对该发明的限定。另外还需要说明的是，为了便于描述，附图中仅示出了与有关发明相关的部分。The present application will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the related invention, but not to limit the invention. In addition, it should be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

需要说明的是，在不冲突的情况下，本申请中的实施例及实施例中的特征可以相互组合。It should be noted that the embodiments in the present application and the features of the embodiments may be combined with each other in the case of no conflict.

本发明的基于稀疏表示的SAR图像目标标记方法，包括以下步骤：The sparse representation-based SAR image target marking method of the present invention includes the following steps:

其中，in,

为了更清晰地对本发明基于稀疏表示的SAR图像目标标记方法进行说明，下面结合附图1对本发明方法一种实施例中各步骤进行展开详述。In order to more clearly describe the sparse representation-based SAR image target marking method of the present invention, each step in an embodiment of the method of the present invention will be described in detail below with reference to FIG. 1 .

下文优选实施例中，先对稀疏表示模型和SVM分类模型进行详述，然后再对采用稀疏表示模型和SVM分类模型获取待鉴别SAR图像中目标的基于稀疏表示的SAR图像目标标记方法进行详述。In the following preferred embodiments, the sparse representation model and the SVM classification model are first described in detail, and then the sparse representation-based SAR image target labeling method for obtaining the target in the SAR image to be identified by using the sparse representation model and the SVM classification model is described in detail. .

1、稀疏表示模型1. Sparse representation model

1.1训练样本的构建1.1 Construction of training samples

(1)获取SAR图像并多尺度特征提取(1) Acquire SAR images and extract multi-scale features

根据预设的目标检测种类，以不同的分辨率和成像角度分类采集各种类遥感目标图像数据，并统计多尺度待检测图像的尺度，获取多尺度待检测图像数据中最大尺度。According to the preset target detection types, various types of remote sensing target image data are classified and collected with different resolutions and imaging angles, and the scales of the multi-scale images to be detected are counted to obtain the largest scale in the multi-scale image data to be detected.

以最大尺度作为图像数据集构造差分金字塔的第一组图像的尺度，然后使用采样的方法，继续构造第二组、第三组、第四组的尺度图像，如此获得多尺度的图像特征。Use the largest scale as the scale of the image dataset to construct the first group of images of the differential pyramid, and then use the sampling method to continue to construct scale images of the second, third, and fourth groups, so as to obtain multi-scale image features.

图像金字塔是一种含多分辨率图像的结构，对原始图像进行采样获得不同尺度的图像，对同一尺度的图像进行滤波操作获得不同平滑程度的图像。最大的分辨率图像放在底部，以金字塔形式排列，越往上图像分辨率越低，构成了图像金字塔。所述步骤如下：Image pyramid is a structure with multi-resolution images. The original image is sampled to obtain images of different scales, and the image of the same scale is filtered to obtain images with different degrees of smoothness. The largest resolution images are placed at the bottom and arranged in a pyramid form. The higher the resolution, the lower the image resolution, which constitutes an image pyramid. The steps are as follows:

步骤11，把原始图像采样得到金字塔第一组的第一层图像，如果原始图像大于金字塔第一组的设定尺度，则使用下采样，否则使用上采样。然后将第一组的第一层图像通过高斯滤波获得金字塔第一组的第二层图像，高斯滤波函数如公式(1)所示，平滑系数σ＝1.6：Step 11: Sampling the original image to obtain the first layer image of the first group of pyramids, if the original image is larger than the set scale of the first group of pyramids, use downsampling, otherwise use upsampling. Then, the first layer images of the first group are obtained by Gaussian filtering to obtain the second layer images of the first group of pyramids. The Gaussian filter function is shown in formula (1), and the smoothing coefficient σ=1.6:

其中，G(x,y)为高斯滤波函数，x、y为高斯分布的二维坐标，x₀、y₀为高斯分布的二维坐标对应的均值。Among them, G(x, y) is a Gaussian filter function, x and y are the two-dimensional coordinates of the Gaussian distribution, and x ₀ , y ₀ are the mean values corresponding to the two-dimensional coordinates of the Gaussian distribution.

步骤12，重复步骤11，获得第一组的第三层、第四层、第五层图像，平滑系数分别是1.26×1.6、1.26²×1.6、1.26³×1.6。金字塔第一组图像一共五层，它们的尺度一致，平滑系数不同。Step 12: Repeat step 11 to obtain the images of the third layer, the fourth layer, and the fifth layer of the first group, and the smoothing coefficients are 1.26×1.6, 1.26 ² ×1.6, and 1.26 ³ ×1.6, respectively. The first group of images in the pyramid has a total of five layers, and they have the same scale and different smoothing coefficients.

步骤13，对第一组的第三层图像进行下采样获得金字塔第二组第一层图像，比例因子是2，相邻组也可以理解为相邻尺度。重复步骤11和步骤12获得金字塔第二组的五层图像，第二组图像的尺度是第一组的一半。如此循环，构成图像金字塔，一共四组，每组五层。Step 13, down-sampling the images of the third layer of the first group to obtain the images of the first layer of the second group of the pyramid, the scale factor is 2, and adjacent groups can also be understood as adjacent scales. Repeat steps 11 and 12 to obtain five-layer images of the second group of pyramids, and the scale of the second group of images is half that of the first group. This cycle forms an image pyramid, with a total of four groups, each with five layers.

步骤14，在图像金字塔的基础上构造差分金字塔，差分金字塔的第一组第一层图像由图像金字塔的第一组第二层减第一层得到，即差分金字塔的第O组第I层图像由金字塔的第O组第I+1层减第O组第I层得到，其中O、I为自然数，代表差分金字塔的组数和层数。如此循环，获得差分金字塔，一共四组，每组四层。Step 14, construct a difference pyramid on the basis of the image pyramid, the first group of the first layer images of the difference pyramid is obtained by subtracting the first layer from the first group of the second layer of the image pyramid, that is, the 0th group of the first layer image of the difference pyramid. It is obtained by subtracting the Oth group I+1 level of the pyramid from the Oth group I level, where O and I are natural numbers, representing the number of groups and levels of the differential pyramid. In this way, a differential pyramid is obtained, a total of four groups, each with four layers.

(2)SAR图像的非平衡采样(2) Unbalanced sampling of SAR images

为了克服目标图像的数量少于背景图像的数量的问题，采用非平衡采样方法SMOTE构建训练样本。SMOTE的核心思想是K近邻和线性插值，具体做法是在相邻较近的两个少数类样本之间通过线性插值加入新的少数类样本，使少数类样本数目增加，从而达到少数类与多数类样本数量平衡的目的。所述步骤如下：In order to overcome the problem that the number of target images is less than the number of background images, the unbalanced sampling method SMOTE is used to construct training samples. The core idea of SMOTE is K-nearest neighbors and linear interpolation. The specific method is to add a new minority class sample through linear interpolation between two adjacent minority class samples, so that the number of minority class samples increases, so as to achieve minority class and majority class. The purpose of class sample size balance. The steps are as follows:

步骤21，对少数类的每一个样本x_i，计算x_i到所有少数类样本的距离，从中选取N个最近邻的样本，并记录好下标。K值一般5。假设样本x_i和x_j有M维属性，根据公式(2)计算距离：Step 21: For each sample _xi of the minority class, calculate the distance from _xi to all the samples of the minority class, select N nearest neighbor samples from it, and record the subscript. The K value is generally 5. Assuming that samples x _i and x _j have M-dimensional attributes, the distance is calculated according to formula (2):

其中，x_j为除样本x_i以外的少数类样本，x_i(m)和x_j(m)分别是样本x_i和x_j的第m个属性。Among them, x _j is a minority class sample other than sample x _i , and x _i (m) and x _j (m) are the mth attributes of samples x _i and x _j , respectively.

步骤22，根据原始训练集中多数类与少数类的比例(不平衡比率)U设定采样倍率N，对少数类的每一个样本x_i，从x_i的K个最近邻样本中随机选择t个，记t个其中一个为x_j(j＝1,2,…,t)，在x_i和x_j之间进行随机线性插值，获得新样本x_new。计算如公式(3)所示：Step 22: Set the sampling ratio N according to the ratio of the majority class to the minority class (imbalance ratio) U in the original training set, and for each sample _xi of the minority class, randomly select t samples from the K nearest neighbor samples of _xi , denote t one of them as x _j (j=1,2,...,t), and perform random linear interpolation between x _i and x _j to obtain a new sample x _new . The calculation is shown in formula (3):

x_new＝x_i+rand(0,1)×(x_j-x_i) (3)x _new = x _i +rand(0,1)×(x _j -x _i ) (3)

其中，rand(0,1)是0到1之间的一个随机数。where rand(0,1) is a random number between 0 and 1.

步骤23，把获得的新样本加入到原始训练集中得到一个新的训练集，这个新训练集多数类与少数类样本数目平衡，即背景图像和目标图像数目平衡。Step 23, adding the obtained new samples to the original training set to obtain a new training set, in which the number of samples of the majority class and the minority class is balanced, that is, the number of background images and target images is balanced.

1.2构建训练稀疏表示模型1.2 Building a training sparse representation model

稀疏表示的基本思想是信号由大量的基本信号组成，这些基本信号表征信号来自不同角度不同类型的特征。通过这些基本信号的线性叠加，我们可以有效地表示任意信号，将这些基本信号称为原子，将原子的集合称为字典。构建训练步骤如下：The basic idea of sparse representation is that the signal consists of a large number of basic signals, which represent different types of features from different angles. Through the linear superposition of these fundamental signals, we can effectively represent arbitrary signals, which are called atoms, and the collection of atoms is called a dictionary. The steps to build the training are as follows:

步骤S121，使用训练集部分的原子构建字典D，原子的数目应该小于训练集样本数目，以训练集样本的50％～80％为佳。In step S121 , a dictionary D is constructed using the atoms in the training set. The number of atoms should be smaller than the number of samples in the training set, preferably 50% to 80% of the samples in the training set.

假设我们需要处理的输入信号x∈R^m，字典D＝{d₁,d₂,...,d_i,...,d_n}∈R^m×n，其中字典中的每一列d_i,i∈[1,n]表示字典中的一个原子。稀疏表示理论中的基本假设是信号x可以由合适的字典D中的原子的线性组合来表示，如公式(4)所示：Suppose we need to process the input signal x∈R ^m , the dictionary D={d ₁ ,d ₂ ,...,d _i ,...,d _n }∈R ^m×n , where each column d _i in the dictionary ,i∈[1,n] represents an atom in the dictionary. The basic assumption in sparse representation theory is that the signal x can be represented by a linear combination of atoms in a suitable dictionary D, as shown in equation (4):

其中，α_i是信号x在原子d_i上的展开系数，α＝[α₁；α₂；...；α_n]∈Rⁿ。where α _i is the expansion coefficient of the signal x on the atom d _i , α=[α ₁ ;α ₂ ;...;α _n ]∈R ⁿ .

这里，如果字典中原子个数小于信号的维数，即n＜m，则字典中的原子不能完全表示m维的输入信号，这种字典称为非完备字典；如果n＝m且^D中的原子线性无关，则字典可以线性地表示任意m维的输入信号，这种字典称为完备字典。如果字典中的原子彼此正交，则字典中的原子称为完备正交基。例如，传统的傅里叶变换，小波变换，余弦变换等都使用完备正交基；如果n＞m，是多解问题，则输入信号x在字典上的表示形式不唯一，即α的解不唯一，这种字典称为过完备冗余字典。Here, if the number of atoms in the dictionary is less than the dimension of the signal, that is, n<m, the atoms in the dictionary cannot fully represent the m-dimensional input signal, and this dictionary is called a non-complete dictionary; if n=m and ^D in D If the atoms are linearly independent, the dictionary can linearly represent any m-dimensional input signal. This kind of dictionary is called a complete dictionary. Atoms in a dictionary are called complete orthonormal basis if they are orthogonal to each other. For example, traditional Fourier transform, wavelet transform, cosine transform, etc. all use complete orthogonal basis; if n>m, it is a multi-solution problem, then the representation of the input signal x in the dictionary is not unique, that is, the solution of α is not unique. Uniquely, such a dictionary is called an overcomplete redundant dictionary.

完备正交字典通常使用单一类型的正交基作为字典原子，但是现实世界中的信号丰富多样，信号表征需要使用不同类型的基信号，因此在实践中字典D通常是包含多种类型原子的过完备冗余字典，需要求解多解性问题。稀疏表示理论的目的就是在这些解中找到对信号x最简洁但又有效的表示，这样的解称为稀疏解。A complete orthogonal dictionary usually uses a single type of orthogonal bases as dictionary atoms, but the signals in the real world are rich and diverse, and the signal representation needs to use different types of base signals. Therefore, in practice, the dictionary D usually contains multiple types of atoms. A complete redundant dictionary is required to solve multi-solution problems. The purpose of sparse representation theory is to find the most concise but efficient representation of the signal x in these solutions, and such solutions are called sparse solutions.

步骤S122，基于MP算法迭代计算对字典矩阵分解，更新残差求解最优稀疏解。In step S122, the dictionary matrix is decomposed based on the MP algorithm iterative calculation, and the residuals are updated to obtain the optimal sparse solution.

稀疏解稀疏性的最直观度量是，解向量中非零元素的个数，若x在字典D上只包含了k个非零系数α_i,i∈[1,n]，则我们称信号x在字典D的表示下是k稀疏的(K-Sparsity)，可以直接使用向量α的l_p范数度量稀疏性，如公式(5)所示：The most intuitive measure of the sparsity of the sparse solution is the number of non-zero elements in the solution vector. If x only contains k non-zero coefficients α _i , i ∈ [1, n] in the dictionary D, then we call the signal x Under the representation of the dictionary D, it is k-sparse (K- _Sparsity ), and the lp norm of the vector α can be directly used to measure the sparsity, as shown in formula (5):

其中，||α||_p表示向量α的l_p范数。where ||α|| _p represents the _lp norm of the vector α.

l₀范数可以看成l_p范数的特殊情况，用公式(6)计算：The l ₀ norm can be regarded as a special case of the l _p norm, which is calculated by formula (6):

其中，||α||₀表示向量α的l₀范数，即向量α中的非零元素的个数，非零系数表征图像的基本特征和结构，即图像能量的集中区域。Among them, ||α|| ₀ represents the l ₀ norm of the vector α, that is, the number of non-zero elements in the vector α, and the non-zero coefficients represent the basic features and structure of the image, that is, the concentrated area of image energy.

根据稀疏性度量，稀疏表示模型由公式(7)得：According to the sparsity measure, the sparse representation model is obtained by formula (7):

其中，样本x的向量

由尽量少的字典原子表示，p＝2是信号处理领域中的均方误差度量准则，是一个平滑的凸优化问题；p＝1是总变分度量准则；当0＜p＜1时，它的数学分析是非凸的，称为拟范数；p＝0属于组合优化问题，并且是NP-hard问题。可以看出，当0≤p＜2时，都可以度量稀疏性，在这个范围内，p值越大，获得的解稀疏性越差，否则反之。因此，p＝0不仅是最直观的稀疏性度量方法，也是最合适稀疏性度量准则，缺点是求解比较困难，在多项式时间内难以获得解，其具体的求解方法是MP算法。where, the vector of samples x

Represented by as few dictionary atoms as possible, p=2 is the mean square error metric in the field of signal processing, which is a smooth convex optimization problem; p=1 is the total variation metric; when 0<p<1, it is The mathematical analysis of is non-convex, called quasi-norm; p=0 belongs to combinatorial optimization problem and is NP-hard problem. It can be seen that when 0≤p<2, the sparsity can be measured. In this range, the larger the value of p, the worse the sparsity of the obtained solution, otherwise the opposite is true. Therefore, p=0 is not only the most intuitive measure of sparsity, but also the most suitable measure of sparsity. The disadvantage is that it is difficult to solve, and it is difficult to obtain a solution in polynomial time. The specific solution method is MP algorithm.

当使用l₀范数进行稀疏性度量时，稀疏求解的核心问题是用尽可能少的D中的原子来表示输入信号x，即转化为约束方程的求解问题，如公式(8)所示：When using the l ₀ norm to measure the sparsity, the core problem of the sparse solution is to use as few atoms in D as possible to represent the input signal x, that is, to convert the problem of solving the constraint equation, as shown in formula (8):

以上描述是针对理想的无噪图像，并且在实际应用中，图像将不可避免地有噪声干扰，因此上式中的模型不一定与实际得到的结果完全相同，考虑到实际中的误差和噪声，引入一个误差容许值δ，并将上式表示的模型转化为以下表达式，即在误差允许范围内，使用尽量少的原子来表示信号，如公式(9)所示：The above description is for an ideal noise-free image, and in practical applications, the image will inevitably have noise interference, so the model in the above formula is not necessarily exactly the same as the actual result, considering the actual error and noise, An error tolerance value δ is introduced, and the model expressed by the above formula is transformed into the following expression, that is, the signal is represented by as few atoms as possible within the allowable error range, as shown in formula (9):

其中，

表示真实样本x与其稀疏表示的平方差。in,

represents the squared difference of the true sample x and its sparse representation.

或者转化为在特定稀疏程度限制下，求最佳重构精度的解，如公式(10)所示：Or it can be transformed into a solution to find the best reconstruction accuracy under a certain sparsity limit, as shown in formula (10):

其中，T表示α的l₀范数的误差阈值。where T represents the error threshold of the l ₀ norm of α.

为了计算具体的数值，上述描述的约束优化问题可以通过拉格朗日乘子转化成无约束优化问题，如公式(11)所示：In order to calculate specific values, the constrained optimization problem described above can be transformed into an unconstrained optimization problem through Lagrange multipliers, as shown in Equation (11):

其中，λ表示惩罚因子。where λ represents the penalty factor.

MP算法详解：由于实际信号种类的复杂性和多样性，单一类型的原子不能更好地表征多样信号地特征。因此，有必要构建包含多种基本原子的冗余字典以得到信号的稀疏表示。此时，需要求一个病态欠定方程组的解，已知该问题具有多解性，稀疏求解就是从该问题的解空间中找到最稀疏的解，此时问题由求解病态欠定方程组转化成求解l0范数的最小化问题，该问题是一个NP-hard问题，求解也十分困难。在现有的数学理论中，NP问题没有多项式时间的分解算法，因此只能使用次优的算法求解稀疏问题。目前，稀疏求解算法可分为两大类：(1)贪婪算法；(2)凸松弛优化算法。接下来，对贪婪算法MP算法进行简单介绍：Detailed explanation of MP algorithm: Due to the complexity and diversity of actual signal types, a single type of atom cannot better characterize the characteristics of diverse signals. Therefore, it is necessary to construct redundant dictionaries containing various basic atoms to obtain a sparse representation of the signal. At this time, it is necessary to find a solution of an ill-conditioned underdetermined equation system. It is known that the problem has multiple solutions. The sparse solution is to find the sparsest solution from the solution space of the problem. At this time, the problem is transformed by solving the ill-conditioned underdetermined equation system. To solve the minimization problem of the l0 norm, the problem is an NP-hard problem, and the solution is also very difficult. In the existing mathematical theory, there is no polynomial-time decomposition algorithm for NP problems, so only suboptimal algorithms can be used to solve sparse problems. At present, sparse solving algorithms can be divided into two categories: (1) greedy algorithms; (2) convex relaxation optimization algorithms. Next, a brief introduction to the greedy algorithm MP algorithm:

贪婪算法是一种局部优化算法，它是分阶段进行的，在每一个阶段、每一个步骤中，认为当前所作的决定和选择都是最好的，并不考虑该阶段的选择对未来的影响，当算法终止时，我们希望局部最优就是全局最优，但实际上算法经常会终止在次最优解上。MP算法是一个典型的贪婪算法，以下简单介绍MP算法。The greedy algorithm is a local optimization algorithm. It is carried out in stages. In each stage and each step, the current decision and choice are considered to be the best, and the impact of the choice at this stage on the future is not considered. , when the algorithm terminates, we hope that the local optimum is the global optimum, but in fact the algorithm often terminates on the sub-optimal solution. The MP algorithm is a typical greedy algorithm. The MP algorithm is briefly introduced below.

假设在空间中要分解的信号是x∈R^m，过完备冗余字典表示为D＝{d₁,d₂,...,d_i,...,d_n}∈R^m×n，则我们的目标是从字典D中选择一系列原子，并通过它们的线性组合来表示信号x。假设信号可以分解成如公式(12)：Assuming that the signal to be decomposed in space is x∈R ^m , the overcomplete redundancy dictionary is expressed as D={d ₁ ,d ₂ ,...,d _i ,...,d _n }∈R ^m×n , Then our goal is to select a sequence of atoms from the dictionary D and represent the signal x by their linear combination. It is assumed that the signal can be decomposed into equation (12):

x＝<x,d_j0>d_j0+Rx (12)x=<x,d _j0 >d _j0 +Rx (12)

其中，Rx是信号被原子d_j0近似后的残差向量，显然d_j0和Rx是相互正交的，因此信号可表示为公式(13)：Among them, Rx is the residual vector after the signal is approximated by atomic d _j0 , obviously d _j0 and Rx are mutually orthogonal, so the signal can be expressed as formula (13):

||x||²＝|<x,d_j0>|²+||Rx||² (13)||x|| ² =|<x,d _j0 >| ² +||Rx|| ² (13)

为了最小化残差向量Rx的能量，必须选择合适的原子d_j0最大化信号x在该原子上的能量，即d_j0需要满足等式(14)：In order to minimize the energy of the residual vector Rx, a suitable atom d _j0 must be chosen to maximize the energy of the signal x on this atom, i.e. d _j0 needs to satisfy equation (14):

MP算法是一个迭代算法，每一次迭代计算都是在字典D中寻找与残差向量Rx最佳匹配的原子，然后如前所述更新残差，再继续重复这一过程。接下来，进一步解释MP算法：The MP algorithm is an iterative algorithm. Each iterative calculation is to find the atom that best matches the residual vector Rx in the dictionary D, and then update the residual as described above, and then continue to repeat the process. Next, the MP algorithm is further explained:

首先初始化数据，将残差向量初始化为原始信号，即R⁰x＝x，假设迭代第n(n＞0)次后残差向量表示为Rⁿx，则Rⁿx将会分解为公式(15)：First initialize the data, initialize the residual vector to the original signal, that is, R ⁰ x=x, assuming that the residual vector after the nth (n>0) iteration is expressed as R ⁿ x, then R ⁿ x will be decomposed into the formula ( 15):

Rⁿx＝|<Rⁿx，d_jn>|d_jn+Rⁿ⁺¹x (15)R ⁿ x=|<R ⁿ x, d _jn >|d _jn +R ⁿ⁺¹ x (15)

由于Rⁿ⁺¹x与原子d_jn相互正交，因此，残差向量可表示为如公式(16)所示：Since R ⁿ⁺¹ x and atom d _jn are orthogonal to each other, the residual vector can be expressed as formula (16):

||Rⁿx||²＝|<Rⁿx,d_jn>|²+||Rⁿ⁺¹x||² (16)||R ⁿ x|| ² =|<R ⁿ x,d _jn >| ² +||R ⁿ⁺¹ x|| ² (16)

每次分解都可以得到类似的等式，当分解进行到第

次时，将这一系列的等式相加，可以得到信号x表示为公式(17)：Similar equations can be obtained for each decomposition, when the decomposition proceeds to the first

The second time, adding this series of equations, the signal x can be obtained as equation (17):

也可转化为如公式(18)所示：It can also be transformed into equation (18):

类似的，信号x的能量也可以通过一系列的分解求和来获得，如公式(19)所示：Similarly, the energy of the signal x can also be obtained by a series of decomposition and summation, as shown in Equation (19):

联合得到信号x的能量，如公式(20)所示：The energy of the signal x is obtained jointly, as shown in Equation (20):

从以上讨论可以看出，信号x被分解成字典D一系列原子的和，这些原子是在每次迭代过程中与残差向量最匹配的原子。尽管分解过程非线性也非正交分解，但整个过程保持了信号的能量，类似于线性过程和正交分解过程。MP算法的具体操作步骤如下:As can be seen from the above discussion, the signal x is decomposed into a dictionary D of sums of a series of atoms that best match the residual vector during each iteration. Although the decomposition process is nonlinear and non-orthogonal decomposition, the whole process preserves the energy of the signal, similar to the linear process and the orthogonal decomposition process. The specific operation steps of the MP algorithm are as follows:

步骤1221，输入：待求信号x，具体的字典D，允许的误差值δ或最大稀疏值K；Step 1221, input: the signal to be determined x, the specific dictionary D, the allowable error value δ or the maximum sparse value K;

步骤1222，初始化：残差向量初始值R⁰x＝x，迭代次数n＝0；Step 1222, initialization: initial value of residual vector R ⁰ x=x, number of iterations n=0;

步骤1223，开始迭代计算：搜索字典D中原子，如公式(21)所示：Step 1223, start iterative calculation: search for atoms in dictionary D, as shown in formula (21):

Rⁿx＝|<Rⁿx,d_jn>|d_jn+Rⁿ⁺¹x (21)R ⁿ x=|<R ⁿ x,d _jn >|d _jn +R ⁿ⁺¹ x (21)

其中，d_jn满足

Among them, d _jn satisfies

步骤1224，判断是否继续迭代：Step 1224, determine whether to continue the iteration:

若n＞K||||Rⁿ⁺¹x||²＞δ则n＝n+1，转到步骤1223继续计算；If n>K||||R ⁿ⁺¹ x|| ² >δ, then n=n+1, go to step 1223 to continue the calculation;

若n≤K||||Rⁿ⁺¹x||²≤δ则转到步骤1225；If n≤K||||R ⁿ⁺¹ x|| ² ≤δ, go to step 1225;

步骤1225，计算结束，待求信号x如公式(22)所示：Step 1225, the calculation ends, and the signal x to be determined is shown in formula (22):

步骤S123，固定稀疏解，构建误差矩阵，采用奇异值分解，更新原子的非零系数。Step S123, fixing the sparse solution, constructing an error matrix, and using singular value decomposition to update the non-zero coefficients of atoms.

固定α更新字典，在更新字典时是一列一列地更新，并且更新该原子对应的非0系数。Fixed α to update the dictionary. When updating the dictionary, it is updated column by column, and the non-zero coefficient corresponding to the atom is updated.

使用的方法是奇异值分解(Singular Value Decomposition,SVD)，字典更新的具体计算如公式(23)：The method used is Singular Value Decomposition (SVD), and the specific calculation of dictionary update is as formula (23):

公式(23)表示固定α求字典D，使得样本X与其稀疏表示的平方差

最小。Formula (23) expresses the fixed α to find the dictionary D such that the squared difference between the sample X and its sparse representation

minimum.

假设D＝{d₁,d₂,...,d_i,...,d_K}，其中d_i为字典的第i列，当我们需要更新原子d_k时，也需要更新其对应的第k行系数

中的非零系数。此时，目标函数可以转换为公式(24)：Suppose D={d ₁ ,d ₂ ,...,d _i ,...,d _K }, where d _i is the ith column of the dictionary, when we need to update the atom d _k , we also need to update its corresponding The kth row of coefficients

nonzero coefficients in . At this point, the objective function can be transformed into formula (24):

其中，F表示为F范数，

表示原子d_k对应的第k行非零系数，E_k表示字典中未包含第k个原子时，此时所有训练样本在在该字典上的误差矩阵。where F is the F norm,

Represents the non-zero coefficient of the kth row corresponding to the atom _dk , and _Ek represents the error matrix of all training samples on the dictionary when the dictionary does not contain the kth atom.

增加两项正则项进行约束，防止过拟合，上式目标函数改写为公式(25)：Two regular terms are added to constrain and prevent overfitting. The objective function above is rewritten as formula (25):

其中，

为字典D正则项的权重因子，τ为α正则项的权重因子。in,

is the weight factor of the regular term of dictionary D, and τ is the weight factor of the regular term of α.

更新原子d_k时，其余原子固定不变。如果该误差矩阵可以近似分解为一个列乘以一个行向量的形式，从中选取最接近误差矩阵的分解形式，就可以使用该列替换未包含在字典中的第k个原子，该行就是训练样本在该原子上的投影系数，对应于该假设相对应的是著名的奇异值分解。在分解过程中，我们还需要保持相应的稀疏性。如果直接对误差矩阵进行奇异值分解，则可能导致系数扩散，不能保持稀疏性，因此不直接分解误差矩阵。针对这个问题，我们标定第k行系数

中的非零系数的位置，形成向量index^k，如公式(26)所示：When the atom d _k is updated, the remaining atoms are fixed. If the error matrix can be approximately decomposed into a column multiplied by a row vector, and the decomposed form closest to the error matrix is selected, the column can be used to replace the kth atom that is not included in the dictionary, and this row is the training sample. The projection coefficients on this atom, corresponding to this assumption, correspond to the well-known singular value decomposition. During the decomposition, we also need to maintain the corresponding sparsity. If the error matrix is directly decomposed into singular value, it may cause coefficient diffusion and cannot maintain sparsity, so the error matrix is not decomposed directly. To solve this problem, we calibrate the coefficient of the kth row

The positions of the non-zero coefficients in , form the vector index ^k , as shown in equation (26):

这些向量就是训练样本集中在：第k个原子上有所投影的样本集所在的位置。先定义矩阵

该矩阵在位置(index^k(i),i)上的元素为1，其余元素全都为0。那么设由缺失第k个原子引起的误差矩阵

可以通过公式(27)求得：These vectors are where the training samples are concentrated: the position of the projected sample set on the kth atom. First define the matrix

The element of this matrix at position (index ^k (i), i) is 1, and the rest of the elements are all 0. Then let the error matrix caused by the missing kth atom

It can be obtained by formula (27):

直接使用SVD分解新的误差矩阵

根据分解结果更新字典原子及相应位置系数，

其中，误差矩阵

是一个m×n矩阵，对误差矩阵进行奇异值分解，U是一个m×m的左奇异矩阵，Δ代表一个m×n的奇异值矩阵，除了主对角线元素以外其余元素为0，V代表一个n×n右奇异矩阵。将U中的第一列用于更新字典第k个原子d_k，将V中的第一列与Δ_(1,1)的乘积用于更新原第k行系数

中的非零系数，其中，Δ_(1,1)代表矩阵第一行第一列的元素。通过对字典中的原子逐个执行上述操作，即完成K-SVD字典学习。Decompose the new error matrix directly using SVD

Update dictionary atoms and corresponding position coefficients according to the decomposition result,

Among them, the error matrix

is an m×n matrix, which performs singular value decomposition on the error matrix, U is an m×m left singular matrix, Δ represents an m×n singular value matrix, and the remaining elements except the main diagonal elements are 0, V represents an n×n right singular matrix. Use the first column in U to update the kth atom d _k of the dictionary, and use the product of the first column in V and Δ _(1,1) to update the original kth row coefficients

The non-zero coefficients in , where Δ _(1,1) represents the element in the first row and first column of the matrix. By performing the above operations on the atoms in the dictionary one by one, the K-SVD dictionary learning is completed.

2、SVM分类模型2. SVM classification model

支持向量机(SVM)是一种新型的机器学习方法，其具有强大的分类能力、泛化能力、分类方式灵活以及特征参数数量不敏感等特点，已被广泛应用于模式识别领域。Support vector machine (SVM) is a new type of machine learning method, which has the characteristics of strong classification ability, generalization ability, flexible classification method and insensitive feature parameters, and has been widely used in the field of pattern recognition.

基于稀疏表示模型构建的字典呈现的线性标识组合，线性标识分为线性可分和不可分。而SVM分类模型可以将线性不可分的数据点映射到一个新的空间，转换为新空间中线性可分数据以进行分类。如果返回到原始数据的空间，则实际上是非线性分离的数据。The combination of linear identifiers presented by the dictionary constructed based on the sparse representation model, the linear identifiers are divided into linearly separable and inseparable. The SVM classification model can map linearly inseparable data points to a new space and convert them into linearly separable data in the new space for classification. If you go back to the space of the original data, it is actually non-linearly separated data.

采用奇异值分解的字典学习，以SAR训练集样本图像的稀疏表示得到一定的特征值作为属性，通过SVM训练得到分类结果。SVM分类模型的具体训练过程为：The dictionary learning of singular value decomposition is adopted, and certain eigenvalues are obtained from the sparse representation of the sample images of the SAR training set as attributes, and the classification results are obtained through SVM training. The specific training process of the SVM classification model is as follows:

(1)输入训练样本集T＝{(x₁,y₁),(x₂,y₂),...,(x_N,y_N)}，其中，x＝Dα，y∈{-1,+1}；(1) Input training sample set T={(x ₁ ,y ₁ ),(x ₂ ,y ₂ ),...,(x _N ,y _N )}, where x=Dα, y∈{-1 ,+1};

(2)选择惩罚参数C，求解凸二次规划

其中，

时求得最优解

i，j为参数值；(2) Select the penalty parameter C and solve the convex quadratic programming

in,

find the optimal solution

i, j are parameter values;

(3)计算

选取α^*的一个分量

满足

计算

其中，w^*为法向量，b^*为截距；(3) Calculation

pick a component of α ^*

Satisfy

calculate

where w ^* is the normal vector and b ^* is the intercept;

(4)得到分离超平面w^*x+b^*＝0，根据分类决策函数f(x)＝sign(w^*x+b^*)，获得二分类结果。(4) Obtain the separation hyperplane w ^* x+b ^* =0, and obtain the binary classification result according to the classification decision function f(x)=sign(w ^* x+b ^* ).

基于SVM方法分类模型，分别获取差分金字塔的四组图像集的分类结果，采用预设的决策方法得到图像类别，其中预设的决策方法在本实施中优选的是联合投票决策(少数服从多数决策)，也可以选择其他图片决策方式。Based on the classification model of the SVM method, the classification results of the four groups of image sets of the differential pyramid are obtained respectively, and the image categories are obtained by using a preset decision method. decision), you can also choose other picture decision methods.

3、基于稀疏表示的SAR图像目标标记方法3. SAR image target labeling method based on sparse representation

本发明实施例的一种基于稀疏表示的SAR图像目标标记方法，包括以下步骤：A sparse representation-based SAR image target marking method according to an embodiment of the present invention includes the following steps:

步骤S10，获取待标记的SAR图像，对所述SAR图像阈值分割后进行形态学处理，并提取感兴趣区域。Step S10: Obtain the SAR image to be marked, perform morphological processing after threshold segmentation of the SAR image, and extract a region of interest.

阈值分割是指根据图像的特性设置相应的门限从而提取出感兴趣目标的过程。根据图像的灰度信息进行阈值分割是应用最广泛的一种图像分割方法。图像的阈值分割主要分为两个步骤：首先，确定最佳分割阈值；其次，将像素灰度值与分割阈值比较，实现区域的归属划分。Threshold segmentation refers to the process of extracting objects of interest by setting corresponding thresholds according to the characteristics of the image. Threshold segmentation based on the grayscale information of an image is the most widely used image segmentation method. The threshold segmentation of the image is mainly divided into two steps: first, determine the optimal segmentation threshold; secondly, compare the pixel gray value with the segmentation threshold to realize the attribution of the region.

实际情况中往往有大量的噪声干扰，使得仅利用一维灰度直方图无法观测到明显的波峰和波谷，因此无法获得较好的分割效果。基于二维灰度直方图的OTSU法利用了图像的灰度信息，而且考虑了图像中各像素点与其邻域空间的相关信息，具有较好的抗噪效果。In actual situations, there is often a lot of noise interference, so that only one-dimensional grayscale histogram cannot observe obvious peaks and troughs, so a better segmentation effect cannot be obtained. The OTSU method based on two-dimensional grayscale histogram utilizes the grayscale information of the image, and considers the relevant information of each pixel in the image and its neighborhood space, and has a good anti-noise effect.

经过阈值分割和形态学处理，得到感兴趣区域。感兴趣区域是经过OTSU阈值分割和形态学处理后小于设定阈值的区域图像。After threshold segmentation and morphological processing, the region of interest is obtained. The region of interest is the image of the region that is smaller than the set threshold after OTSU threshold segmentation and morphological processing.

在本实施例中，获取待标记的SAR图像，对所述SAR图像阈值分割后进行形态学处理，并提取感兴趣区域。In this embodiment, the SAR image to be marked is acquired, the SAR image is subjected to morphological processing after threshold segmentation, and the region of interest is extracted.

步骤S20，基于提取的感兴趣区域，采用粗过滤，将所述粗过滤后的感兴趣区域作为第一区域图像，并获取其在所述SAR图像中的位置信息。In step S20, based on the extracted region of interest, coarse filtering is adopted, and the coarsely filtered region of interest is used as a first region image, and its position information in the SAR image is obtained.

在本实施例中，基于舰船的尺寸、灰度以及其邻域的灰度特性，对初选的感兴趣区域进行过滤。这种方法首先利用先验知识，将目标的尺寸、灰度以及其邻域的灰度约束在一个给定的范围内，然后利用阈值的方法对其进行过滤。这种方法尽可能的利用了目标的特征，保证了较低的漏检率。In this embodiment, based on the size of the ship, the grayscale, and the grayscale characteristics of its neighborhood, the primary selected region of interest is filtered. This method first uses prior knowledge to constrain the size, grayscale of the target and the grayscale of its neighborhood within a given range, and then uses a threshold method to filter it. This method utilizes the characteristics of the target as much as possible to ensure a low missed detection rate.

基于提取的感兴趣区域，采用粗过滤，将所述粗过滤后的感兴趣区域作为第一区域图像。所述第一区域图像为感兴趣区域经过尺度(几何)特征、空间位置特征、灰度特征后小于设定阈值的区域图像。并获取其在所述SAR图像中的位置信息。Based on the extracted region of interest, coarse filtering is adopted, and the coarsely filtered region of interest is used as the first region image. The first area image is an area image of which the area of interest is smaller than the set threshold after the scale (geometric) feature, the spatial position feature, and the grayscale feature are applied. and obtain its position information in the SAR image.

步骤S30，提取所述第一区域图像的多尺度的特征向量，分别获取不同尺度的特征向量对应的稀疏表示。Step S30: Extract multi-scale feature vectors of the first region image, and obtain sparse representations corresponding to feature vectors of different scales respectively.

稀疏表示的本质思想是模板匹配，基于稀疏表示模型构建的字典是训练样本的线性标识组合。因此，如果我们抽取的图像块含目标，那么它在字典中的表达将呈现出一定的稀疏特性。并且如果我们输入的目标块不包含目标，则图像在字典上的表示将呈现平稳且随机的性质。The essential idea of sparse representation is template matching, and the dictionary constructed based on the sparse representation model is a linear combination of training samples. Therefore, if the image patch we extract contains a target, its representation in the dictionary will show a certain sparse characteristic. And if our input target block does not contain the target, the lexicographic representation of the image will take on a stationary and random nature.

在本实例中，基于所述第一区域图像构造预设尺度的差分金字塔图像集，分别提取该图像集中不同尺度的图像的特征向量，利用稀疏表示模型获取不同尺度的特征向量对应的稀疏表示。In this example, a differential pyramid image set with a preset scale is constructed based on the first region image, feature vectors of images of different scales in the image set are respectively extracted, and sparse representations corresponding to the feature vectors of different scales are obtained by using a sparse representation model.

步骤S40，基于不同尺度的稀疏表示，通过SVM分类模型分别得到其对应的分类结果，并采用预设的决策方法，获取所述第一区域图像的类别。Step S40 , based on the sparse representation of different scales, obtain the corresponding classification results through the SVM classification model, and use a preset decision method to obtain the category of the first area image.

在本实例中，基于不同尺度的稀疏表示，通过SVM分类模型分别得到其对应的分类结果，并采用预设的决策方法选出最优的分类结果作为第一区域图像的类别。其中预设的决策方法在本实施中优选的是联合投票决策(少数服从多数决策)，也可以选择其他图片决策方式。In this example, based on sparse representations of different scales, the corresponding classification results are obtained through the SVM classification model, and a preset decision method is used to select the optimal classification result as the category of the first region image. The preset decision method is preferably joint voting decision (minority obeys majority decision) in this implementation, and other picture decision methods can also be selected.

步骤S50，根据第一区域图像的类别和位置信息，在所述待标记的SAR图像中对所述第一区域图像进行标记。Step S50: Mark the first area image in the to-be-marked SAR image according to the category and location information of the first area image.

本发明属于海洋舰船、机场飞机和阵地汽车检测，需要给出待定位的SAR图像在世界坐标系下的精确值。在本实例中，返回到原始分辨率的待鉴别的SAR图像块中，对第一区域图像采用矩形框或其他标志进行标记。The invention belongs to the detection of marine ships, airport aircraft and position vehicles, and needs to give the precise value of the SAR image to be located in the world coordinate system. In this example, returning to the original resolution SAR image block to be discriminated, the first region image is marked with a rectangular frame or other signs.

本发明第二实施例的一种基于稀疏表示的SAR图像目标标记系统，如图2所示，包括：提取模块100、过滤模块200、稀疏表示模块300、分类识别模块400、标记模块500；A sparse representation-based SAR image target labeling system according to the second embodiment of the present invention, as shown in FIG. 2 , includes: an extraction module 100, a filtering module 200, a sparse representation module 300, a classification and identification module 400, and a labeling module 500;

提取模块100，配置为获取待标记的SAR图像，对所述SAR图像阈值分割后进行形态学处理，并提取感兴趣区域；The extraction module 100 is configured to obtain a SAR image to be marked, perform morphological processing after threshold segmentation of the SAR image, and extract a region of interest;

过滤模块200，配置为基于提取的感兴趣区域，采用粗过滤，将所述粗过滤后的感兴趣区域作为第一区域图像，并获取其在所述SAR图像中的位置信息；The filtering module 200 is configured to use coarse filtering based on the extracted region of interest, and use the coarsely filtered region of interest as a first region image, and obtain its position information in the SAR image;

稀疏表示模块300，配置为提取所述第一区域图像的多尺度的特征向量，分别获取不同尺度的特征向量对应的稀疏表示；The sparse representation module 300 is configured to extract multi-scale feature vectors of the first region image, and obtain sparse representations corresponding to the feature vectors of different scales respectively;

分类识别模块400，配置为基于不同尺度的稀疏表示，通过SVM分类模型分别得到其对应的分类结果，并采用预设的决策方法，获取所述第一区域图像的类别；The classification and identification module 400 is configured to obtain the corresponding classification results through the SVM classification model based on sparse representations of different scales, and adopts a preset decision method to obtain the category of the first area image;

标记模块500，配置为根据第一区域图像的类别和位置信息，在所述待标记的SAR图像中对所述第一区域图像进行标记；The marking module 500 is configured to mark the first area image in the to-be-marked SAR image according to the category and position information of the first area image;

其中，in,

所述技术领域的技术人员可以清楚的了解到，为描述的方便和简洁，上述描述的系统的具体的工作过程及有关说明，可以参考签署方法实施例中的对应过程，在此不再赘述。Those skilled in the technical field can clearly understand that, for the convenience and brevity of description, for the specific working process and related description of the system described above, reference may be made to the corresponding process in the embodiment of the signing method, which will not be repeated here.

需要说明的是，上述实施例提供的基于稀疏表示的SAR图像目标标记系统，仅以上述各功能模块的划分进行举例说明，在实际应用中，可以根据需要而将上述功能分配由不同的功能模块来完成，即将本发明实施例中的模块或者步骤再分解或者组合，例如，上述实施例的模块可以合并为一个模块，也可以进一步拆分成多个子模块，以完成以上描述的全部或者部分功能。对于本发明实施例中涉及的模块、步骤的名称，仅仅是为了区分各个模块或者步骤，不视为对本发明的不当限定。It should be noted that the sparse representation-based SAR image target labeling system provided by the above embodiments is only illustrated by the division of the above functional modules. In practical applications, the above functions can be allocated to different functional modules as required. To complete, that is, the modules or steps in the embodiments of the present invention are decomposed or combined. For example, the modules in the above embodiments can be combined into one module, or can be further split into multiple sub-modules to complete all or part of the functions described above. . The names of the modules and steps involved in the embodiments of the present invention are only for distinguishing each module or step, and should not be regarded as an improper limitation of the present invention.

本发明第三实施例的一种存储装置，其中存储有多条程序，所述程序适用于由处理器加载并实现上述的基于稀疏表示的SAR图像目标标记方法。A storage device according to the third embodiment of the present invention stores a plurality of programs, and the programs are suitable for being loaded by a processor and implementing the above-mentioned sparse representation-based SAR image target marking method.

本发明第四实施例的一种处理装置，包括处理器、存储装置；处理器，适于执行各条程序；存储装置，适于存储多条程序；所述程序适于由处理器加载并执行以实现上述的基于稀疏表示的SAR图像目标标记方法。A processing device according to a fourth embodiment of the present invention includes a processor and a storage device; the processor is adapted to execute various programs; the storage device is adapted to store multiple programs; the programs are adapted to be loaded and executed by the processor In order to realize the above-mentioned sparse representation-based SAR image target labeling method.

所述技术领域的技术人员可以清楚的了解到，未描述的方便和简洁，上述描述的存储装置、处理装置的具体工作过程及有关说明，可以参考签署方法实例中的对应过程，在此不再赘述。Those skilled in the technical field can clearly understand that the undescribed convenience and brevity, the specific working process and related description of the storage device and the processing device described above, can refer to the corresponding process in the example of the signing method, and will not be repeated here. Repeat.

本领域技术人员应该能够意识到，结合本文中所公开的实施例描述的各示例的模块、方法步骤，能够以电子硬件、计算机软件或者二者的结合来实现，软件模块、方法步骤对应的程序可以置于随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其它形式的存储介质中。为了清楚地说明电子硬件和软件的可互换性，在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以电子硬件还是软件方式来执行，取决于技术方案的特定应用和设计约束条件。本领域技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能，但是这种实现不应认为超出本发明的范围。Those skilled in the art should be able to realize that the modules and method steps of each example described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, computer software or a combination of the two, and the programs corresponding to the software modules and method steps Can be placed in random access memory (RAM), internal memory, read only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or as known in the art in any other form of storage medium. In order to clearly illustrate the interchangeability of electronic hardware and software, the components and steps of each example have been described generally in terms of functionality in the foregoing description. Whether these functions are performed in electronic hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may use different methods of implementing the described functionality for each particular application, but such implementations should not be considered beyond the scope of the present invention.

术语“第一”、“第二”等是用于区别类似的对象，而不是用于描述或表示特定的顺序或先后次序。The terms "first," "second," etc. are used to distinguish between similar objects, and are not used to describe or indicate a particular order or sequence.

术语“包括”或者任何其它类似用语旨在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者设备/装置不仅包括那些要素，而且还包括没有明确列出的其它要素，或者还包括这些过程、方法、物品或者设备/装置所固有的要素。The term "comprising" or any other similar term is intended to encompass a non-exclusive inclusion such that a process, method, article or device/means comprising a list of elements includes not only those elements but also other elements not expressly listed, or Also included are elements inherent to these processes, methods, articles or devices/devices.

至此，已经结合附图所示的优选实施方式描述了本发明的技术方案，但是，本领域技术人员容易理解的是，本发明的保护范围显然不局限于这些具体实施方式。在不偏离本发明的原理的前提下，本领域技术人员可以对相关技术特征作出等同的更改或替换，这些更改或替换之后的技术方案都将落入本发明的保护范围之内。So far, the technical solutions of the present invention have been described with reference to the preferred embodiments shown in the accompanying drawings, however, those skilled in the art can easily understand that the protection scope of the present invention is obviously not limited to these specific embodiments. Without departing from the principle of the present invention, those skilled in the art can make equivalent changes or substitutions to the relevant technical features, and the technical solutions after these changes or substitutions will fall within the protection scope of the present invention.

Claims

1. A SAR image target marking method based on sparse representation is characterized by comprising the following steps:

step S10, acquiring an SAR image to be marked, performing morphological processing after threshold segmentation of the SAR image, and extracting an interested region;

step S20, based on the extracted region of interest, adopting coarse filtering, taking the region of interest after the coarse filtering as a first region image, and acquiring the position information of the region of interest in the SAR image;

step S30, extracting multi-scale feature vectors of the first region image, and respectively acquiring sparse representations corresponding to the feature vectors of different scales;

step S40, based on sparse representation of different scales, obtaining corresponding classification results through SVM classification models respectively, and obtaining the category of the first region image by adopting a preset decision method;

step S50, according to the category and the position information of a first area image, marking the first area image in the SAR image to be marked;

wherein,

in the training process of the SVM classification model, a characteristic value obtained by singular value decomposition of an SAR training set sample image is used as an attribute;

"respectively obtain sparse representations corresponding to feature vectors of different scales", the method is as follows: obtaining through a sparse representation model; the construction method of the sparse representation model comprises the following steps:

randomly extracting part of atoms in a training set to construct a dictionary;

the dictionary obtains an over-complete dictionary according to a preset approximation error value, and an over-complete dictionary matrix is constructed;

decomposing the overcomplete dictionary matrix based on an MP algorithm to obtain an optimal sparse solution;

constructing an error matrix according to the optimal sparse solution, decomposing the error matrix by adopting a singular value decomposition method, and updating a dictionary according to a decomposition result;

wherein, the method for decomposing the error matrix by adopting a singular value decomposition method comprises the following steps: carrying out non-zero coefficient position calibration on the error matrix to form a new error matrix, and decomposing the new error matrix by adopting a singular value decomposition method;

the singular value decomposition method carries out constraint optimization based on a regular term, and the corresponding objective function is as follows:

wherein, X represents a sample,

represents the squared difference of sample X and its sparse representation,

is the weight factor of dictionary D regular term, tau is the weight factor of alpha regular term, F is expressed as F norm,

represents an atom d_kCorresponding non-zero coefficient of k-th row, E_kWhen the representation dictionary does not contain the kth atom, all training samples are on the dictionaryThe error matrix of (a) is calculated,

k-th row coefficient, d, representing an error matrix_kAtom representing the k-th column in the dictionary, d_jRepresenting atoms in the dictionary except for column k.

2. The sparse representation-based SAR image target labeling method of claim 1, wherein in step S30, "extracting multi-scale feature vectors of the first region image" is performed by: and constructing a difference pyramid image set with a preset scale based on the first region image, and respectively extracting the feature vectors of the images with different scales in the image set.

3. The SAR image target marking method based on sparse representation as claimed in claim 1, wherein step S20 "based on the extracted region of interest, using coarse filtering to take the region of interest after coarse filtering as the first region image", wherein the coarse filtering includes geometric feature filtering, spatial position filtering, and gray feature filtering.

4. The SAR image target marking method based on sparse representation as claimed in claim 2, wherein the scale ratio between the images of adjacent scales in the differential pyramid image set is 2, and the smoothing coefficients between the images of adjacent layers of the same scale are different.

5. The SAR image target labeling method based on sparse representation as claimed in claim 1, wherein a dictionary is constructed by using training set partial atoms, and the number of the partial atoms is 50% -80% of the training set atoms.

6. A SAR image target marking system based on sparse representation is characterized by comprising an extraction module, a filtering module, a sparse representation module, a classification identification module and a marking module;

the extraction module is configured to acquire an SAR image to be marked, perform morphological processing after threshold segmentation of the SAR image, and extract an interested region;

the filtering module is configured to adopt coarse filtering based on the extracted region of interest, take the region of interest after the coarse filtering as a first region image, and acquire position information of the region of interest in the SAR image;

the sparse representation module is configured to extract multi-scale feature vectors of the first region image and respectively obtain sparse representations corresponding to the feature vectors of different scales;

the classification recognition module is configured to obtain corresponding classification results through SVM classification models based on sparse representations of different scales, and acquire the category of the first region image by adopting a preset decision method;

the marking module is configured to mark a first region image in the SAR image to be marked according to the category and the position information of the first region image;

wherein,

randomly extracting part of atoms in a training set to construct a dictionary;

constructing an error matrix according to the optimal sparse solution, decomposing the error matrix by adopting a singular value decomposition method, and updating a dictionary according to a decomposition result; wherein, the method for decomposing the error matrix by adopting a singular value decomposition method comprises the following steps: carrying out non-zero coefficient position calibration on the error matrix to form a new error matrix, and decomposing the new error matrix by adopting a singular value decomposition method;

wherein, X represents a sample,

represents the squared difference of sample X and its sparse representation,

represents an atom d_kCorresponding non-zero coefficient of k-th row, E_kRepresenting the error matrix of all training samples on the dictionary when the kth atom is not contained in the dictionary,

7. A storage device having stored therein a plurality of programs, wherein said program applications are loaded and executed by a processor to implement the sparse representation based SAR image target labeling method of any of claims 1-5.

8. A processing arrangement comprising a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; characterized in that the program is adapted to be loaded and executed by a processor to implement the sparse representation based SAR image target labeling method of any of claims 1-5.