Detailed Description
The present invention will be described in further detail with reference to specific examples, but embodiments of the present invention are not limited thereto.
Example 1
Referring to fig. 1, which is a flowchart of a SAR target recognition method based on fusion of ASC features and multi-scale depth features provided in an embodiment of the present invention, the method includes:
Step 1, acquiring original SAR complex images of an observation target, and extracting the attribute scattering centers corresponding to each SAR complex image using an improved image-domain sparse-representation ASC parameter estimation algorithm.
The flow of the existing fast attribute scattering center extraction algorithm based on image-domain sparse representation is shown in fig. 2. When estimating the parameters of the target attribute scattering centers, this algorithm performs an artificial zeroing operation: after one attribute scattering center is extracted, all pixels in the region covered by that scattering center are set to zero so that no repeated extraction can occur at the same position. This operation makes the attribute scattering center parameter estimates inaccurate. To address this problem, the present embodiment improves the fast attribute scattering center extraction algorithm based on image-domain sparse representation by removing the artificial zeroing operation that causes the inaccurate parameter estimation results.
In this embodiment, the improved image-domain sparse-representation ASC parameter estimation algorithm is used to extract the scattering centers of a SAR image S. The number of extracted scattering center points Q is set to 25, and each scattering center point corresponds to a feature vector; the feature vector of the i-th scattering center can be expressed as

$$\theta_i = [A_i, \alpha_i, x_i, y_i, L_i, \bar{\varphi}_i, \gamma_i]$$

where $A_i$ denotes the complex amplitude, $\alpha_i$ the frequency-dependence factor, $x_i$ and $y_i$ the position coordinates in the range and azimuth directions respectively, $L_i$ the length of the scattering center, and $\bar{\varphi}_i$ and $\gamma_i$ the orientation angle and azimuth-dependence factor of the scattering center, respectively.
Specifically, referring to fig. 3, which is a flowchart of the improved fast attribute scattering center extraction algorithm based on image-domain sparse representation according to an embodiment of the present invention, the algorithm includes:
11) First, the SAR echo signals are converted into the image domain. The problem to be solved by the improved fast attribute scattering center extraction algorithm based on image-domain sparse representation is still to estimate the attribute scattering center parameters of the target from the target's backscattered echoes, namely the number Q of attribute scattering centers forming the complex target and the parameter sets $\theta_m$, $m = 1, 2, \ldots, Q$, of those attribute scattering centers. This problem can be described as follows:
$$\min_{\{\sigma_q,\,\theta_q\}} Q \quad \text{s.t.} \quad \left\| E(f_k,\bar{\varphi}_h) - \sum_{q=1}^{Q} \sigma_q\, E(f_k,\bar{\varphi}_h;\theta_q) \right\|_2 \le \varepsilon \tag{1}$$

where f denotes the radar operating frequency, $\bar{\varphi}$ the aspect-angle range of the synthetic aperture, $f_k$ and $\bar{\varphi}_h$ the discretized frequency and aspect-angle samples, K and H the numbers of discrete points in the frequency and azimuth directions, x and y the pixel coordinates after conversion to the image domain, $f_0$ the center frequency, c the speed of light, $E(f,\bar{\varphi})$ the backscattered echo data of the target, $E(f,\bar{\varphi};\theta_q)$ the echo data of the q-th attribute scattering center, Q the total number of attribute scattering centers, $\sigma_q$ a sparse coefficient, $\theta_q$ the parameter set of the q-th scattering center, and $\varepsilon > 0$ an error coefficient; $E(f_k,\bar{\varphi}_h)$ and $E(f_k,\bar{\varphi}_h;\theta_q)$ denote the discretized versions of $E(f,\bar{\varphi})$ and $E(f,\bar{\varphi};\theta_q)$.

Equation (1) remains a sparse representation problem. By applying the same linear imaging operator $\beta\{\cdot\}$ to $E(f,\bar{\varphi})$ and $E(f,\bar{\varphi};\theta_q)$, as expressed in equation (2), equation (1) can be converted into its image-domain form, as expressed in equation (3):

$$S(x,y) = \beta\{E(f,\bar{\varphi})\}, \qquad D(x,y;\theta_q) = \beta\{E(f,\bar{\varphi};\theta_q)\} \tag{2}$$

$$\min_{\{\sigma_q,\,\theta_q\}} Q \quad \text{s.t.} \quad \left\| S(x,y) - \sum_{q=1}^{Q} \sigma_q\, D(x,y;\theta_q) \right\|_2 \le \varepsilon \tag{3}$$

where $S(x,y)$ and $D(x,y;\theta_q)$ denote the image-domain representations corresponding to $E(f,\bar{\varphi})$ and $E(f,\bar{\varphi};\theta_q)$, respectively.
12) The NOMP algorithm is used to solve for the attribute scattering center parameters of S(x,y), with a parameter fine-correction process added to the solving procedure, yielding a number of attribute scattering centers.
First, an initial dictionary is established and the residual image R(x,y) is initialized as R(x,y) = S(x,y). Then the optimized NOMP algorithm is used to extract the attribute scattering centers. The extraction of the attribute scattering centers of the observed target proceeds in four steps: atom selection, fine estimation of atom parameters, least-squares solution, and residual computation.
Specifically, step 12) includes:
a) An initial dictionary Φ is established, with the expression:

$$\Phi = \left\{ \bar{D}(x,y;\theta) : \theta \in \Theta_{loc} \cup \Theta_{dis} \right\}$$

where

$$\Theta_{loc} = \Theta_A \times \Theta_\alpha^{loc} \times \Theta_L^{loc} \times \Theta_{\bar{\varphi}}^{loc} \times \Theta_\gamma^{loc} \times \Theta_x \times \Theta_y$$
$$\Theta_{dis} = \Theta_A \times \Theta_\alpha^{dis} \times \Theta_L^{dis} \times \Theta_{\bar{\varphi}}^{dis} \times \Theta_\gamma^{dis} \times \Theta_x \times \Theta_y$$

Here $\bar{D}(x,y;\theta)$ denotes a normalized attribute scattering center image, and $\Theta_{loc}$ and $\Theta_{dis}$ denote the parameter sets corresponding to localized and distributed attribute scattering centers, respectively. The component sets correspond to the parameters A, α, L, $\bar{\varphi}$, γ, x, y, where A denotes the complex amplitude, α the frequency-dependence factor, x and y the position coordinates in the range and azimuth directions respectively, L the length of the scattering center, and $\bar{\varphi}$ and γ the orientation angle and azimuth-dependence factor of the scattering center respectively; "×" denotes the Cartesian product.
When the initial dictionary Φ is built, let $\Theta_A = \{1\}$, $\Theta_\alpha^{loc} = \{0\}$, $\Theta_\alpha^{dis} = \{0\}$, $\Theta_L^{loc} = \{0\}$, $\Theta_L^{dis} = \{2\Delta L, 4\Delta L, \ldots, 2N_L\Delta L\}$, $\Theta_\gamma^{loc} = \{0\}$ and $\Theta_\gamma^{dis} = \{0\}$. This is because the values of the frequency-dependence factor α and the azimuth-dependence factor γ have little influence on the attribute scattering center echo signal and can temporarily be ignored, so $\Theta_\alpha^{loc}$, $\Theta_\alpha^{dis}$, $\Theta_\gamma^{loc}$ and $\Theta_\gamma^{dis}$ are set to {0}. The initial dictionary Φ is thus built.
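For concreteness, the following sketch illustrates how such a discretized dictionary could be assembled as a Cartesian product of parameter grids. It is a minimal illustration under stated assumptions, not the patent's implementation: the grid values (N_L, ΔL, the orientation-angle samples) and all helper names are assumptions.

```python
# Minimal sketch of the dictionary initialization in step a); all grid values and
# names are illustrative assumptions, not the patent's actual settings.
import itertools
import numpy as np

N_L, dL = 4, 0.3          # assumed number of length samples and length step (m)
Nx, Ny = 128, 128         # image grid matching the 128x128 SAR chips

Theta_A       = [1.0]
Theta_a_loc   = [0.0]; Theta_a_dis = [0.0]   # frequency-dependence factor alpha ~ 0
Theta_L_loc   = [0.0]                        # localized ASCs have zero length
Theta_L_dis   = [2 * n * dL for n in range(1, N_L + 1)]  # {2dL, 4dL, ..., 2*N_L*dL}
Theta_g_loc   = [0.0]; Theta_g_dis = [0.0]   # azimuth-dependence factor gamma ~ 0
Theta_phi_dis = list(np.deg2rad(np.arange(-45, 46, 5)))  # assumed orientation grid
Theta_x, Theta_y = range(Nx), range(Ny)

def build_param_sets():
    """Theta_loc and Theta_dis as Cartesian products (A, alpha, L, phi_bar, gamma, x, y)."""
    loc = itertools.product(Theta_A, Theta_a_loc, Theta_L_loc, [0.0],
                            Theta_g_loc, Theta_x, Theta_y)
    dis = itertools.product(Theta_A, Theta_a_dis, Theta_L_dis, Theta_phi_dis,
                            Theta_g_dis, Theta_x, Theta_y)
    return list(loc) + list(dis)   # materialize so the set can be scanned repeatedly
```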
b) Atom selection is performed on the initial dictionary Φ: the atom $\theta_{i\_chose}$ having the greatest similarity to the current residual image R(x,y) is selected from Φ as the coarse estimate of the current i-th ASC parameter set.
Specifically, for a given initial dictionary Φ, the first step of the NOMP algorithm is to select from Φ the atom that best matches the residual image R(x,y), i.e., has the largest inner product with it, as shown in the following formula:

$$\theta_i = \arg\max_{\theta \in \Phi} \left| \sum_{x,y} R(x,y)\, \bar{D}^{*}(x,y;\theta) \right|$$

where $(\cdot)^{*}$ denotes the conjugate operation.
c) Judge whether the position parameters of the currently selected atom $\theta_{i\_chose}$ are the same as those of the atom selected in the previous round; if so, discard $\theta_{i\_chose}$ and, after removing it, reselect the atom with the greatest similarity as this round's coarse estimate; otherwise, execute step d).
Specifically, this embodiment optimizes the atom selection method. Denote the currently selected dictionary atom by $\theta_{i\_chose}$, i.e., $\theta_{i\_chose} = \theta_i$, and denote the dictionary atom selected in the previous execution of step b) by $\theta_{i\_last}$. Compare whether $\theta_{i\_chose}$ and $\theta_{i\_last}$ are the same atom: if $\theta_{i\_last} \neq \theta_{i\_chose}$, continue with the following steps for parameter estimation; if $\theta_{i\_last} = \theta_{i\_chose}$, give up the atom with the greatest current similarity and, after removing $\theta_{i\_last}$, select the atom with the greatest similarity as this round's $\theta_{i\_chose}$.
d) Taking the coarse estimate $\theta_{i\_chose}$ as the initial point, perform fine estimation of the i-th ASC parameters to obtain the fine estimate $\theta_{i,opt}$, and put the corresponding atom into the selected atom set $\Phi_{Gen}$.
Specifically, since most attribute scattering center parameters take continuous values, the parameters obtained in step b) are inaccurate and require further refined estimation. Therefore, taking the coarse estimate $\theta_{i\_chose}$ of the current i-th ASC parameters obtained in step b) as the initial point, the following equation is solved by Newton's method:

$$\theta_{i,opt} = \arg\max_{\theta \setminus A} \left| \sum_{x,y} R(x,y)\, \bar{D}^{*}(x,y;\theta) \right|$$

where $\theta \setminus A$ denotes the set of remaining parameters after removing the amplitude parameter A.
Denote by $\Phi_{Gen}$ the set of attribute scattering center images corresponding to the refined parameters $\theta_{i,opt}$, initialized as $\Phi_{Gen} = \varnothing$; after each fine estimation, the selected atom set is updated as $\Phi_{Gen} = \Phi_{Gen} \cup \{\bar{D}(x,y;\theta_{i,opt})\}$.
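A sketch of how the fine estimation of step d) might look in practice is given below, assuming a hypothetical renderer `render(theta)` that produces the normalized image-domain atom $\bar{D}(x,y;\theta)$; a derivative-free optimizer stands in for the Newton iteration named in the text.

```python
# Sketch of step d): refine the coarse atom parameters by maximizing |<R, D(theta)>|.
# `render` is an assumed helper producing the normalized image-domain atom; a
# derivative-free optimizer stands in for the Newton iteration named in the text.
import numpy as np
from scipy.optimize import minimize

def refine_atom(R, theta0, render):
    """theta0 excludes the amplitude A, which is recovered later by least squares."""
    def neg_corr(t):
        D = render(t)                    # normalized ASC image for parameters t
        return -abs(np.vdot(D, R))       # negative |inner product| with the residual
    res = minimize(neg_corr, theta0, method="Nelder-Mead")
    return res.x
```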
e) The input image S(x,y) is approximated by the atoms in $\Phi_{Gen}$, and the sparse coefficients are solved by the least-squares method.
Specifically, using least-squares estimation, the input S(x,y) is approximated by the atoms in $\Phi_{Gen}$, as shown in the following equation:

$$\sigma_{opt} = \arg\min_{\sigma} \left\| S(x,y) - \sum_{j=1}^{i} \sigma_j\, \bar{D}(x,y;\theta_{j,opt}) \right\|_2^2$$

where $\sigma = \{\sigma_1, \ldots, \sigma_i\}$ denotes the set of coefficients corresponding to each atom, and $\sigma_{opt}$ denotes the optimal coefficients for approximating the input S(x,y) using $\Phi_{Gen}$.
f) The residual image is updated according to the sparse coefficients.
Specifically, the update formula is:

$$R(x,y) = S(x,y) - \sum_{j=1}^{i} \sigma_{j,opt}\, \bar{D}(x,y;\theta_{j,opt})$$

Note that the residual is always recomputed from the original image S(x,y) rather than by zeroing pixels, which is what removes the artificial zeroing operation of the original algorithm.
g) Repeat steps b) through f) until no valid attribute scattering center can be extracted from the current residual image R(x,y), then exit the loop.
At this point, the extraction of the attribute scattering centers of the observed target using the improved algorithm is complete.
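Putting steps a) through g) together, a condensed sketch of the improved extraction loop might look as follows; `dictionary`, `render` and `refine` (e.g., the refine_atom sketch above) are assumed helpers, and the stopping rule on residual energy is an illustrative choice.

```python
# Condensed sketch of the improved NOMP loop (steps b)-g)); names are illustrative.
import numpy as np

def improved_nomp(S, dictionary, render, refine, q_max=25, tol=1e-3):
    R = S.copy()                              # residual initialized to the input image
    chosen, atoms, t_last, sigma = [], [], None, None
    for _ in range(q_max):
        # b) coarse selection: atom with the largest |inner product| with the residual
        scores = sorted(((abs(np.vdot(render(t), R)), t) for t in dictionary),
                        reverse=True, key=lambda s: s[0])
        t_chose = scores[0][1]
        # c) if the position (x, y) repeats the previous pick, take the runner-up
        if t_last is not None and t_chose[-2:] == t_last[-2:]:
            t_chose = scores[1][1]
        t_last = t_chose
        # d) fine estimation from the coarse initial point, then grow Phi_Gen
        t_opt = refine(R, np.asarray(t_chose, dtype=float), render)
        chosen.append(t_opt)
        atoms.append(render(t_opt).ravel())
        # e) least-squares coefficients over ALL atoms selected so far
        Phi = np.stack(atoms, axis=1)
        sigma, *_ = np.linalg.lstsq(Phi, S.ravel(), rcond=None)
        # f) residual recomputed from the ORIGINAL image -- no pixel zeroing
        R = (S.ravel() - Phi @ sigma).reshape(S.shape)
        # g) stop once no significant energy remains in the residual
        if np.linalg.norm(R) < tol * np.linalg.norm(S):
            break
    return chosen, sigma
```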
Preferably, in the present embodiment, 25 attribute scattering centers are extracted for each SAR complex image using the above method.
The method improves and optimizes on the shortcomings of the recently proposed fast image-domain ASC extraction algorithm: by changing how the residual image is processed when solving for the sparse coefficients and how the initial dictionary atoms are selected, the artificial zeroing operation is removed from the target ASC extraction process, so that the target's ASC parameter estimates are more accurate and the target reconstruction error is smaller.
Step 2, performing global and local image reconstruction from the attribute scattering centers, and obtaining global and local reconstructed images of different scales by downsampling.
21) The attribute scattering center parameters of the observed target obtained in step 1 are substituted into the defining expression of the attribute scattering center model to obtain the overall backscattered echo data of the target and the backscattered echo data of each individual attribute scattering center.
Specifically, according to the definition of the attribute scattering center model, the backscattered echo of a target in the high-frequency region can be regarded as the superposition of the echoes of many independent scattering points, in the following specific form:

$$E(f,\bar{\varphi}) = \sum_{q=1}^{Q} E(f,\bar{\varphi};\theta_q) + n(f,\bar{\varphi}) \tag{12}$$

where $E(f,\bar{\varphi})$ denotes the frequency-domain backscattered echo signal of the observed target, Q the total number of attribute scattering centers constituting the current complex target, and $\theta_q$ the parameter set of the q-th attribute scattering center, whose elements from left to right are the backscattering coefficient, frequency-dependence factor, length, orientation angle, azimuth-dependence factor, range coordinate and azimuth coordinate; $E(f,\bar{\varphi};\theta_q)$ denotes the echo signal of the q-th individual attribute scattering center, and $n(f,\bar{\varphi})$ denotes additive white Gaussian noise. The specific form of $E(f,\bar{\varphi};\theta_q)$ is:

$$E(f,\bar{\varphi};\theta_q) = A_q \left(j\frac{f}{f_0}\right)^{\alpha_q} \exp\!\left(-j\frac{4\pi f}{c}\left(x_q\cos\bar{\varphi} + y_q\sin\bar{\varphi}\right)\right) \mathrm{sinc}\!\left(\frac{2\pi f}{c}L_q\sin(\bar{\varphi}-\bar{\varphi}_q)\right) \exp\!\left(-2\pi f \gamma_q \sin\bar{\varphi}\right) \tag{13}$$

where $\mathrm{sinc}(\cdot) = \sin(\cdot)/(\cdot)$, $f_0$ denotes the radar operating center frequency, c the speed of light, $A_q$ the backscattering coefficient, $\alpha_q$ the frequency-dependence factor, $x_q$ and $y_q$ the position coordinates in the range and azimuth directions respectively, and $L_q$, $\bar{\varphi}_q$ and $\gamma_q$ the length, orientation angle and azimuth-dependence factor of the attribute scattering center, respectively.
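As a reference for the model itself, the following sketch transcribes equation (13) directly; the default center frequency is an assumed X-band, MSTAR-like value, and the function name is illustrative.

```python
# Direct transcription of equation (13), vectorized over a frequency/aspect grid;
# the default center frequency f0 is an assumption (X-band, MSTAR-like).
import numpy as np

def asc_echo(f, phi, A, alpha, x, y, L, phi_bar, gamma, f0=9.6e9, c=3e8):
    """Backscattered echo E(f, phi; theta) of one attributed scattering center.
    f: frequency grid (Hz); phi: aspect-angle grid (rad)."""
    term_freq = (1j * f / f0) ** alpha                                # (j f/f0)^alpha
    term_pos  = np.exp(-1j * 4 * np.pi * f / c
                       * (x * np.cos(phi) + y * np.sin(phi)))         # position phase
    # np.sinc(u) = sin(pi u)/(pi u), so passing u = 2 f L sin(.)/c realizes
    # sinc(2 pi f L sin(.)/c) with sinc(v) = sin(v)/v as in equation (13)
    term_len  = np.sinc(2 * f * L / c * np.sin(phi - phi_bar))
    term_azi  = np.exp(-2 * np.pi * f * gamma * np.sin(phi))          # azimuth dependence
    return A * term_freq * term_pos * term_len * term_azi

# Superposition per equation (12): E_total = sum over q of asc_echo(...) + noise
```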
Substituting the estimated scattering center parameters of the observed target into equation (13) yields the backscattered echo $E(f,\bar{\varphi};\theta_q)$ of each single scattering point; recombining them according to equation (12) yields the target backscattered echo reconstructed from the extracted attribute scattering center results; and applying the linear imaging operator $\beta\{\cdot\}$ to the frequency-domain echo yields the reconstructed image S(x,y).
22) The linear imaging operator is respectively applied to the overall backscattered echo data of the target and to the backscattered echo data of each individual attribute scattering center, correspondingly obtaining a global reconstruction map and local reconstruction maps of the individual scattering points;
Both the global reconstruction and the local reconstructions have a size of 128×128, identical to the original SAR complex image size, and 25 local reconstructions are obtained because 25 attribute scattering centers are extracted in this embodiment.
23) The global reconstruction of size 128×128 is downsampled to obtain a global reconstruction S_recon_all_64 of size 64×64 and a global reconstruction S_recon_all_32 of size 32×32, and the local reconstructions of size 128×128 are downsampled to obtain local reconstructions S_recon_single_64 of size 64×64.
It can be understood that, in this embodiment, corresponding thresholds are further set for the three kinds of images obtained after the downsampling operation, so as to obtain three kinds of binarized images, specifically as follows:
For the global reconstruction S_recon_all_64, a threshold t = 0.01 is set; all pixel values of S_recon_all_64 are compared with the threshold t, points with pixel values greater than t are set to 255, and points with pixel values less than t are set to 0, yielding a binary image B_recon_all_64.

For the 25 local reconstructions S_recon_single_64, a threshold t = 0.01 is set and each of the 25 local reconstructions is binarized separately: all pixel values of each S_recon_single_64 are compared with the threshold t, points with pixel values greater than t are set to 255, and points with pixel values less than t are set to 0, yielding 25 binary images B_recon_single_64.

For the global reconstruction S_recon_all_32, a threshold t = 0.01 is set; all pixel values of S_recon_all_32 are compared with the threshold t, points with pixel values greater than t are set to 255, and points with pixel values less than t are set to 0, yielding a binary image B_recon_all_32.
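The downsampling and thresholding above can be illustrated with a short sketch; since the text does not specify the downsampling method, 2×2 block averaging is used here as an assumed stand-in, and the placeholder input is purely illustrative.

```python
# Sketch of step 23) plus the thresholding; 2x2 block averaging is an assumed
# stand-in for the unspecified downsampling method.
import numpy as np

def downsample2(img):
    """Halve each spatial dimension by 2x2 block averaging."""
    h, w = img.shape
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def binarize(img, t=0.01):
    """Pixels with value > t become 255, the rest 0, as described above."""
    return np.where(img > t, 255, 0).astype(np.uint8)

S_recon_all_128 = np.abs(np.random.rand(128, 128))  # placeholder 128x128 reconstruction
S_recon_all_64  = downsample2(S_recon_all_128)      # 64 x 64
S_recon_all_32  = downsample2(S_recon_all_64)       # 32 x 32
B_recon_all_64  = binarize(S_recon_all_64)
B_recon_all_32  = binarize(S_recon_all_32)
```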
Step 3, constructing a deep neural network comprising a feature extraction module and a feature fusion module, wherein,
The feature extraction module is used for carrying out multi-scale feature extraction on the amplitude image corresponding to the original complex SAR image to obtain a multi-scale depth feature map;
the feature fusion module is used to perform feature fusion at different layers between the extracted multi-scale depth feature maps and the binary images corresponding to the reconstructed images.
First, a feature extraction module is constructed.
In this embodiment, the constructed feature extraction module comprises 12 convolution layers and 3 max-pooling layers. Its structure is, in order: a first convolution layer L_C1, a second convolution layer L_C2, a third-layer max-pooling layer L_p3, a fourth convolution layer L_C4, a fifth convolution layer L_C5, a sixth-layer max-pooling layer L_p6, a seventh convolution layer L_C7, an eighth convolution layer L_C8, a ninth convolution layer L_C9, a tenth-layer max-pooling layer L_p10, an eleventh convolution layer L_C11, a twelfth convolution layer L_C12, a thirteenth convolution layer L_C13, a fourteenth convolution layer L_C14 and a fifteenth convolution layer L_C15, wherein the network structure of the first 13 layers is the same as that of the first 13 layers of the VGG16Net network.
The depth global feature output by the fifth convolution layer L_C5 is used as the first-scale depth feature to be fused extracted by the feature extraction module;

the depth global feature output by the ninth convolution layer L_C9 is used as the second-scale depth feature to be fused extracted by the feature extraction module;

the depth global feature output by the fifteenth convolution layer L_C15 is used as the third-scale depth feature to be fused extracted by the feature extraction module.
Specifically, the parameters of each layer are set as follows: the numbers of convolution kernels of the 12 convolution layers are set, in order, to 64, 64, 128, 128, 256, 256, 256, 512, 512, 512, 256 and 32; the convolution kernel sizes are all set to 3×3, the convolution strides are all set to 1, and ReLU is used as the activation function; the kernel sizes of the 3 max-pooling layers are all 2×2 with stride 2.
Because the first 13 layers of the feature extraction module constructed in this embodiment are identical in structure to the first 13 layers of the VGG16Net network, when the network is subsequently trained, the parameters trained on the ImageNet dataset can be used as the initial parameters of the first 13 layers of the feature extraction network in this embodiment.
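A PyTorch sketch of this module is given below, assuming the layer grouping described above. The single-channel amplitude input is an assumption made for illustration (seeding from ImageNet weights would require adapting the 3-channel first layer, e.g., by replicating the amplitude image).

```python
# PyTorch sketch of the 15-layer feature extraction module; the first 13 layers
# mirror VGG16's first 13 layers. The single-channel input is an assumption.
import torch
import torch.nn as nn

def conv(cin, cout):
    return nn.Sequential(nn.Conv2d(cin, cout, kernel_size=3, stride=1, padding=1),
                         nn.ReLU(inplace=True))

class FeatureExtractor(nn.Module):
    def __init__(self):
        super().__init__()
        self.stage1 = nn.Sequential(conv(1, 64), conv(64, 64))        # L_C1, L_C2
        self.pool3  = nn.MaxPool2d(2, 2)                              # L_p3
        self.stage2 = nn.Sequential(conv(64, 128), conv(128, 128))    # L_C4, L_C5
        self.pool6  = nn.MaxPool2d(2, 2)                              # L_p6
        self.stage3 = nn.Sequential(conv(128, 256), conv(256, 256),
                                    conv(256, 256))                   # L_C7..L_C9
        self.pool10 = nn.MaxPool2d(2, 2)                              # L_p10
        self.stage4 = nn.Sequential(conv(256, 512), conv(512, 512), conv(512, 512),
                                    conv(512, 256), conv(256, 32))    # L_C11..L_C15

    def forward(self, x):                    # x: (N, 1, 128, 128) amplitude image
        f5  = self.stage2(self.pool3(self.stage1(x)))  # (N, 128, 64, 64) scale 1
        f9  = self.stage3(self.pool6(f5))              # (N, 256, 32, 32) scale 2
        f15 = self.stage4(self.pool10(f9))             # (N, 32, 16, 16)  scale 3
        return f5, f9, f15
```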
Then, a feature fusion module is constructed.
In this embodiment, the feature fusion module comprises a local feature map layer fusion unit and an overall feature map layer fusion unit, wherein,
the local feature map layer fusion unit fuses the first-scale and second-scale depth features to be fused, extracted by the feature extraction module, with the binary images corresponding to the three different reconstructed images in multiple fusion operations, correspondingly obtaining three fusion feature maps;
the overall feature map layer fusion unit is used to fuse the three fusion feature maps with the third-scale depth feature to be fused extracted by the feature extraction module, obtaining the final fusion feature.
Specifically, as shown in fig. 4, local feature fusion is performed three times in this embodiment: the depth global feature of layer 5 is fused with the 25 binary images B_recon_single_64 reconstructed from single scattering points; the depth global feature of layer 5 is fused with the binary image B_recon_all_64 of the overall reconstruction; and the depth global feature of layer 9 is fused with the binary image B_recon_all_32 of the overall reconstruction. The specific operations are as follows:
Step 1, first fusion:

The binary images B_recon_single_64 reconstructed from single scattering points have a total size of 64×64×25. They are multiplied along the channel dimension with the layer-5 depth global feature of the feature extraction module, of size 64×64×128, to obtain a fusion feature of size 64×64×25×128, and a global average pooling (Global Average Pooling, GAP) operation is performed on this fusion feature to obtain a feature of size 25×128.
In order to compress these 25 component features into one vector, statistical functions can be applied at corresponding positions of the 25 vectors. Considering representativeness and computational cost, this embodiment uses the max(·) and mean(·) statistical functions to complete the vector compression, as shown in equation (14):
C(·)=max(·)+mean(·) (14)
where C(·) denotes the fusion mode used: max(·) and mean(·) are applied at the corresponding positions of the 25 local component feature vectors, and the results are added, finally yielding a feature of size 1×128, namely the first fusion feature.
Step 2, second fusion:

The overall-reconstruction binary image B_recon_all_64 of size 64×64 is multiplied along the channel dimension with the layer-5 depth global feature of size 64×64×128 to obtain a 64×64×128 fusion feature, and a GAP operation is then performed on it to obtain a 1×128-dimensional feature, namely the second fusion feature.
Step 3, third fusion:

The overall-reconstruction binary image B_recon_all_32 of size 32×32 is multiplied along the channel dimension with the layer-9 depth global feature of size 32×32×256 to obtain a 32×32×256 fusion feature, and a GAP operation is then performed on it to obtain a 1×256-dimensional feature, namely the third fusion feature.
At the overall feature map layer, four features are fused in this embodiment: the final depth feature output by the feature extraction module, of size 16×16×32, which after a GAP operation gives a global network feature of size 1×32 (i.e., the third-scale depth feature to be fused); the 1×128-dimensional first fusion feature generated by the first local fusion; the 1×128-dimensional second fusion feature generated by the second local fusion; and the 1×256-dimensional third fusion feature generated by the third local fusion. These four features are concatenated along the feature dimension to yield an overall fusion feature of size 1×544.
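The three local fusions and the overall concatenation can be sketched as follows, following the shapes stated above; the binary maps are assumed rescaled to {0, 1} before multiplication, and f5/f9/f15 refer to the FeatureExtractor sketch above. All names are illustrative.

```python
# Sketch of the fusion module following the shapes above; binary maps are assumed
# rescaled to {0, 1}. f5/f9/f15 come from the FeatureExtractor sketch above.
import torch

def fuse(f5, f9, f15, B_single_64, B_all_64, B_all_32):
    # first fusion: (N,25,64,64) x (N,128,64,64) -> (N,25,128,64,64), GAP -> (N,25,128)
    v = (B_single_64.unsqueeze(2) * f5.unsqueeze(1)).mean(dim=(-2, -1))
    fus1 = v.max(dim=1).values + v.mean(dim=1)        # C(.) = max(.) + mean(.) -> (N,128)
    # second fusion: global 64x64 binary map x layer-5 features, then GAP -> (N,128)
    fus2 = (B_all_64.unsqueeze(1) * f5).mean(dim=(-2, -1))
    # third fusion: global 32x32 binary map x layer-9 features, then GAP -> (N,256)
    fus3 = (B_all_32.unsqueeze(1) * f9).mean(dim=(-2, -1))
    g = f15.mean(dim=(-2, -1))                        # GAP on the final feature -> (N,32)
    return torch.cat([g, fus1, fus2, fus3], dim=1)    # (N, 544)
```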
And finally, constructing a fully-connected network to classify the final fusion characteristics to obtain a target classification result.
The fully connected network FC comprises two fully connected layers, an activation layer, a Dropout layer and a classifier layer. Its structure is, in order: a first fully connected layer L_F1, a second-layer activation layer L_F2, a third-layer Dropout layer L_d3, a fourth fully connected layer L_F4 and a fifth classification layer L_F5. The input of the network is the fused 544-dimensional feature vector, and the output is a 3-dimensional category prediction vector $\hat{y}$.
The parameters of each layer are set as follows: the dimensions of the two fully connected layers are 544×512 and 512×3, the activation layer uses the ReLU activation function, the drop probability of the Dropout layer is 0.5, and the classifier layer uses a softmax classifier.
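A corresponding sketch of this classification head is shown below; folding the softmax of L_F5 into the training loss is a conventional implementation choice, not something the text mandates.

```python
# Sketch of the FC head; the softmax of L_F5 is folded into the cross-entropy loss.
import torch.nn as nn

classifier = nn.Sequential(
    nn.Linear(544, 512),   # L_F1
    nn.ReLU(),             # L_F2
    nn.Dropout(p=0.5),     # L_d3
    nn.Linear(512, 3),     # L_F4 (logits; softmax applied in the loss / at inference)
)
```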
The above parts are combined together in the order shown in fig. 4 to obtain the deep neural network ψ.
It can be appreciated that after the deep neural network is built, the network needs to be trained, and then the trained network is used for target recognition.
In this embodiment, the deep neural network is trained in the following manner:
Extracting attribute scattering centers from the actual measured SAR complex images with the labels, performing global and local reconstruction, and obtaining global and local reconstruction images with different scales in a downsampling mode;
Inputting the labeled SAR complex images and the binary images corresponding to the global and local reconstruction images with different scales into a constructed deep neural network for forward propagation as shown in fig. 4;
Calculating the classification loss, and updating the network parameters by back propagation to obtain the trained network, where the classification loss uses the cross-entropy loss function shown below:

$$L = -\frac{1}{n} \sum_{i=1}^{n} y_i^{T} \log \hat{y}_i$$

where n denotes the number of training samples, $y_i$ the one-hot-encoded category label of the i-th input image, and $\hat{y}_i$ the corresponding predicted category vector.
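A minimal training-loop sketch under this loss is shown below; the optimizer, learning rate, and the `net`/`loader` objects are assumptions, with `net` standing for the combined extractor, fusion and classifier sketched above.

```python
# Minimal training sketch; `net` (extractor + fusion + classifier) and `loader`
# are assumed objects, and the optimizer/learning rate are illustrative choices.
import torch
import torch.nn.functional as F

optimizer = torch.optim.Adam(net.parameters(), lr=1e-4)
for images, b_single, b_all64, b_all32, labels in loader:
    logits = net(images, b_single, b_all64, b_all32)   # forward propagation (fig. 4)
    loss = F.cross_entropy(logits, labels)             # cross-entropy classification loss
    optimizer.zero_grad()
    loss.backward()                                    # back propagation
    optimizer.step()
```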
According to the method, the physical properties of the target reflected by the ASC model are fully utilized: the accurately estimated ASC parameter sets are used to perform multiple types of reconstruction of the observed target in the image domain and are fused with depth features of different scales, providing more information to the network and thereby improving target recognition performance.
Step 4, inputting the original complex SAR images of the test data and the binary images corresponding to their reconstructed images into the trained deep neural network for processing, and outputting the target recognition results.
In the deep-network training process, the invention uses parameter migration: the parameters of the first 13 layers of a VGG16Net fully trained on the ImageNet dataset are used as the initial parameters of the deep network, so that the network can achieve good recognition performance with fewer training iterations, which also improves training efficiency.
Example 2
The method provided by the invention is simulated taking a specific scene as an example, in order to verify the effectiveness of the attribute scattering center extraction algorithm of the invention.
Specifically, in this experiment, the T72 original SAR image shown in fig. 5 is taken as an extraction object, and the method of the present invention is used to extract the attribute scattering center and reconstruct the image, and the results are shown in fig. 6 and 7.
Fig. 6 shows the results of attribute scattering center extraction and image reconstruction performed on the T72 original SAR image using the algorithm of the present invention, wherein (a) is the global reconstruction of size 128×128 pixels reconstructed using the 25 extracted scattering center points, (b) is the global reconstruction of size 64×64 pixels obtained by downsampling, (c) is a local reconstruction of size 64×64 pixels obtained by downsampling, and (d) is the global reconstruction of size 32×32 pixels obtained by downsampling.
Fig. 7 shows the binarized images corresponding to the reconstruction results in fig. 6, wherein (a) is the global-reconstruction binarized image of size 64×64 pixels obtained by downsampling, (b) is the local-reconstruction binarized image of size 64×64 pixels obtained by downsampling, and (c) is the global-reconstruction binarized image of size 32×32 pixels obtained by downsampling.
In order to further verify the SAR target recognition method based on fusion of ASC features and multi-scale depth features, this embodiment also evaluates the method on the public MSTAR moving and stationary target dataset.
The MSTAR dataset used in the experiment consists of complex images with a resolution of 0.3 m × 0.3 m, each of size 128×128 pixels. This experiment uses the MSTAR three-category recognition scenario, with the three target categories being T72, BMP2 and BTR70. BMP2 and T72 each contain three different serial numbers; the training data for each class contains only one serial number of each type, while the test data contains all serial numbers of each type. The experimental data are detailed in table 1.
TABLE 1 MSTAR class 3 target recognition scenario
Table 2 below shows the recognition results of the method of the present invention on the MSTAR three-category recognition data shown in table 1 above, compared with two existing recognition methods: the SAR ATR method combining attributed scattering centers with a convolutional neural network (ACNNC for short, from the article "A Convolutional Neural Network Combined with Attributed Scattering Centers for SAR ATR", IEEE Transactions on Geoscience and Remote Sensing, Zhou Y, 2021) and the multiscale SAR ATR convolutional neural network based on component analysis (CA-MCNN for short, from the article "Multiscale CNN Based on Component Analysis for SAR ATR", IEEE Transactions on Geoscience and Remote Sensing, Li Y, 2021).
TABLE 2 Detailed recognition results of different recognition methods on MSTAR three-category target data

| Method | Recognition accuracy |
| The method provided by the invention | 0.9890 |
| ACNNC | 0.9795 |
| CA-MCNN | 0.9861 |
Because SAR imagery suffers from insufficient training data, the small-sample problem is very prominent in SAR image recognition. To further verify the effectiveness of the invention, small-sample experiments were carried out on the MSTAR three-category target data shown in table 1: small-sample conditions were simulated by randomly selecting a given proportion of the training samples, and the average of 10 experiments was taken as the recognition result. The method is compared with several existing small-sample SAR target recognition methods: the angular rotation generative network for limited training data (ARGN for short, from the article "SAR Target Recognition With Limited Training Data Based on Angular Rotation Generative Network", IEEE Geoscience and Remote Sensing Letters, Sun Y, 2019), the improved polar mapping classifier (M-PMC for short, from the article "Modified Polar Mapping Classifier for SAR Automatic Target Recognition", IEEE Transactions on Aerospace and Electronic Systems, Park J, 2014), the convolutional neural network with data augmentation (DA-CNN for short, from the article "Convolutional Neural Network with Data Augmentation for SAR Target Recognition", IEEE Geoscience and Remote Sensing Letters, Ding J, 2016), and the all-convolutional network (A-ConvNet for short, from the article "Target Classification Using the Deep Convolutional Networks for SAR Images", IEEE Transactions on Geoscience and Remote Sensing, Chen S, 2016). The recognition results of these methods and the proposed method in the small-sample environment are shown in table 3.
TABLE 3 comparison of the identification performance of the inventive method with some prior methods in a small sample environment
The results in table 3 above were all obtained by randomly selecting training samples in the corresponding proportions, where the sample ratio denotes the ratio of the number of randomly selected samples to the number of all training samples. For each sample ratio, 10 experiments were performed and their average was taken as the final recognition result. It can be seen that when less than half of the training samples are used, the recognition accuracy of the method exceeds that of the other comparison methods, which effectively demonstrates the effectiveness of the model when training data are insufficient; when the sample ratio is 0.1, i.e., each class of training data has only 22 samples, the average recognition accuracy over the 1365 test samples reaches 89.46%.
The above experimental results show that, in the recognition experiments on the MSTAR three-category target data, the method obtains better recognition results both under the full-training-sample condition and under small-sample conditions with less than 50% of the training data. This verifies the validity of the method and shows that it can better exploit global and local information to obtain an effective and stable target feature representation, demonstrating its effectiveness and feasibility.
The foregoing is a further detailed description of the invention in connection with the preferred embodiments, and it is not intended that the invention be limited to the specific embodiments described. It will be apparent to those skilled in the art that several simple deductions or substitutions may be made without departing from the spirit of the invention, and these should be considered to be within the scope of the invention.