[go: up one dir, main page]

Academia.eduAcademia.edu
HYPERSPECTRAL AND MULTISPECTRAL DATA FUSION BASED ON NONLINEAR UNMIXING Naoto Yokoya,1 Jocelyn Chanussot,2 and Akira Iwasaki3 1 Department of Aeronautics and Astronautics, The University of Tokyo, Japan GIPSA-lab, Signal and Image Dept., Grenoble Institute of Technology, Grenoble, France 3 Research Center for Advanced Science and Technology, The University of Tokyo, Japan 2 ABSTRACT Data fusion of low spatial-resolution hyperspectral (HS) and high spatial-resolution multispectral (MS) images based on a linear mixing model (LMM) enables the production of high spatial-resolution HS data with small spectral distortion. This paper extends the LMM based HS-MS data fusion to nonlinear mixing model using a bilinear mixing model (BMM), which considers second scattering of photons between two distinct materials. A generalized bilinear model (GBM) is able to deal with the underlying assumptions in the BMM. The GBM is applied to HS-MS data fusion to produce high-quality fused data regarding multiple scattering effect. Semi-nonnegative matrix factorization (Semi-NMF), which can be easily incorporated with the existing LMM based fusion method, is introduced as a new optimization method for the GBM unmixing. Comparing with the LMM based HSMS data fusion, the proposed method showed better results on synthetic datasets. Index Terms— Data fusion, nonlinear unmixing, bilinear mixing model, semi-nonnegative matrix factorization 1. INTRODUCTION Hyperspectral (HS) imaging sensors generally have larger ground sampling distance (GSD) than multispectral (MS) imaging sensors. Data fusion of low spatial-resolution HS and high spatial-resolution MS images enables the production of high spatial-resolution HS data with small spectral distortion [1], [2]. The fused data is useful for accurate classification with fine spatial resolution and thereby enhance applications of HS remote sensing. Several HS-MS fusion methods are proposed using pan-sharpening techniques, stochastic method, and unmixing [1], [2]. A maximum a posteriori (MAP) estimation method was developed to enhance the spatial resolution of HS data using higher spatial-resolution data such as MS and panchromatic images [1]. This approach used a stochastic mixing model (SMM), which estimates the underlying spectral scene characteristics, in order to develop a cost function that optimizes the estimated HS data relative to the observed HS and MS data. An unmixing based fusion method, namely the ’coupled nonnegative matrix factorization’ (CNMF), was recently proposed to enhance spatial resolution of all HS bands. It is based on a linear mixing model (LMM) [2]. This algorithm extracts endmember spectra and high spatial-resolution abundance maps from two images by alternate nonnegative matrix factorization (NMF) [3]. Since CNMF can increase the number of endmembers compared with the MAP/SMM, it has a possibility to deal with spectrally more varied scenes. Many researchers have worked on spectral unmixing with the LMM that assumes that an observed spectrum is a linear combination of several endmember spectra. Although LMM based unmixing methods can obtain physically meaningful results, nonlinearity can appear in spectral mixing model [4], [5]. In recent years, nonlinear unmixing for HS images is receiving growing attention in remote sensing image interpretation. A bilinear mixing model (BMM), which considers second scattering of photons between two distinct materials, has been studied by several groups [6], [7]. A generalized bilinear model (GBM) introduces an effective mean to deal with the underlying assumptions in the BMM [7], [8]. In this work, the authors apply the GBM unmixing to HS-MS data fusion to improve the quality of the fused data introducing semi-nonnegative matrix factorization (Semi-NMF) [9] as a new optimization for the GBM unmixing. 2. GBM VIA SEMI-NMF The BMM considers second-order interactions between different D endmembers as additional terms in the LMM assuming that third or higher order interactions are negligible. In the BMM, the observed L-spectrum of a single pixel z ∈ RL×1 is given by: z = Ea + D−1 ∑ D ∑ i=1 j=i+1 bi,j ei ⊙ ej + n, (1) where E ∈ RL×D is the endmember matrix with the i th column vector, ei ∈ RL×1 representing the i th endmember spectrum, a ∈ RD×1 is the abundance vector, bi,j is the interaction abundance between the i th and j th endmembers, ⊙ is the Hadamard (element-wise product) operation, and n ∈ RL×1 is the additive noise. On the right side, the first term denotes the linear mixing and the second term represents the bilinear mixing. From a physical perspective, the GBM introduces the nonlinear mixing coefficient ci,j as bi,j = ci,j ai aj and assumes the following constraints: ai ≥ 0 ∀i ∈ 1, ..., D and D ∑ ai = 1, i=1 (2) 0 ≤ ci,j ≤ 1 ∀i ∈ 1, ..., D − 1 ∀j ∈ i + 1, ..., D. When the endmembers are known, the GBM unmixing turns to the optimization of ai and ci,j under the constraint of (2). Several optimization methods are proposed in [8]. The GBM method was applied to small images of synthetic and real HS data with three endmembers [7], [8]. When applied to larger images in an unsupervised manner with many endmembers, the optimization process becomes more challenging. The new optimization method based on semi-nonnegative matrix factorization (Semi-NMF) [9] is introduced to speed up the optimization process of a whole image in a matrix form, which can be easily incorporated with CNMF. The observed HS image can be reshaped as a matrix form Z ∈ RL×P with P representing the number of pixels. The BMM for the whole image is given in a matrix form by Z = EA + MB + N, (3) where A ∈ RD×P is the abundance matrix, M ∈ RL×D(D−1)/2 is the bilinear endmember matrix, B ∈ RD(D−1)/2×P is the interaction abundance matrix, and N ∈ RL×P is the noise matrix. The GBM unmixing becomes the ∑ minimization of D ∥Z − EA − MB∥2F , subject to A ⪰ 0 , i=1 Ai,k = 1 ∗ ∗ (∀k ∈ 1, ..., P ), 0 ⪯ B ⪯ A , where A(i,j),k = Ai,k Aj,k . By introducing Z1 = Z − MB and Z2 = Z − EA, (3) is written as follows Z1 = EA + N and Z2 = MB + N. (4) Owing to physical constraints, all components of E, M, A, and B are nonnegative. Therefore, minimization of ∥N∥2F in (4) can be solved by Semi-NMF that factorizes a nonrestricted matrix X into a non-restricted matrix F and a nonnegative matrix GT as X ≈ FGT [9]. Semi-NMF optimization is guaranteed to converge to a local optimum with the alternative update rules. With E given and M calculated from E, the GBM unmixing is solved by the following update rules for A and B: √ T T T T T + − − + (5) AT ←AT .∗ ((ZT 1 E) +A (E E) )./((Z1 E) +A (E E) ) √ T T T T T + − − + (6) BT ←BT .∗ ((ZT 2 M) +B (M M) )./((Z2 M) +B (M M) ) where .∗ and ./ denote elementwise multiplication and division. (C)+ and (C)− are the positive and negative parts of a matrix C defined as C+ = (|C| + C)/2, C− = (|C| − C)/2. The GBM unmixing via Semi-NMF is as follows. First, A is initialized by the fully constrained least square (FCLS) method [10], A∗ is calculated, and B is set as δ × A∗ with small value of δ. Next, A and B are alternately updated by (5) and (6). If any element of B exceeds that of A∗ , it is replaced by that of A∗ . To satisfy the abundance sum-to-one constraint, the method from [10] is adopted. 3. HS-MS DATA FUSION BASED ON GBM The aim of HS and MS data fusion is to estimate unobservable high spatial-resolution HS data (Z ∈ RL×P ) from observed low spatial-resolution HS data (X ∈ RL×Ph ) and high spatial-resolution MS data (Y ∈ RLm ×P ). Lm denotes the number of spectral channels of MS sensor. Ph denotes the number of pixels of HS images. Owing to the trade-off between spectral and spatial resolutions of two sensors, L > Lm and Ph < P are satisfied. The observed two data are assumed to be obtained under the same atmospheric and illumination conditions, and geometrically co-registered with radiometric correction. The HS and MS images can be considered as the degraded versions from the high spatial-resolution HS image in spatial and spectral domains, respectively. Therefore, X and Y are modeled as X = ZS + Ns , (7) Y = RZ + Nr . (8) Here, S ∈ RP ×Ph is the spatial spread transform matrix P ×1 h with each column vector {sl }P representing the l=1 ∈ R transform of the point spread function (PSF) from the MS image to the HS l th∑ pixel value. Each PSF is assumed to Lm Lm ×L be normalized, i.e., is the k=1 skl = 1. R ∈ R spectral response transform matrix with each row vector 1×L m {ri }L representing the transform of the spectral i=1 ∈ R response function (SRF) from the HS sensor to the MS i th band detector. Ns and Nr are the residuals. 3.1. The CNMF algorithm First, we summarize the CNMF method that was proposed for HS-MS data fusion based on the LMM [2]. The CNMF method is composed of alternate NMF unmixing for HS and MS images to extract high spectral-resolution endmember spectra and high spatial-resolution abundance maps. In the LMM, the spectrum at each pixel is assumed to be a linear combination of several endmember spectra, which is the simple version without the bilinear term in (3). Therefore, Z is formulated as Z ≈ EA. The spatially degraded abundance matrix Ah ∈ RD×Ph and the spectrally degraded endmember matrix Em ∈ RLm ×D are defined as Ah = AS and Em = RE, respectively. By substituting Z ≈ EA into (7) and (8), X and Y are approximated as X ≈ EAh and Y ≈ Em A, respectively. GBM for HS data (X) L X ≈ GBM of MS data (Y) P Lm Y D(D-1)/2 D Ph E R ≈ Em Ah M + Bilinear Interpolation A R + Mm Bh Bilinear Interpolation B Fig. 1. Illustration of HS-MS data fusion based on GBM. Owing to the non-negative characteristics of S, R, E, and A, all components of Ah and Em are also non-negative. Z can be estimated by unmixing X and Y to obtain E and A, using alternate NMF unmixing. The CNMF algorithm starts from the NMF unmixing of X to use its spectral advantage. E is initialized by the vertex component analysis (VCA) [11]. Next, X and Y are alternately unmixed by NMF using Ah = AS and Em = RE for the initialization of Ah and Em , respectively. Finally, the high spatial-resolution HS image is estimated by multiplication of E and A. More details concerning CNMF is given in [2]. 3.2. CNMF with GBM The GBM via Semi-NMF can be incorporated into the framework of the CNMF algorithm. Here, the authors define the spatially degraded interaction abundance matrix Bh ∈ RD×Ph and the spectrally degraded bilinear endmember matrix Mm ∈ RLm ×D as Bh = BS and Mm = RM, respectively. By substituting (3) into (7) and (8), X and Y can be approximated as X ≈ EAh + MBh , (9) Y ≈ Em A + Mm B. (10) First, E, Ah , Em , and A are initialized by CNMF. Next, Ah and Bh are estimated by the GBM unmixing of X via Semi-NMF. A and B are initialized by bilinear interpolation of Ah and Bh , respectively. Finally, A and B are optimized by the GBM unmixing of Y, and the fused image (Z) is obtained by Z ≈ EA + MB. The bilinear interpolation is empirically important for the initialization of A and B in the last GBM unmixing of MS image because the GBM unmixing via Semi-NMF converges to a local minimum. Fig. 1 shows the illustration of the proposed method (GBM-CNMF). 4. EXPERIMENTAL STUDY The proposed HS-MS data fusion technique is applied to synthetic datasets generated from the airborne visible infrared imaging spectrometer (AVIRIS) data. The image was taken over Cuprite, Nevada, in 1997 with 224 spectral bands in the 400-2500 nm region. The data initially measured as radiance was converted into reflectance. The 150x150 pixel-size image is selected with 189 bands eliminating noisy bands and the MS and low spatial-resolution HS datasets are generated by down-sampling the original HS data in the spectral and spatial domains, respectively. The spatial-resolution difference and SNR of two sensors are determined considering the specification of Hyperspectral Imager Suite (HISUI) [12], which is composed of HS and MS sensors and will be launched on the Japanese next generation earth observing satellite (Advanced Land Observing Satellite 3 (ALOS-3)) in 2015. The MS data is produced with an uniform SRF corresponding to Landsat TM bands 1-5 and 7. The low spatial-resolution HS data is generated by a Gaussian PSF with full width at half maximum, corresponding to six pixels in the original high spatialresolution HS image, which makes a six times difference in the spatial resolution between two sensors. Therefore, R and S are given as sparse matrices with each row vector corresponding to a uniform SRF and with each column vector representing a Gaussian PSF, respectively. In addition, Gaussian noise was added to the two data assuming that the signal-tonoise ratios (SNR) of HS and MS sensors are 300 and 200, respectively. The performance of HS and MS data fusion was evaluated by comparing the estimated high spatial-resolution HS data with the original data with two criteria: the spatial quality of each spectral band image and the spectral quality of each spectrum for a single pixel. To evaluate the spatial reconstruction quality, the peak signal-to-noise ratio (PSNR) is adopted. To evaluate the spectral reconstruction quality, the authors used the spectral angle error (SAE) between the estimated spectrum and the actual spectrum. Fig. 2 shows the low spatial-resolution HS image and the fused image obtained by GBM-CNMF. The proposed method is evaluated by comparing with CNMF. Fig. 3 shows the average values of PSNR and SAE, and computational time with a varying number of endmembers (D). The performances of two methods converge around D=20 and GBM-CNMF shows better results in the two criteria with all numbers of endmembers. This result proves that the nonlinear mixing effect of multiple reflection is contained in the original HS image and that the GBM unmixing via Semi-NMF works well for both HS and MS images reducing unmixing residual errors. Although there is an obvious quality improvement of GBMCNMF from CNMF, computational time becomes large. For practical use, regarding the computational time, the data fusion process should be processed with partial images using parallel computing. Fig. 4 shows the PSNRs for all bands and the histogram of SAE with SAE maps with D=10. GBMCNMF improves the PSNRs of the fused data in near infrared and short wavelength infrared spectral ranges. It indicates that the GBM unmixing mainly deal with multiple scattering effects in these spectral ranges for this scene. The histogram of Multispectral bands (b) 10 (a) 55 Number of Pixels PSNR (dB) 50 45 40 35 30 25 Fig. 2. Low spatial-resolution HS image (left) and GBMCNMF fused image (right). (a) 44.5 (b) 44 1.05 SAE (deg) PSNR (dB) 42.5 42 41.5 41 GBM-CNMF CNMF 40.5 10 15 20 25 30 35 0.85 0.7 5 10 15 20 25 30 35 40 Number of endmembers 10 5 Computational Time (sec) CNMF 10 1 0 1000 1500 2000 Wavelength (nm) 2500 10 0 0 1 2 3 4 5 SAE (deg) 6. REFERENCES Number of endmembers (c) 500 0.9 0.8 40 10 2 GBM-CNMF CNMF 0.95 0.75 40 39.5 5 4 10 3 GBM-CNMF CNMF 1 43 GBM-CNMF GBM-CNMF CNMF Fig. 4. Comparisons of (a) PSNRs and (b) histogram of SAE with SAE distribution maps between GBM and LMM. 1.1 43.5 4 [1] M. T. Eismann and R. C. Hardie, “Hyperspectral resolution enhancement using high-resolution multispectral imagery with arbitrary response functions,” IEEE Trans. Geosci. Remote Sens., vol. 43, no. 3, pp. 455–465, Mar. 2005. [2] N. Yokoya, T. Yairi, and A. Iwasaki, “Coupled nonnegative matrix factorization unmixing for hyperspectral and multispectral data fusion,” IEEE Trans. Geosci. Remote Sens., vol. 50, no. 2, pp. 528–537, Feb. 2012. GBM-CNMF CNMF 10 4 10 3 [3] D. D. Lee and H. S. Seung, “Learning the parts of objects by nonnegative matrix factorization,” Nature, vol. 401, pp. 788-791, Oct. 1999. 10 2 10 1 5 10 15 20 25 30 35 40 Number of endmembers Fig. 3. Effect of the number of endmembers on (a) PSNR, (b) SAE, and (c) computational time. SAE and SAE maps show the spectral quality improvement of GBM-CNMF. The areas where there remain some amount of SAE have a possibility to contain the other nonlinear mixing effect. 5. CONCLUSION In this work, a new HS-MS data fusion method based on the GBM unmixing is proposed. A semi-NMF based optimization method is introduced for the GBM unmixing and incorporated into the HS-MS data fusion algorithm based on linear unmixing (CNMF) to consider the nonlinear mixing caused by multiple scattering. The experiments with synthetic HSMS datasets generated from AVIRIS image show that the proposed HS-MS data fusion method based on the nonlinear unmixing outperforms the CNMF method. The GBM unmixing works well for both HS and MS images with better unmixing residual errors and thereby improves the quality of the HS-MS fused data. Future works include further experiments for various kinds of scenes and the improvement of computational cost using parallel computing. [4] D.A. Roberts, M.O. Smith, and J.B. Adams, “Green vegetation, nonphotosynthetic vegetation, and soils in AVIRIS data,” Remote Sens. Environ., vol. 44, pp. 117–126, 1993. [5] L. Zhang, D. Li, Q. Tong, and L. Zheng, “Study of the spectral mixture model of soil and vegetation in PoYang Lake area, China,” Int. J. Remote Sens., vol. 19, no. 11, pp. 2077–2084, 1998. [6] W. Fan, B. Hu, J. Miller, and M. Li, “Comparative study between a new nonlinear model and common linear model for analysing laboratory simulated-forest hyperspectral data,” Int. J. Remote Sens., vol. 30, no. 11, pp. 2951–2962, Jun. 2009. [7] A. Halimi, Y. Altmann, N. Dobigeon, and J. Tourneret, “Nonlinear unmixing of hyperspectral images using a generalized bilinear model,” IEEE Trans. Geosci. Remote Sens., vol. 49, no. 11, pp. 4153–4162, Nov. 2011. [8] A. Halimi, Y. Altmann, N. Dobigeon, and J. Tourneret, “Unmixing hyperspectral images using the generalized bilinear model,” in Proc. IEEE IGARSS, Vancouver, Canada, 2011, pp. 519–522. [9] C. Ding, T. Li, and M. I. Jordan, “Convex and semi-nonnegative matrix factorization,” IEEE Trans. Patten Anal. Mach. Intell., vol. 32, no. 1, pp. 45–55, 2010. [10] D. C. Heinz and C.-I. Chang, “Fully constrained least squares linear spectral mixture analysis method for material quantification in hyperspectral imagery,” IEEE Trans. Geosci. Remote Sens., vol. 39, no. 3, pp. 529–545, Mar. 2001. [11] J. M. Nascimento and J. M. Bioucas-Dias, “Vertex component analysis: A fast algorithm to unmix hyperspectral data,” IEEE Trans. Geosci. Remote Sens., vol. 43, no. 4, pp. 898–910, Apr. 2005. [12] N. Ohgi, A. Iwasaki, T. Kawashima, and H. Inada, “Japanese hypermulti spectral mission,” in Proc. IEEE IGARSS, Hawaii, USA, 2010, pp. 3756-3759.