[go: up one dir, main page]

Next Article in Journal
Block Sparse Compressed Sensing of Electroencephalogram (EEG) Signals by Exploiting Linear and Non-Linear Dependencies
Previous Article in Journal
Vision-Based Pose Estimation for Robot-Mediated Hand Telerehabilitation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Improved STARFM with Help of an Unmixing-Based Method to Generate High Spatial and Temporal Resolution Remote Sensing Data in Complex Heterogeneous Regions

1
State Key Laboratory of Earth Surface Processes and Resource Ecology, Beijing Normal University, Beijing 100875, China
2
College of Resources Science and Technology, Beijing Normal University, Beijing 100875, China
*
Author to whom correspondence should be addressed.
Sensors 2016, 16(2), 207; https://doi.org/10.3390/s16020207
Submission received: 27 October 2015 / Revised: 31 January 2016 / Accepted: 2 February 2016 / Published: 5 February 2016
(This article belongs to the Section Remote Sensors)
Figure 1
<p>Flowchart of the USTARFM algorithm.</p> ">
Figure 2
<p>The location of study area.</p> ">
Figure 3
<p>Scatterplots of the real reflectance and the predicted product produced by the three algorithms for the green, red and NIR-infrared bands.</p> ">
Figure 4
<p>The distribution of different heterogeneity levels in the study area. (<b>a</b>) class types within a grid (MODIS pixel scale), (<b>b</b>) the heterogeneity levels of MODIS pixels.</p> ">
Figure 5
<p><span class="html-italic">γ</span> and RMSE of the three methods at different heterogeneity levels.</p> ">
Figure 6
<p>Scatterplots of three bands generated by the three methods in the region with the heterogeneity level of 1.</p> ">
Figure 7
<p>The comparison of real and fusion images produced by the three algorithms (NIR-red-green combination). (<b>a</b>) shows the reference Landsat 8 images observed on 4 September 2014; (<b>b</b>–<b>d</b>) are the prediction images by USTARFM, STARFM, and unmixing-based method, respectively; (<b>e</b>–<b>h</b>) indicate the enlarged subset images of (<b>a</b>–<b>d</b>), respectively.</p> ">
Figure 8
<p>Landsat images collected 26 February 2002 (1) and May 17 (2) and their enlarged sub-images (<b>a1</b>) and (<b>a2</b>); and the corresponding prediction sub-images by USTARFM (<b>b1,</b> <b>b2</b>), STARFM (<b>c1,</b> <b>c2</b>), and unmixing-based method (<b>d1,</b> <b>d2</b>).</p> ">
Versions Notes

Abstract

:
Remote sensing technology plays an important role in monitoring rapid changes of the Earth's surface. However, sensors that can simultaneously provide satellite images with both high temporal and spatial resolution haven’t been designed yet. This paper proposes an improved spatial and temporal adaptive reflectance fusion model (STARFM) with the help of an Unmixing-based method (USTARFM) to generate the high spatial and temporal data needed for the study of heterogeneous areas. The results showed that the USTARFM had higher accuracy than STARFM methods in two aspects of analysis: individual bands and of heterogeneity analysis. Taking the predicted NIR band as an example, the correlation coefficients (r) for the USTARFM, STARFM and unmixing methods were 0.96, 0.95, 0.90, respectively (p-value < 0.001); Root Mean Square Error (RMSE) values were 0.0245, 0.0300, 0.0401, respectively; and ERGAS values were 0.5416, 0.6507, 0.8737, respectively. The USTARM showed consistently higher performance than STARM when the degree of heterogeneity ranged from 2 to 10, highlighting that the use of this method provides the capacity to solve the data fusion problems faced when using STARFM. Additionally, the USTARFM method could help researchers achieve better performance than STARFM at a smaller window size from its heterogeneous land surface quantitative representation.

1. Introduction

High spatial and temporal resolution remote sensing technology plays an important role in land-cover detection, crop growth monitoring and phenological parameter inversion [1]. Unfortunately, it is impossible to obtain high temporal resolution and high spatial resolution images simultaneously from one sensor mounted on a satellite [2,3]. For example, Landsat series multi-spectral images at 30-m resolution have wide applications in extracting vegetation indices, monitoring land cover dynamic changes and ecological system variation studies. This system is widely used because of its finer spatial resolution, rich archive, and free availability [4,5,6]. However, the 16-day revisit cycle and the influence of bad weather, such as rain and clouds, make it difficult to acquire the continuous and cloudless remote sensing images that may be required to monitor certain Earth surface changes [7,8]. The Moderate Resolution Imaging Spectroradiometer (MODIS) onboard the Terra/Aqua satellites can provide remote sensing images at 1-day temporal, 250–1000 m spatial resolution, which is a potential alternative used in monitoring the Earth surface land coverage at large scales [9,10,11]. However, the low spatial resolution of MODIS data confines its application in fragmented and extremely heterogeneous landscapes [12]. If high spatial resolution and temporal resolution characteristics could be achieved at the same time, the advantages of Landsat and MODIS would be integrated and significantly improve the applicability of remote sensing technology to monitor land surface changes [13,14].
In recent years, researchers have developed a series of data fusion models to generate high spatial and temporal resolution remote sensing data. The unmixing-based method is one of these methods, which utilizes the Linear Spectral Mixture (LSM) model to extract endmembers reflectance at sub-pixel scale [15]. This method has previously been applied to fuse MODIS and Landsat images to produce high temporal and spatial resolution data [16,17,18,19,20,21,22]. Liu et al. [23] utilized the model to unmix Thermal Infrared Radiance for subpixel land surface temperature retrieval, and Wu et al. [24] proposed the MSTDFA model based on the LSM theory to generate daily synthetic Landsat imagery by combining Landsat and MODIS data. The Spatial and Temporal Adaptive Reflectance Fusion Model (STARFM) proposed by Gao et al. [25] is another effective fusion method to predict the phenology reflectance changes. Xu et al. [26] proposed a regularized spatial unmixing (RSpatialU)-based method, which introduced the prior class spectra estimated by STARFM to reduce the unmixing method error among the mixed pixels. The combination of the unmixed-based model and STARFM can utilize the advantages of the two modes to produce more accurate fusion data. Inspired by STARFM, Gevaert et al. [22] proposed a method entitled Spatial and Temporal Reflectance Unmixing Model (STRUM) which incorporates STARFM and unmixing-based fusion methods by combining the temporal change information obtained from the residue of two date MODIS images to improve the accuracy of unmixing-based method based on Bayesian theory. The fusion data generated by these methods are applicable to research in crop monitoring [20], environmental process detection [27], and mapping of forest disturbance [28], etc.
STARFM originated from the idea of capturing quantitative changes in radiometry caused by phenology through fusion of Landsat and MODIS data and has been widely applied for land surface detection. The method is applied to predict Landsat-like daily surface reflectance with one or several pairs of Landsat and MODIS images acquired on the same day and MODIS observations from the predicted date by weighing similar pixels for the central pixel [25]. The STARFM considers both the discrepancy of space and spectrum reflectance difference from multi-temporal MODIS images and has been widely introduced to produce high spatial-temporal data for detecting gradual changes over large areas [3,25,28,29]. However, several limitations for STARFM should be noted: it is incapable of estimating the transient or mutated surface changed information which is not captured by the base Landsat images [25,28,30]. STARFM also depends on temporal information from pure homogenous patches of land cover at the MODIS pixel scale and the predicted results can be misleading when used in heterogeneous landscapes, including cases of small-scale agriculture [25,28]. Many improvements of data spatial-temporal fusion models were conducted for STARFM. The Spatial Temporal Adaptive Algorithm for Mapping Reflectance Change (STAARCH), developed by Hilker et al. [28], was designed to detect the date on which land-cover change occurs and to record this information in a Landsat image to improve the final predicted result of the original STARFM approach [26,30]. For the latter limitation of STARFM, it is more difficult to find the pure MODIS pixels for central pixels in the fragmented area due to low resolution of MODIS when there are multiple land covers within one pixel. Zhu et al. [30] also attempted to develop an Enhanced STARFM method (ESTARFM) for heterogeneous landscapes regions, which introduces a conversion coefficient to succeed in enhancing the performance of prediction for heterogeneous landscapes. However, the precondition of at least two pairs of fine and coarse spatial resolution images on the same day for this method increases the difficulty of data acquisition and limits its applicability [30].
To resolve the difficulties that STARFM presents from the mixed pixel of MODIS in heterogeneous regions, we developed an improved STARFM named Unmixing-based STARFM (USTARFM) with the help of an Unmixing-based algorithm. The main objectives of this study include: (1) improve the performance of STARFM in heterogeneous areas, through unmixing the coarse pixel to gain specific land cover reflectance as the basis for further fusion rather than directed resampled data; (2) compare the fusion results of USTARFM and STARFM at different window sizes, and (3) assess the influence of landscape heterogeneity on the fusion performance of USTARFM. The USTARFM approach was applied to generate Landsat-like resolution and MODIS-like frequency images, and was tested using the Landsat 8/7 and MODIS reflectance as the reference data in two study areas.

2. Description of USTARFM

The coarse image on date t0 (predicted date) and a pair of fine-resolution and coarse-resolution remote sensing images on the same date tk (basis date) are needed for the USTARFM algorithm. Incorporating the clusters obtained from the fine resolution image on the tk as input component definition, the coarse resolution images (t0 and tk) were unmixed to get the land cover cluster reflectance. Unmixing data was used in place of the directly resampled data for the STARFM. The subsequent steps, like the STARFM, were run to predict the Landsat-like image. A sketch map of the USTARFM algorithm is shown in Figure 1, and the detailed implementation steps are listed as follows.

2.1. Land Cover Cluster and Abundance Extraction

The class types and related abundance within the MODIS mixed pixel are the two fundamental parameters for unmixing mixed pixels. The ISODATA algorithm has some further refinements by splitting and merging of clusters. Clusters are merged if either the number of members (pixel) in a cluster is less than a certain threshold or if the centers of two clusters are closer than a certain threshold. The ISODATA method has been used in many previous studies [1,18,19,20,21]. In our study, land cover was clustered using this method. The abundance of each class within MODIS pixel was calculated by summing the number of pixels occupied by each class and then we compared that to the total pixel number of coarse pixels [31,32]. The calculated abundance of each class and the class types were assumed constant during the prediction period [18,19,31], which is the precondition to downscaling the low resolution images at different dates (i.e., tk and t0).

2.2. Unmixing Data

Mixed pixel decomposition based on the linear spectral mixture model is a popular method for estimating land cover fraction [15]. The surface reflectance of mixed pixels can be calculated as the sum of mean reflectance values of the different land-cover classes within the pixel, weighted by the corresponding abundance [33]. The abundance of classes and surface reflectance within coarse pixels (usually in window size) as input parameters can be solved with the ordinary least squares technique, producing the mean surface reflectance value ( r ¯ ) for each land cover component as shown in Equation (1). Then mean reflectance values are assigned to each fine-resolution pixel of the classification map according to the class labels.
The spectral unmixing model is sensitive to the co-linearity caused by high correlations between endmembers, leading to inversion of ill-posed matrices [34]. This problem could be decreased by increasing the number of equations through enlarging the search window to adopt more classes, i.e., increasing n in Equation (1) [22]. Furthermore, due to the existence of spatial heterogeneity, the solution of Equation (1) should also be conducted in a rectangular window, which can preserve spatial heterogeneity to the extent [31]. The oversize window scale containing too many pixels to make the model weakens spatial variation and leads to degradation in the accuracy of mixed pixel decomposition [1,31]. Hence, an appropriate window size is crucial to the decision of the coarse pixels decomposition. A set of different window sizes were tested to determine the optimal window size, and the optimal window size was measured with three metrics: the correlation coefficient ( γ ), Root Mean Square Error (RMSE), and the Erreur Relative Globale Adimensionalle de Synthèse (ERGAS) between the synthetic image and the reference fine-resolution image [1,31,35]:
[ R ( 1 , t ) R ( i , t ) R ( n , t ) ] = [ f c ( 1 , 1 ) f c ( 1 , c ) f c ( 1 , k ) . . . f c ( i , 1 ) f c ( i , c ) f c ( i , k ) .. . . . f c ( n , 1 ) f c ( n , c ) f c ( n , k ) ] [ r ¯ ( 1 , t ) r ¯ ( c , t ) r ¯ ( k , t ) ] + [ ξ ( 1 , t ) ξ ( c , t ) ξ ( k , t ) ] constraints : c = 0 k f c ( i , c ) = 1 ;   0 < r ¯ ( c , t ) < 1
where R ( i , t ) is the reflectance of coarse-pixel i at time t, fc(i,c) is the abundance of class c in the ith coarse pixel, r ¯ ( c , t ) is the mean reflectance of class c at time t, ξ ( i , t ) is the residual error term, and k is the number of classes, n is the total coarse pixels number within the predefined window.
Apparently, the class number and the window size in Equation (1) are key parameters for the unmixing-based method. Therefore, by varying the class number (k) and the sliding window size (W), a series of unmixing data can be obtained and assessed by the metrics of quantative assessment indices. The optimal combination (k,W) of class number and window size was finally determined to unmix the MODIS scale resolution images [1].

2.3. Fused Image Generation

The two unmixed images from MODIS on date tk and t0, and one fine-resolution image on date tk are applied to predict the fine-resolution image on date t0. The critical step to implement the USTARFM algorithm, as well as the STARFM, is weighting the spatial information from neighboring pixels which are used to estimate reflectance of the central pixel and allow the weight function to be flexibly adjusted according to land surface complexity and heterogeneity [25]. To ensure that suitable information is acquired from neighboring pixels, similar spectral pixels from fine-resolution images within the moving window are involved to compute the reflectance of the central pixel. Then these similar pixels are weighed and used to calculate the reflectance of the central pixel as shown in Equation (3).
W i j k = ( 1 S i j k × T i j k × D i j k ) / i = 1 w j = 1 w k = 1 n ( 1 S i j k × T i j k × D i j k )
L ( x w / 2 , y w / 2 , t 0 ) = i = 1 w j = 1 w k = 1 n W i j k × ( M ( x i , y j , t 0 ) + L ( x i , y j , t k ) M ( x i , y j , t k ) )
where L(xw/2,yw/2,t0) represents the central pixel estimated reflectance of the moving window w at Landsat-like scale, L(xi,yj,tk) and M(xi,yj,tk) are the given location ( xi,yj ) pixel surface reflectance of the fine resolution image and unmixing data at the base date tk and M(xi,yj,t0) is the value of the unmixing data of MODIS on the prediction date t0. The Wijk refers to the weight of similar pixel (xi,yj,tk) , which is determined by three indices: spectral difference (Sijk), temporal difference (Tijk) and spatial distance between the central pixel and the similar pixel (Dijk).
A smaller value of Sijk implies that the fine-resolution pixel has spectral features that are similar to the coarse pixel, and thus, pixel reflectance should be assigned a higher weight in Equation (2). A smaller Tijk means there is less change in the surface reflectance between tk and t0; thus, the pixel should be assigned with a higher weight. A smaller Dijk means there is a closer distance to the central pixel, and therefore a higher weight for the pixel should be assigned.
For a more detailed description of the above algorithm readers may refer to [25].
Figure 1. Flowchart of the USTARFM algorithm.
Figure 1. Flowchart of the USTARFM algorithm.
Sensors 16 00207 g001

3. Algorithm Test

3.1. Test Data and Preprocessing

The study region is located in the territory of Hengshui, Hebei Province in China (37.52°–37.73° N, 115.44°–115.71° E) (Figure 2), and covers approximately 25 km × 25 km. This region is dominated by fragmented cultivated land as shown in Figure 2a that is mixed with small areas of urbanized land (Figure 2b) and water bodies (Figure 2c). Broken parcels of cultivated land in this area are a typical feature because small land parcels are the basic management unit for farmers in China’s agricultural policy. These characteristics make this area a good place to test the applicability of USTARFM to identify fragmented agricultural units in the landscape.
Figure 2. The location of study area.
Figure 2. The location of study area.
Sensors 16 00207 g002
The study dataset includes two Landsat 8 OLI images and two date 500 m daily MODIS surface reflectance products (MOD09GA) acquired on 19 August and 4 September 2014 (Table 1), which were downloaded from the USGS website (URL = http://earthexplorer.usgs.gov/).
Table 1. The main characteristics of Landsat 8 and MODIS data.
Table 1. The main characteristics of Landsat 8 and MODIS data.
DataAcquisition Date(Path/Row)Data Usage
Landsat 8 (OLI)8/19/2014123/034Classification and similar pixels selection ( t k )
9/4/2014Accuracy assessment ( t 0 )
MOD09GA8/19/2014h27/v05Unmixing data acquisition
9/4/2014
Images from these dates are of good quality and represent clear sky conditions. The Landsat 8 data products have been processed in geometric correction based on the terrain data. The main pre-processing included two steps: radiance calibration and atmospheric correction, which were conducted using FLAASH tools [36], then capable of comparing with the corresponding band of MODIS [26]. MOD09GA is the standard surface reflectance and provides MODIS band 1-7 daily surface reflectance at 500 m resolution accompanied with high-precision geolocation (approximately 50 m at NADIR) [1]. These MODIS images were re-projected from the native SIN projection to a UTM-WGS84 geospatial system and sampled by means of the MRT (MODIS Re-projection Tool). Similar to previous research, no further geometric correction was needed because the MODIS data were assumed to be well co-registered and matched the geographical position with the Landsat 8 data [1]. Additionally, compared to the models of bilinear filtering and cubic convolution, nearest neighbor resampling is an effective method to inherit the maximum original remote sensing spectral value. The MODIS at 500 m resolution were resampled into 480 m in order to match 16 times of 30 m of Landsat images, which facilitated the subsequent decomposition of MOD09GA pixels [1,31]. The green, red, and near-infrared bands, like [1,30], were chosen to test USTARFM performance.

3.2. Implementation Procedure

The ISODATA algorithm was applied to cluster a Landsat image collected August 19, generating land cover component maps with class number (k) as [5,10,15,20,25,30]. Two kinds of window scales which have pronounced impacts on the performance of the USTARFM method are concerned: the optimized window (W, MODIS pixel) for unmixing MODIS spectrum and the window (w, Landsat pixel) of searching the similar pixels for STARFM and USTARFM. For the W, a series of window size scales, [5 × 5, 7 × 7, 11 × 11, 15 × 15, 21 × 21, 31 × 31, 41 × 41], are used to unmix the MODIS data acquired on Aug. 19 at different class numbers, respectively. Next, the correlation coefficient (γ), RMSE and ERGAS were iteratively tested to unmix MODIS images to achieve the best results. This was completed as the popular benchmark to measure the difference between the reference Landsat 8 image and the unmixing data at different window sizes (W) and class number (k), and the optimal combination of the two parameters. An assumption considered here was that the relationship between land surface and landscape derived from the Landsat data are stable under the condition of the same area and same sensor [31]. So, the optimal window size determined from the 19 August 2014 images was inherited and further used to unmix the MODIS data acquired on 4 September 2014. The unmixing data for 19 August 19 and 4 September 2014 and the base date Landsat 8 data for 19 August 2014 were used together to predict the Landsat-like images on 4 September 2014 by weighting the similar pixels within a certain window size. The window sizes w (Landsat OLI pixel) were set as [7 × 7, 11 × 11, 31 × 31, 61 × 61, 101 × 101, 151 × 151].
The performance of the synthetic images were assessed using the correlation coefficient (γ) and p-value, RMSE, and ERGAS metrics [20,22]. The γ indicates the linear correlativity between fusion data and the reference data, and higher γ values represent better correlation between synthetic images and reference images; lower RMSE values show the lower fusion errors of synthetic images. In addition, ERGAS is a common comprehensive evaluation index for assessing the quality of a synthetic image which is able to quantity the spatial similarity degree of the fusion image and reference image [18,37]. These assessment metrics have different focus to show the performance of the synthetic images as shown in Equations (4)–(6):
γ = j = 1 N ( x j x ¯ ) ( y j y ¯ ) j = 1 N ( x j x ¯ ) 2 . j = 1 N ( y j y ¯ ) 2
RMSE = j = 1 N ( x j y j ) 2 N
where N refers to the pixels number of fine-resolution image; xj and yj refer to the jth pixel values of the synthetic and reference images, respectively; and x ¯ and y ¯ are the mean value of synthetic data and reference data, respectively. ERGAS is calculated as:
ERGAS = 100 H L 1 N b a n i = 1 N b a n ( R M S E i / M i ) 2
where H is the fine-resolution pixel size, such as 30 m; L is the coarse-resolution pixel size (480 m); Nban refers to the band number; RMSE refers to the ith band between the synthetic and reference data; and Mi refers to the mean spectrum value of reference data in ith band.

4. Results and Discussion

4.1. Algorithm Performance Analysis Influenced by W, k and w

We tested whether USTARFM is an improved STARFM algorithm and compared the results with STARFM to determine whether improvements could be obtained from this algorithm.
Table 2. The accuracy of MODIS unmixing data on 19 August and 4 September 2014 at different combination of window size scales W and class number k.
Table 2. The accuracy of MODIS unmixing data on 19 August and 4 September 2014 at different combination of window size scales W and class number k.
DateWkγRMSEERGAS
GreenRedNIRGreenRedNIRGreenRedNIR
8/19/2014750.690.740.920.02040.02330.03782.04672.64940.7457
100.640.700.860.02180.02540.05112.18182.87781.0087
150.490.550.760.02800.03390.07192.79953.85191.4185
200.490.550.700.02740.03430.08502.74523.89681.6775
250.330.410.440.03930.04690.16823.92985.32303.3205
300.270.360.370.04550.05340.21174.55256.05754.1790
1150.750.800.940.01920.02120.03341.92562.40740.6586
100.780.820.900.01860.02050.04361.85942.32300.8598
150.660.700.880.02160.02550.04802.15872.89780.9484
200.570.610.740.02440.03010.07792.44403.41971.5382
250.490.550.470.02830.03450.15862.82973.91003.1305
300.440.500.410.03080.03820.19503.08074.33643.8494
1550.760.810.950.01910.02100.03211.91262.38590.6342
100.820.860.920.01790.01890.03991.79082.14730.7884
150.740.780.920.01950.02180.03951.94892.47080.7790
200.680.730.750.02080.02420.07552.08312.74721.4906
250.610.670.470.02300.02740.15962.30253.11423.1503
300.560.610.400.02500.03090.19802.50073.50343.9086
2150.770.810.950.01900.02100.03131.90472.38050.6171
100.850.880.920.01740.01790.03871.74262.03010.7636
150.800.850.940.01810.01890.03381.80752.14430.6671
200.760.810.750.01890.02090.07621.89152.36971.5033
250.720.770.470.01990.02260.17341.99722.56793.4237
300.680.710.390.02110.02510.20742.11182.85304.0942
3150.770.810.950.01900.02120.02971.90212.37490.5360
100.860.900.920.01710.01730.03911.71071.94020.7714
150.810.870.950.01780.01800.03051.77962.02280.6023
200.830.880.700.01730.01770.08771.73681.98831.7310
250.780.840.430.01850.01940.18571.84862.17573.6662
300.780.820.370.01850.02020.22541.85402.26644.4490
4150.750.800.950.01930.02140.03001.93602.43140.5446
100.850.890.920.01730.01740.03871.73101.97550.7643
150.810.880.950.01780.01780.04011.78522.02060.5926
200.850.890.700.01730.01740.09111.79651.97871.7981
250.800.870.430.01800.01840.19561.80472.08343.8616
300.820.860.340.01760.01860.27041.75982.10795.3374
9/4/201431100.820.860.900.01820.02220.04011.80522.30200.8737
Note: Underlined bold values indicate the best value to determine the optimal window size (p-value < 0.001).
The unmixed image as traditional Unmixing model is also produced during the USTARFM process, so we put the three fusion methods together to achieve a more comprehensive understanding of the synthetic image performance. Table 2 shows the performances of the three fused methods with varying W and k settings using the metrics of γ, RMSE and ERGAS as benchmarks to determine the optimal combination of window size and class number for unmixing MODIS images (the best options are underlined and bold in Table 2). For the green and red bands, the optimal combination of window size (W) and class number is 31 × 31 and 10, and for NIR, the window size (W) and class number are 31 × 31 and 5, respectively. Considering that the metrics for the NIR band are similar to the green and red band, and more classes are feasible to guarantee class features uniform, we chose the window size of 31 × 31 MODIS pixels and the class number of 10 as the optimal combination for further unmixing MODIS data acquired Sept. 4.
From the above analysis, the W and k were determined. The next step was to focus on how to calculate w for USTARFM. Table 3 shows the performance of the USTARFM and STARFM methods at different window sizes (w) to search the similar pixels for the central pixel. The USTARFM algorithm achieves the optimal effect when the window size w is 11 × 11 OLI pixels for red and green bands, and 31 × 31 OLI pixels for the NIR band. Compared to 31 × 31 for the STARFM, the difference of optimal window size between NIR and visible bands may be caused by their reflectance characteristics which affect the similar pixels search efficiency. Relative to NIR, the green and red are the short-wave bands, which are sensitive to weather related factors, such as haze.
Though the atmosphere process has been applied for MODIS data, the atmosphere effects cannot be entirely removed [38]. For the other two bands, the indices for the three methods have the same trend as the green band. Table 3 also illustrates that the best window size of USTARFM is smaller than that of STARFM, and at the same window size, the three indices of USTARFM are also better than those of STARFM. This is mainly because the unmixing data used in USTARFM is efficient to reflect the surface reflectance comparing to the directed resampled data used in STARFM.
Table 3. The accuracy of the STARFM and USTARFM at different window size scales.
Table 3. The accuracy of the STARFM and USTARFM at different window size scales.
MethodWindow Size w n × n OLI PixelsγRMSEERGAS
GreenRedNIRGreenRedNIRGreenRedNIR
STARFM70.88220.89260.94160.01300.01720.03731.28231.80610.8088
110.88800.89870.93940.01300.01720.03511.28651.81290.7611
310.88950.90000.94890.01270.01700.03001.25591.79160.6507
610.88440.89480.94900.01310.01760.03021.29681.84980.6553
1010.88040.89310.94740.01330.01770.03091.31171.86730.6702
1510.87920.89210.94750.01330.01790.03101.31471.87820.6735
USTARFM70.91160.92260.96000.01180.01510.02601.16781.58760.5654
110.91290.92290.96310.01160.01510.02491.15021.58500.5416
310.91210.91920.96500.01170.01540.02451.15501.62240.5317
610.91060.91710.96500.01170.01560.02451.15641.64370.5326
1010.90940.91580.96500.01170.01580.02461.16151.65720.5334
1510.90830.91450.96500.01180.01590.02461.16711.67000.5341
Note: Underlined bold values indicate the best value to determine the optimal window size (p-value < 0.001).

4.2. Accuracy Assessment Under the Best Parameters Setting

For the green band at the optimal window size (W = 31 × 31), the γ = 0.91 of the USTARFM method is higher than those of the STARFM (0.89) and the unmixing method (0.82); meanwhile, USTARFM shows the lowest error (RMSE = 0.0116) compared to the prediction of the STARFM (RMSE = 0.0127) and the unmixing method (RMSE = 0.0182); ERGAS shows the same trend (1.1502 vs. 1.2559 vs. 1.8052, respectively).
Figure 3 shows the scatterplots between reflectance estimate from USTARFM, STARFM and unmixing based method, with the optimized parameter setting, (W, k) and w, and the corresponding reference bands of Landsat 8 image. USTARFM inspired from STARFM improves the performance of central pixel reflectance estimation by weighting the neighboring similar pixels within the moving window, which reflected the spatial variability well, producing scatter point distributions close to 1:1. The γ of each band of images generated by USTARFM and STARFM is superior to that of the unmixing method, by about 0.02. The reason is that the reflectance calculated by the unmixing method is the mean reflectance of each clustered class within the window, which blurs the spatial heterogeneity and makes the scatterplot appear in “stripe” pattern (Figure 3g–i). Generally, the images at Landsat-like image predicted by the USTARFM are more accurate than those derived from the STARFM and the unmixing methods.
Figure 3. Scatterplots of the real reflectance and the predicted product produced by the three algorithms for the green, red and NIR-infrared bands.
Figure 3. Scatterplots of the real reflectance and the predicted product produced by the three algorithms for the green, red and NIR-infrared bands.
Sensors 16 00207 g003

4.3. Landscape Heterogeneity Impact on USTARFM Performance

The heterogeneity over the regions where complex mixtures of land-cover type commonly exist represents a big challenge for STAFRM to identify pure pixels [25,30]. To analyze the performance of the USTARFM algorithm in a complex landscape, the MODIS pixel scale, 480 m × 480 m, was defined as a grid to divide the study area, and each grid was marked according to its heterogeneous degree. Here, we developed the degree of heterogeneity (h) metric using the clustered of thematic map from the Landsat 8 image as described in Equation (7). This idea is based on the quantity of land cover classes in one grid. More than one class type in one grid implies a higher degree of heterogeneity . If (Nc > 0) (Nc is the pixel number of class c) is a determinant function to judge whether class c appears in the grid. P is marked as 1 if Nc is more than 1:
h = c = 1 N u m ( P , i f ( N c > 0 ) )
Num is the total class in the thematic map, here maximum number of the land cover type is 10. From this definition of the heterogeneity metric, the heterogeneity level of a grid with only one class is defined as 1 which represents the lowest heterogeneous degree. Otherwise, the higher the number of classes that appear in a pixel, the higher the degree of heterogeneity. The highest level of heterogeneity assigned was 10 (Figure 4). From Figure 2 and Figure 4, it can be seen clearly that the areas of higher heterogeneity are distributed from the north east to south west area where fragmented cultivated land is common. The areas of lower heterogeneous grids were concentrated in the northwest which is dominated by larger cultivated fields.
Figure 4. The distribution of different heterogeneity levels in the study area. (a) class types within a grid (MODIS pixel scale), (b) the heterogeneity levels of MODIS pixels.
Figure 4. The distribution of different heterogeneity levels in the study area. (a) class types within a grid (MODIS pixel scale), (b) the heterogeneity levels of MODIS pixels.
Sensors 16 00207 g004
Figure 5 shows the RMSE and γ of Green, Red and NIR data generated by the three fused methods at different heterogeneity levels. For the three bands, the change trend decreases is accompanied by the heterogeneity degree increases, highlighting that the heterogeneity is a sensitive variable for the STARFM and has side-effects on the performance of the proposed method. However, an interesting result is that higher performance was achieved from USTARFM than those of STARFM at each heterogeneity level, except for level one (Figure 5a–d).
Figure 5. γ and RMSE of the three methods at different heterogeneity levels.
Figure 5. γ and RMSE of the three methods at different heterogeneity levels.
Sensors 16 00207 g005
Table 4 analyzes the relationship of the unmixing and resampled MODIS data at different heterogeneity levels to the Landsat reference data on 4 September 2014, which illustrates why USTARFM is superior to the STARFM. The relationship was obviously higher from 2 to 10 level, consistent with the trend of USTARFM trend. This is because the unmixing data instead of directed sampling data as basis for USTARFM can describe the mixed landscape clearly, especially in the higher heterogeneity level area, while the resampled method just offer the STARM uniform spot without reflecting the land cover heterogeneous characteristics. However, the areas at the heterogeneity level of 1 are special as the most homogeneous region, where STARFM has better performance. The USTARFM precedes the unmixing method but is inferior to STARFM for the green and red bands. Taking the result of the green band as an example, the RMSE and γ of the green band predicted by the STARFM are better than those for USTARFM (for green, RMSE: 0.0097 vs. 0.0102; γ: 0.82 vs. 0.79). This is because the accuracy of the directly resampled data in the homogeneous area for the STARFM is better than the unmixing data for USTARFM (for green, γ: 0.89 vs. 0.77; RMSE: 0.0102 vs. 0.0128) (Table 4), resulting in the lower accuracy of the proposed method.
Table 4. The relationship of data generated from the unmixing method and resampled data to the reference data at different heterogeneity levels (h).
Table 4. The relationship of data generated from the unmixing method and resampled data to the reference data at different heterogeneity levels (h).
hUnmixing DataResampled Data
GreenRedNIRGreenRedNIR
γ10.770.830.990.890.900.98
20.750.800.980.720.750.96
30.820.850.940.740.760.91
40.810.840.880.670.690.80
50.820.850.870.520.530.73
60.800.840.810.430.420.58
70.780.840.830.330.330.51
80.780.830.820.260.250.36
90.770.830.840.170.200.19
100.810.840.860.110.110.11
RMSE10.01280.01420.04870.01020.01110.0492
20.01670.01650.03770.01580.01520.0448
30.01770.01870.03550.01830.02000.0433
40.01730.01870.03340.01980.02310.0412
50.01800.02200.03840.02430.03270.0530
60.01940.02470.04120.02710.03800.0574
70.02000.02560.04560.02870.04090.0711
80.01960.02470.04840.02870.04070.0803
90.01840.02320.05170.02700.03870.0969
100.02070.02830.05800.03430.05050.1187
ERGAS11.45211.96751.77211.15641.54542.3189
22.15692.66780.87002.03402.45321.0356
32.15292.71510.70562.22022.90250.8611
41.98722.46810.67942.27633.05010.8381
51.79092.32770.80902.41463.45711.1177
61.74582.24900.89312.43263.45861.2431
71.70652.16961.05842.44773.47211.6493
81.64232.06221.16642.39943.40341.9355
91.60742.08151.23012.36733.47312.3083
101.62902.17371.41362.69593.87512.8950
In Figure 6g, the higher reflectance values for red and green bands seem to be underestimated in some scatter points (residential region, a MODIS pixel scale), which may be because the pure coarse pixel was affected by the surrounding pixels when it is decomposed in Equation (1), and the error of the unmixing data causes accumulated error in the USTARFM (Figure 6a). However, for the NIR band, the accuracy of the unmixing data is slightly better than directly resampled data in the homogeneous area (γ: 0.99 vs. 0.98; RMSE: 0.0487 vs. 0.0492), which makes the RMSE of the USTARFM lowest compared to the STARFM and unmixing methods (0.0198 vs. 0.0235 vs. 0.0487) (Figure 5e) and the scatterplot more close to 1:1 (Figure 6c). This is mainly because, relative to the green and red, NIR is the long-wave bands, which are not sensitive to weather related factors such as haze and have a high signal-to-noise when they are decomposed linearly [37].
Figure 6. Scatterplots of three bands generated by the three methods in the region with the heterogeneity level of 1.
Figure 6. Scatterplots of three bands generated by the three methods in the region with the heterogeneity level of 1.
Sensors 16 00207 g006

4.4. Synthetic Image Analysis

The synthetic images of these three methods look similar to the reference Landsat 8 images acquired 4 September 2014 (Figure 7a). Figure 7b–d shows that land cover with large area, e.g., water bodies, residential areas and wide roads, can be clearly identified. From the subplots of the synthetic images (Figure 7f–h), the synthetic image in the heterogeneous region produced by the unmixing method appears somewhat “blurry” and a homogeneous “plot” in which we could not discern the small objects. In contrast, areas from the images generated by the USTARFM and STARFM represent more spatial details (Figure 7g–h). Compared with the image predicted by STARFM (Figure 7g), the result produced by the USTARFM has a lower spectral deviation to the actual spectrum (Figure 7e) in the fragmentation region. This is because the directly resampled data used in STARFM has uniform spectral reflectance, which is unable to reveal the heterogeneity within the MODIS pixel.
Figure 7. The comparison of real and fusion images produced by the three algorithms (NIR-red-green combination). (a) shows the reference Landsat 8 images observed on 4 September 2014; (bd) are the prediction images by USTARFM, STARFM, and unmixing-based method, respectively; (eh) indicate the enlarged subset images of (ad), respectively.
Figure 7. The comparison of real and fusion images produced by the three algorithms (NIR-red-green combination). (a) shows the reference Landsat 8 images observed on 4 September 2014; (bd) are the prediction images by USTARFM, STARFM, and unmixing-based method, respectively; (eh) indicate the enlarged subset images of (ad), respectively.
Sensors 16 00207 g007

4.5. Algorithm Applicability Analysis

We analyzed another series of study dates to test the applicability of USTARFM. This study area is situated in a region of Virginia, in the Eastern United States (about 37.51°–37.72° N and 76.99°–77.26° W) where there are many small patches of different land types including forest, bare soil, and residential patches. The dataset includes Landsat 7 ETM+ and MOD09GA images acquired for 25 January, 26 February, and 17 May, in 2002. In the study, the Landsat 7 ETM+ and MOD09GA for 25 January is the base data for predicting the Landsat-like data collected 26 February and 17 May 2002. The dataset has been applied for data fused study, and the detailed description about study area and dataset can be found in publication [30].
The series parameters for the USTARFM were the same as for the previous experiment. Through analyzing the unmixed results of MODIS data for data acquired 25 January 2002, the optimal combination of window size and the class number (W = 31, k = 10) were also determined for unmixing the MODIS data acquired for 26 February, and 17 May 2002. Then these unmixed data are input to USTARFM for predicting the surface reflectance for data collected 26 February and 17 May 2002. The result for 26 February 2002 shows the accuracy of USTARFM is better than STARFM (i.e., for green band, the γ: 0.92 vs. 0.90; RMSE: 0.0097 vs. 0.0116; ERGAS: 0.9058 vs. 1.0797), which achieved higher accuracy at a smaller window size compared to STARFM (w = 15 × 15 vs. 45 × 45). The predicted reflectance of 17 May 2002 by USTARFM are nearly the same for the unmixing method, and are still better than STARFM (i.e., for green band, the γ: 0.80 vs. 0.83 vs. 0.74; RMSE: 0.0159 vs. 0.0148 vs. 0.0178; ERGAS: 1.4529 vs. 1.3487 vs. 1.6289). However, the performance of the predicted result is lower than those acquired on 26 February 2002, because some changes of land-cover types from the larger time span occurred, which is also consistent with Zhu’s result [30].
Figure 8 shows the synthetic images of 26 February and 17 May 2002 for the three methods. The prediction image by USTARFM (Figure 8(b1)) on 26 February 2002 is almost the same as that by STARFM (Figure 8(c1)) visually, but the quantitative evaluation of the former is better than the latter, and the unmixed image is worse for loss of texture details of surface ground in Figure 8(d1) (marked in circle). However, due to land cover changed over the long time span that single date Landsat was unable to record the changes, it was also unfulfillable to reflect the changed information from the fusion data [25]. This is why, in this paper, the predicted images from 17 May 2002 by USTARFM and STARFM have some variances from the actual image shown in Figure 8(a2) (marked by a rectangle). Therefore, the STAARCH model, proposed by Hilker [28], will offer us some new ideas to solve the problem. Besides, the high temporal frequency MODIS at long time scale can record the land surface changed trace, which may be a good alternative to reflect the crop changes in agricultural landscape.
Figure 8. Landsat images collected 26 February 2002 (1) and May 17 (2) and their enlarged sub-images (a1) and (a2); and the corresponding prediction sub-images by USTARFM (b1, b2), STARFM (c1, c2), and unmixing-based method (d1, d2).
Figure 8. Landsat images collected 26 February 2002 (1) and May 17 (2) and their enlarged sub-images (a1) and (a2); and the corresponding prediction sub-images by USTARFM (b1, b2), STARFM (c1, c2), and unmixing-based method (d1, d2).
Sensors 16 00207 g008

5. Conclusions

The temporal-spatial fusion of remote sensing data is an effective approach to solve the dilemma of the simultaneous attainment of high temporal and spatial resolution remote sensing images. In this study, an USTARFM algorithm based on improved STARFM supported by an unmixing-based method was developed to generate high-temporal-spatial remote sensing data. The USTARFM algorithm was tested by the experimental data from Landsat 8/Landsat 7 and MODIS images in two study areas.
USTARFM adopts the unmixed fraction from the unmixing data, which is able to reflect surface information in heterogeneous areas, to take place of the directed resampled data used for STARFM, which can increase the probability of searching the “pure pixels” and guarantee the fusion accuracy. This enables the USTARFM to achieve better performance than STARFM at the same window size, even smaller (w, 11 × 11 vs. 31 × 31). In the Hengshui study area, taking the predicted NIR band reflectance as an example, with the best combination of window size, class number and similar pixels searching window, γ of the USTARFM, STARFM and unmixing were 0.96, 0.95, 0.90, respectively (p-value < 0.001); RMSE were 0.0245, 0.0300, 0.0401, respectively; and ERGAS were 0.5416, 0.6507, 0.8737, respectively. However, the scatterplots of synthetic data produced by USTARFM are more close to 1:1. The synthetic images produced by the USTARFM look more similar to the reference images in RGB combination manner (Figure 7). Especially in the broken area, USTARFM is skilled in reducing the homogeneous “plot” generated by the unmixing method and maintains the fidelity of the spectrum. In addition, the similar conclusions for USTARFM were also obtained for a second study area in Virginia which is a relatively heterogeneous area.
The influence of heterogeneity on the three methods shows the certain regularity (Figure 5). The USTARM showed consistent higher performance than those of STARM, except for the Red and Green bands when heterogeneity degree as 1, highlighting that this method is capable of solving the problem of fused data that is present in STARM. It is an exciting result that, for the NIR band, the USTARFM is performs consistently better than STARFM when heterogeneity degree ranges from 1 to 10. Compared to the short-wave band as Red and Blue, the long-wave band, NIR, it is insensitive to the atmospheric effect to ensure the unmixing result, even in the area when heterogeneous degree is 1.
On the whole, USTARFM works better than STARFM and the unmixing based method. As the heterogeneity increased, the USTARFM was better than the STARFM and unmixing method. But if the land-cover patches are “extremely” broken and the high-spatial remote sensing images are unable to capture the texture details, it is still difficult to achieve an acceptable result.
There are some issues that need to be addressed in further study. First, the accuracy of the USTARFM depends on the performance of the unmixing data generated from Equation (1). The USTARFM method could get better results depending on the higher precision unmixing data. How to achieve better unmixed data from development of unmixing model or finer remote sensing is still an important research topic. Next, the land-cover types changed greatly over the long time span of this research, the performances of USTARFM and STARFM are degraded, such as 0.12 and 0.16 of correlation coefficient in green band lower than usual analyzed in Virginia area in this study. This problem urgently needs to be addressed because this affects the ability of USTARFM to detect cultivated land where usually had abrupt change at short temporal scale. Third, the co-registration errors between Landsat 8 and MODIS is still potential factor with side-effect on the fusion result, which are usually neglected. Previous researchers have done related research on the influence of co-registration errors on data fusion [39,40]. Maybe, this is another way needing to be concerned to improve USTARFM performance. Last but not least, the applicability of the USTARFM method in inversing parameters, such as e.g. NDVI or biomass, need to be further tested. Although the work in this paper is based on Landsat and MODIS surface reflectance data are popular satellite sources, the USTARFM algorithm still has a prospect for other high resolution images (e.g., Chinese GF-1 satellite data with 16-m resolution, French SPOT-5/6satellite data with 10/6 m resolution, etc.) and medium resolution images (e.g., the NPOESS VIIRS data U.S. operational polar satellite constellation which is the new generation satellite after MODIS).

Acknowledgments

This work are supported by the Natural Science Foundation of China (Grant No. 41301444), the "Youth talent plan" of Institutions of higher learning, Beijing, and the Major Project of High Resolution Earth Observation System, China. We thank USGS data center for providing free Landsat data and MODIS science team for providing free MODIS products.

Author Contributions

Dengfeng Xie and Jinshui Zhang designed research; Dengfeng Xie performed research; Dengfeng Xie, Jinshui Zhang, Xiufang Zhu, Yaozhong Pan, Hongli Liu, Miqizhou Yuan and Ya Yun analyzed data; Dengfeng Xie, Jinshui Zhang and Xiufang Zhu wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhang, W.; Li, A.; Jin, H.; Bian, J.; Zhang, Z.; Lei, G.; Qiu, Z.; Huang, C. An enhanced spatial and temporal data fusion model for fusing Landsat and MODIS surface reflectance to generate high temporal Landsat-like data. Remote Sens. 2013, 5, 5346–5368. [Google Scholar] [CrossRef]
  2. Price, J.C. How unique are spectral signatures? Remote Sens. Environ. 1994, 49, 181–186. [Google Scholar] [CrossRef]
  3. Emelyanova, I.V.; McVicar, T.R.; Van Niel, T.G.; Li, L.T.; van Dijk, A.I. Assessing the accuracy of blending Landsat-MODIS surface reflectance in two landscapes with contrasting spatial and temporal dynamics: A framework for algorithm selection. Remote Sens. Environ. 2013, 133, 193–209. [Google Scholar] [CrossRef]
  4. Rees, W.G.; Williams, M.; Vitebsky, P. Mapping land cover change in a reindeer herding area of the Russian arctic using Landsat TM and ETM+ imagery and indigenous knowledge. Remote Sens. Environ. 2003, 85, 441–452. [Google Scholar] [CrossRef]
  5. Masek, J.G.; Huang, C.; Wolfe, R.; Cohen, W.; Hall, F.; Kutler, J.; Nelson, P. North American forest disturbance mapped from a decadal Landsat record. Remote Sens. Environ. 2008, 112, 2914–2926. [Google Scholar] [CrossRef]
  6. Schroeder, T.A.; Wulder, M.A.; Healey, S.P.; Moisen, G.G. Mapping wildfire and clear-cut harvest disturbances in boreal forests with Landsat time series data. Remote Sens. Environ. 2011, 115, 1421–1433. [Google Scholar] [CrossRef]
  7. González-Sanpedro, M.C.; Le Toan, T.; Moreno, J.; Kergoat, L.; Rubio, E. Seasonal variations of leaf area index of agricultural fields retrieved from Landsat data. Remote Sens. Environ. 2008, 112, 810–824. [Google Scholar] [CrossRef]
  8. Ju, J.; Roy, D.P. The availability of cloud-free Landsat ETM+ data over the conterminous United States and globally. Remote Sens. Environ. 2008, 112, 1196–1211. [Google Scholar] [CrossRef]
  9. Arvor, D.; Jonathan, M.; Meirelles, M.S.P.; Dubreuil, V.; Durieux, L. Classification of MODIS EVI time series for crop mapping in the state of Mato Grosso, Brazil. Int. J. Remote Sens. 2011, 32, 7847–7871. [Google Scholar] [CrossRef]
  10. Notarnicola, C.; Duguay, M.; Moelg, N.; Schellenberger, T.; Tetzlaff, A.; Monsorno, R.; Costa, A.; Steurer, C.; Zebisch, M. Snow cover maps from MODIS images at 250 m resolution, Part 1: Algorithm description. Remote Sens. 2013, 5, 110–126. [Google Scholar] [CrossRef]
  11. Zhou, H.; Aizen, E.; Aizen, V. Deriving long term snow cover extent dataset from AVHRR and MODIS data: Central Asia study. Remote Sens. Environ. 2013, 136, 146–162. [Google Scholar] [CrossRef]
  12. Shabanov, N.V.; Wang, Y.; Buermann, W.; Dong, J.; Hoffman, S.; Smith, G.R.; Tian, Y.; Knyazikhin, Y.; Myneni, R.B. Effect of foliage spatial heterogeneity in the MODIS LAI and FPAR algorithm over broadleaf forests. Remote Sens. Environ. 2003, 85, 410–423. [Google Scholar] [CrossRef]
  13. Lunetta, R.S.; Lyon, J.G.; Guindon, B.; Elvidge, C.D. North American landscape characterization dataset development and data fusion issues. Photogramm. Eng. Remote Sens. 1998, 64, 821–828. [Google Scholar]
  14. Arai, E.; Shimabukuro, Y.E.; Pereira, G.; Vijaykumar, N.L. A multi-resolution multi-temporal technique for detecting and mapping deforestation in the Brazilian amazon rainforest. Remote Sens. 2011, 3, 1943–1956. [Google Scholar] [CrossRef]
  15. Bioucas-Dias, J.M.; Plaza, A.; Dobigeon, N.; Parente, M.; Du, Q.; Gader, P.; Chanussot, J. Hyperspectral unmixing overview: Geometrical, statistical, and sparse regression-based approaches. IEEE J. Appl. Earth Obs. Remote Sens. 2012, 5, 354–379. [Google Scholar] [CrossRef]
  16. Zhukov, B.; Oertel, D.; Lanzl, F.; Reinhackel, G. Unmixing-based multisensor multiresolution image fusion. IEEE Trans. Geosci. Remote Sens. 1999, 37, 1212–1226. [Google Scholar] [CrossRef]
  17. Keshava, N.; Mustard, J.F. Spectral unmixing. IEEE Signal Proc. Mag. 2002, 19, 44–57. [Google Scholar] [CrossRef]
  18. Zurita-Milla, R.; Clevers, J.G.P.W.; Schaepman, M.E. Unmixing-based Landsat TM and MERIS FR data fusion. IEEE Geosci. Remote Sens. Lett. 2008, 5, 453–457. [Google Scholar] [CrossRef] [Green Version]
  19. Zurita-Milla, R.; Kaiser, G.; Clevers, J.G.P.W.; Schneider, W.; Schaepman, M.E. Unmixing time series of MERIS full resolution data to monitor vegetation seasonal dynamics. Remote Sens. Environ. 2009, 113, 1874–1885. [Google Scholar] [CrossRef]
  20. Amorós-López, J.; Gómez-Chova, L.; Alonso, L.; Guanter, L.; Zurita-Milla, R.; Moreno, J.; Camps-Valls, G. Multitemporal fusion of Landsat/TM and ENVISAT/MERIS for crop monitoring. Int. J. Appl. Earth Obs. Geoinf. 2013, 23, 132–141. [Google Scholar] [CrossRef]
  21. Zurita-Milla, R.; Gómez-Chova, L.; Guanter, L.; Clevers, J.G.; Camps-Valls, G. Multitemporal unmixing of medium-spatial-resolution satellite images: A case study using MERIS images for land-cover mapping. IEEE Trans. Geosci. Remote Sens. 2011, 49, 4308–4317. [Google Scholar] [CrossRef]
  22. Gevaert, C.M.; García-Haro, F.J. A Comparison of STARFM and an Unmixing-based algorithm for Landsat and MODIS data fusion. Remote Sens. Environ. 2015, 156, 34–44. [Google Scholar] [CrossRef]
  23. Liu, D.; Pu, R. Downscaling thermal infrared radiance for subpixel land surface temperature retrieval. Sensors 2008, 8, 2695–2706. [Google Scholar] [CrossRef]
  24. Wu, M.Q.; Huang, W.J.; Niu, Z.; Wang, C.Y. Generating daily synthetic Landsat imagery by combining Landsat and MODIS data. Sensors 2015, 15, 24002–24025. [Google Scholar] [CrossRef] [PubMed]
  25. Gao, F.; Masek, J.; Schwaller, M.; Hall, F. On the blending of the Landsat and MODIS surface reflectance: Predicting daily Landsat surface reflectance. IEEE Trans. Geosci. Remote Sens. 2006, 44, 2207–2218. [Google Scholar]
  26. Xu, Y.; Huang, B.; Xu, Y.; Cao, K.; Guo, C.L.; Meng, D.Y. Spatial and Temporal Image Fusion via Regularized Spatial Unmixing. IEEE Geosci. Remote Sens. Lett. 2015, 12, 1362–1366. [Google Scholar]
  27. Wu, M.Q.; Li, H.; Huang, W.J.; Niu, Z.; Wang, C.Y. Generating daily high spatial land surface temperatures by combining ASTER and MODIS land surface temperature products for environmental process monitoring. Environ. Sci. Process. Impacts. 2015, 17, 1396–1404. [Google Scholar] [CrossRef] [PubMed]
  28. Hilker, T.; Wulder, M.A.; Coops, N.C.; Linke, J.; McDermid, G.; Masek, J.G.; Gao, F.; White, J.C. A new data fusion model for high spatial-and temporal-resolution mapping of forest disturbance based on Landsat and MODIS. Remote Sens. Environ. 2009, 113, 1613–1627. [Google Scholar] [CrossRef]
  29. Singh, D. Generation and evaluation of gross primary productivity using Landsat data through blending with MODIS data. Int. J. Appl. Earth Obs. Geoinf. 2011, 13, 59–69. [Google Scholar] [CrossRef]
  30. Zhu, X.; Chen, J.; Gao, F.; Chen, X.; Masek, J.G. An enhanced spatial and temporal adaptive reflectance fusion model for complex heterogeneous regions. Remote Sens. Environ. 2010, 114, 2610–2623. [Google Scholar] [CrossRef]
  31. Wu, M.; Niu, Z.; Wang, C.; Wu, C.; Wang, L. Use of MODIS and Landsat time series data to generate high-resolution temporal synthetic Landsat data using a spatial and temporal reflectance fusion Model. J. Appl. Remote Sens. 2012, 6. [Google Scholar] [CrossRef]
  32. Wu, C.; Chen, J.; Huang, N. Predicting gross primary production from the enhanced vegetation index and photosynthetically active radiation: Evaluation and calibration. Remote Sens. Environ. 2011, 115, 3424–3435. [Google Scholar] [CrossRef]
  33. Settle, J.J.; Drake, N.A. Linear mixing and the estimation of ground cover proportions. Int. J. Remote Sens. 1993, 14, 1159–1177. [Google Scholar] [CrossRef]
  34. García-Haro, F.J.; Sommer, S.; Kemper, T. A new tool for variable multiple endmember spectral mixture analysis (VMESMA). Int. J. Remote Sens. 2005, 26, 2135–2162. [Google Scholar] [CrossRef]
  35. Busetto, L.; Meroni, M.; Colombo, R. Combining medium and coarse spatial resolution satellite data to improve the estimation of sub-pixel NDVI time series. Remote Sens. Environ. 2008, 112, 118–131. [Google Scholar] [CrossRef]
  36. Jia, K.; Liang, S.; Zhang, N.; Wei, X.; Gu, X.; Zhao, X.; Yao, Y.; Xie, X. Land cover classification of finer resolution remote sensing data integrating temporal features from time series coarser resolution data. ISPRS J. Photogramm. Remote Sens. 2014, 93, 49–55. [Google Scholar] [CrossRef]
  37. Lillo-Saavedra, M.; Gonzalo, C.; Arquero, A.; Martinez, E. Fusion of multispectral and panchromatic satellite sensor imagery based on tailored filtering in the Fourier domain. Int. J. Remote Sens. 2005, 26, 1263–1268. [Google Scholar] [CrossRef]
  38. Vermote, E.F.; El Saleous, N.Z.; Justice, C.O. Atmospheric correction of MODIS data in the visible to middle infrared: First results. Remote Sens. Environ. 2002, 83, 97–111. [Google Scholar] [CrossRef]
  39. Amorós-López, J.; Gómez-Chova, L.; Alonso, L.; Guanter, L.; Moreno, J.; Camps-Valls, G. Regularized multiresolution spatial unmixing for ENVISAT/MERIS and Landsat/TM image fusion. IEEE Geosci. Remote Sens. Lett. 2011, 8, 844–848. [Google Scholar] [CrossRef]
  40. Tan, B.; Woodcock, C.E.; Hu, J.; Zhang, P.; Ozdogan, M.; Huang, D.; Myneni, R.B. The impact of gridding artifacts on the local spatial properties of MODIS data: Implications for validation, compositing, and band-to-band registration across resolutions. Remote Sens. Environ. 2006, 105, 98–114. [Google Scholar] [CrossRef]

Share and Cite

MDPI and ACS Style

Xie, D.; Zhang, J.; Zhu, X.; Pan, Y.; Liu, H.; Yuan, Z.; Yun, Y. An Improved STARFM with Help of an Unmixing-Based Method to Generate High Spatial and Temporal Resolution Remote Sensing Data in Complex Heterogeneous Regions. Sensors 2016, 16, 207. https://doi.org/10.3390/s16020207

AMA Style

Xie D, Zhang J, Zhu X, Pan Y, Liu H, Yuan Z, Yun Y. An Improved STARFM with Help of an Unmixing-Based Method to Generate High Spatial and Temporal Resolution Remote Sensing Data in Complex Heterogeneous Regions. Sensors. 2016; 16(2):207. https://doi.org/10.3390/s16020207

Chicago/Turabian Style

Xie, Dengfeng, Jinshui Zhang, Xiufang Zhu, Yaozhong Pan, Hongli Liu, Zhoumiqi Yuan, and Ya Yun. 2016. "An Improved STARFM with Help of an Unmixing-Based Method to Generate High Spatial and Temporal Resolution Remote Sensing Data in Complex Heterogeneous Regions" Sensors 16, no. 2: 207. https://doi.org/10.3390/s16020207

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop