Open AccessArticle

River Extraction under Bankfull Discharge Conditions Based on Sentinel-2 Imagery and DEM Data

State Key Laboratory of Hydroscience and Engineering, Tsinghua University, Beijing 100084, China

Emergency Science Research Institute, China Coal Research Institute, Beijing 100013, China

Author to whom correspondence should be addressed.

Remote Sens. 2021, 13(14), 2650; https://doi.org/10.3390/rs13142650

Submission received: 25 April 2021 / Revised: 4 June 2021 / Accepted: 30 June 2021 / Published: 6 July 2021

(This article belongs to the Section Remote Sensing in Geology, Geomorphology and Hydrology)

Download

Browse Figures

Graphical abstract
"> Figure 1
The technical flowchart of river extraction under bankfull discharge. "> Figure 2
Geographical location of the study area and distribution of the hydrological stations and training samples. Subregions are marked as “First”, “Second”, and “Third”. Hydrological stations are marked with yellow triangles. Water samples are marked with blue crosses, and nonwater samples are marked with green crosses. "> Figure 3
Cross-section morphology of the year 2015 (a) and the water level-discharge rating curve (b) of a typical cross section at the Mentang hydrological station. "> Figure 4
The ratios of the daily discharges to bankfull discharges. "> Figure 5
Ranking of the importance of the feature variables in three subregions. (a–c) ranking of MDA for the first, second, and third region; (d–f) ranking of MDG for the first, second, and third region. "> Figure 6
Comparisons between the extracted connected rivers with a width over 30 m and river networks above order 3 that were generated from a 90-m resolution DEM. "> Figure 7
Comparisons between all the extracted rivers and drainage networks above order 2 that were generated from a 90-m resolution DEM. "> Figure 8
Comparison between the estimated river width and in situ-measured data. "> Figure 9
Distribution characteristics of the (a) bankfull river widths and (b) contributing area and bankfull discharge along the main stream of the upper Yellow River. BRW—Bankfull river width. "> Figure 10
Regression relationships between the estimated river widths and the contributing areas and bankfull discharges. (a,b) The regressions of single-thread river reaches, excluding the hydrological stations affected by reservoirs; (c,d) the regressions of all river reaches, excluding the hydrological stations affected by reservoirs. ">

Review Reports Versions Notes

Abstract

River discharge and width, as essential hydraulic variables and hydrological data, play a vital role in influencing the water cycle, driving the resulting river topography and supporting ecological functioning. Insights into bankfull river discharge and bankfull width at fine spatial resolutions are essential. In this study, 10-m Sentinel-2 multispectral instrument (MSI) imagery and digital elevation model (DEM) data, as well as in situ discharge and sediment data, are fused to extract bankfull river widths on the upper Yellow River. Using in situ cross-section morphology data and flood frequency estimations to calculate the bankfull discharge of 22 hydrological stations, the one-to-one correspondence relationship between the bankfull discharge data and the image cover data was determined. The machine learning (ML) method is used to extract water bodies from the Sentinel-2 images in the Google Earth Engine (GEE). The mean overall accuracy was above 0.87, and the mean kappa value was above 0.75. The research results show that (1) for rivers with high suspended sediment concentrations, the water quality index (SRMIR-Red) constitutes a higher contribution; the infrared band performs better in areas with greater amounts of vegetation coverage; and for rivers in general, the water indices perform best. (2) The effective river width of the extracted connected rivers is 30 m, which is 3 times the image resolution. The R², root mean square error (RMSE), and mean bias error (MBE) of the estimated river width values are 0.991, 7.455 m, and −0.232 m, respectively. (3) The average river widths of the single-thread sections show linear increases along the main stream, and the R² value is 0.801. The river width has a power function relationship with bankfull discharge and the contributing area, i.e., the downstream hydraulic geometry, with R² values of 0.782 and 0.630, respectively. More importantly, the extracted river widths provide basic data to analyze the spatial distribution of bankfull widths along river networks and other applications in hydrology, fluvial geomorphology, and stream ecology.

Keywords:

Sentinel-2 imagery; bankfull discharge; downstream hydraulic geometry; machine learning; Google Earth Engine; river width

Graphical Abstract

1. Introduction

Currently, water resource availability is severely deficient, and the protection of water source areas and estimation of changes in runoff are receiving considerable attention. Rivers and streams are essential parts of the global hydrologic cycle; 90% of the water flux transported from continents to the ocean (approximately 37% of the total terrestrial precipitation) is carried by rivers [1,2]. River discharge and width are essential hydraulic variables and hydrological data needed to inform river management and restoration efforts. Bankfull discharge is morphologically critical because it represents the link between within-bank processes and floodplain processes, and bankfull discharge is frequently used to estimate the channel forming or dominant discharge of alluvial rivers [3,4]. Therefore, understanding the dynamics of bankfull discharge and bankfull river width at fine spatial resolutions is essential for applications in hydrology, fluvial geomorphology, and stream ecology [5].

Remote sensing is an effective method for extracting open-surface inland water bodies over a variety of spatiotemporal scales compared with other field survey methods employed in the past decades [6,7,8,9,10,11,12,13]. Among all the extraction methods, the water index method is widely used to detect open-surface water bodies. In particular, the modified normalized difference water index (mNDWI) proposed by Xu [14] has been used by many researchers to extract global water bodies [15,16]; however, this technique produces errors in mixed pixels with water bodies and other land-cover types [17]. To reduce the effects of other land-cover types on the successful identification of water bodies, researchers have improved the accuracy of water detection by simultaneously utilizing several water body indices [18,19,20]. Wang et al. combined the mNDWI, land surface water index (LSWI), and two greenness-based vegetation indices (enhanced vegetation index (EVI) and normalized difference vegetation index (NDVI)) to detect open-surface water bodies [21]. Many researchers have conducted large-scale, rapid river extraction, and detected river channel changes based on the Google Earth Engine (GEE) platform [16,21,22,23,24]. GEE integrates many open source satellite images and various derivative products, providing strong support for efficient water body extraction [20]. However, for rivers with high suspended sediment concentrations, the existing methods are usually less accurate in river width extraction. The Yellow River has the highest suspended sediment concentration in the world. To extract water bodies from the Yellow River with high accuracy, new data and algorithms are needed to improve upon previously implemented methods.

River width datasets of various spatial resolutions, ranging from the global scale to the basin scale, have recently been provided by many researchers. Allen and Pavelsky provided the Landsat-derived North American River Width (NARWidth) dataset, which contains river width at mean annual discharge and extrapolates the strong relationship observed between the river width and its total surface area [25]. Allen and Pavelsky provided the first global river width database under mean annual discharge conditions based on Landsat imagery and found that rivers and streams likely play a great role in controlling land–atmosphere fluxes [26]. Li et al. extracted small and open-surface river information in the upper Yellow River by fusing a digital elevation model (DEM) and Sentinel-2 imagery corresponding to the average discharge during the summer flood season [20]. Gleason et al. extracted instantaneous cross-sectional flow widths during mean daily flow conditions from Landsat imagery and used these measurements to approximate the at-many-stations hydraulic geometry (AMHG) of the area; then, an ensemble of genetic algorithms was used to retrieve the instantaneous river discharge for each satellite image acquisition date [27]. Bankfull discharge is often used as a surrogate for channel forming or dominant discharge, which is the morphologically significant discharge that shapes the river [3,28]. Bankfull river width is one of the fundamental measures of stream size, and it is also a key parameter in the study of river geomorphology. However, most existing river width datasets were not acquired under bankfull discharge conditions due to (1) the limited number of high-quality images constrained by the satellite revisitation periods and the influence of snow and clouds; and (2) limited in situ measurement data can be used to calculate bankfull discharge. Therefore, attention should be given to river width extraction under bankfull discharge to better understand the river morphology and sediment transport spatial distributions, such as in the northeast Qinghai-Tibetan Plateau (QTP) region.

Yamazaki et al. developed the Global Width Database for Large Rivers (GWD-LR) for rivers wider than 183 m by applying a new algorithm to the SRTM Water Body Database and the HydroSHEDS flow direction map [29]. Pavelsky provided a river width dataset of Tanana for rivers wider than 150 m and calculated the power law relationship between the river widths and discharge amounts [30]. Allen and Pavelsky proposed the NARWidth dataset, which contains measurements of >2.4 × 10⁵ km for rivers wider than 30 m. The researchers extrapolated the strong relationship observed between river width and the total surface area measured at different river widths (r² > 0.99 for 100–2000 m widths) to narrower rivers and streams [25]. Pekel et al. mapped the global surface water and global hydromorphic features observed by Landsat satellites with a 30-m resolution over the past 32 years [15]. Allen and Pavelsky provided the first detailed global river width database for rivers wider than 90 m [26]. Based on the classification of river size proposed by Meybeck et al. [31], most of the existing river width datasets focus on medium to large rivers (small, medium, and large rivers have widths of 40–200, 200–800, and 800–1500 m, respectively). Very few studies have focused on small to medium river width estimations in mountainous areas using high-resolution satellite images [32,33,34,35]. As the headwaters of many large rivers, there is an abundance of small rivers with widths less than 200 m and smaller rivers with widths less than 40 m in the QTP. Therefore, a bankfull river width dataset with a finer resolution (river width < 90 m) for the QTP is necessary to facilitate research on fluvial geomorphology and hydrological modeling.

To date, there are no in-depth studies of the river network structure and runoff characteristics of the QTP, which is known as the “Asian Water Tower”, containing the headwaters of ten major rivers across the Asian continent, and related research has just begun [19,34,35,36]. With the development of global climate change, the QTP has received more attention, and there is an urgent need for high-precision extraction of small rivers to understand the dynamic changes and hydraulic geometric relationships of rivers in this mountainous area. However, most river discharge and width observations are measured at ground-based gauges in the QTP region [36], which may limit a deeper understanding of river morphology, sediment transport, and flood routing, as well as their ecological impacts on the QTP.

In terms of the aforementioned research gaps, regarding the low width extraction precision of rivers with high suspended sediment concentrations, the less established physical meaning of the extracted river widths at annual mean discharge and mean discharge of summer flood season conditions, and the fact that the minimum extracted river widths are usually >90 m, the following objectives are proposed in this research: (1) obtain the river widths in the upper Yellow River Basin under bankfull discharge, combining a DEM, in situ hydrological data, and Sentinel-2 images with high temporal (5 days) and spatial resolutions (10 m); and (2) detect open-surface water bodies and monitor the downstream dynamic changes of river width under bankfull discharge conditions on the upper Yellow River Basin. A detailed technical flowchart is shown in Figure 1.

Considering the classification results of Meybect et al. [31] and the measured data from hydrological stations in the upper Yellow River Basin, rivers with widths smaller than 90 m are seen as small rivers in this study. It is anticipated that this research can close gaps in areas lacking hydrological data and assist in understanding the changes in river geometry within river networks.

2. Study Area and Data Preprocessing

2.1. Study Area

The study area is in the upper Yellow River (upstream from the Anningdu hydrological station) on the northeastern margin of the QTP. The total drainage area is approximately 250,944.65 km² (Figure 2). This area is composed of a series of alternating mountains, valleys, and hills with an elevation range of 1344–6295 m. The elevation shows a decreasing trend from more than 6200 m in the source area to less than 1400 m in the northeastern area. There are abundant river landforms and erosional types and many river canyons. A series of cascade reservoirs, such as Longyangxia, Laxiwa, Liujiaxia, and Lijiaxia, are built on the mainstream.

The upper Yellow River is in the mid-latitude area, which has a typical plateau continental climate with little rain in winter and concentrated precipitation in summer. Controlled by the southwest and southeast air currents, the distribution of precipitation during the year is uneven, with large interannual variability. Approximately 60–80% of precipitation is concentrated from June to September, with the least precipitation occurring in December and January. Climate changes with terrain height lead to differences in precipitation and temperature throughout various regions, with small annual temperature differences, large daily temperature differences, long sunshine hours, and strong solar radiation days.

The suspended sediment concentrations of the different river sections in the study area are quite different; these conditions directly affect the extraction accuracy of river width. From June to October 2017, the suspended sediment concentration in the main stream of the Yellow River increased from 0.0456 kg/m³ at the HHY4 hydrological station at the source to 0.5552 kg/m³ at the TNH station; the concentration dropped rapidly to 0.0596 kg/m³ at the GD2 station, and then gradually increased to 1.3898 kg/m³ at the exit of the AND station. The reasons we use the suspended sediment concentration from June to October 2017 are (1) this time period corresponds to the Sentinel-2 image selection period, and the SSCs of the other time periods have no effects on the remote sensing images selection; and (2) the sediment transport of the upper reaches of the Yellow River is mainly concentrated in June to October, accounting for 89–100% of the annual sediment transport.

The study area was divided into three subregions mainly on the basis of the surface features and river-suspended sediment concentrations. The cloud and snow cover conditions and mountain shadows were also considered (Figure 2). The underlying surface features and river-suspended sediment concentrations are the main criteria for subregional classification. The former is based on the national land-use cover change (LUCC2015) dataset collected in 2015, and the latter is based on the Annual Hydrological Reports of the People’s Republic of China (2017) [37]. The main underlying surface types and river suspended sediment concentrations of the three subregions are shown in Table 1.

2.2. Hydrological Data Collection and Processing

2.2.1. Hydrological Data Collection

The data used in this study were acquired from the Annual Hydrological Reports of the People’s Republic of China (1967–2019), and these data include river width, flow depth, flow velocity, flow discharge, suspended sediment concentration, and cross-section information. Referring to the research of Qin et al. [38], detailed information (river names, where the rivers flow to, longitude, latitude, altitude, contributing area, distance to estuary, cross sections selected to extract river widths, cross sections of the mainstream without reservoir effects, and annual peak discharge used to calculate flood frequency) of all 68 in situ-measured cross sections are presented in Table S1. The locations of the hydrological stations are represented with yellow triangles in Figure 2.

2.2.2. Bankfull Discharge Calculation

Bankfull river width is one of the basic channel geometry parameters associated with bankfull discharge. Therefore, the river width under the condition of bankfull discharge is the most significant in the river width extraction. In situ cross-sectional data (2007–2017) and in situ discharge data (1967–2019) from the upper Yellow River were used to determine the bankfull discharge of the main stream and its tributaries (Table S1).

Calculation of Bankfull Discharge Based on Cross-Section Morphology

In this study, cross sections that were less influenced by human activities (e.g., no hydropower stations or artificial diversions 10 km upstream or downstream of the measured cross section and located outside the backwater zone of a dam) and extreme events (e.g., glacial outbursts and landslides) were selected to maximize removal of external disturbances. The morphology of each cross section was determined based on in situ measurement data during 2007–2017.

The surveyed bankfull-stage indicator and its corresponding water level were detected for each cross section. Figure 3a shows a typical cross section of the main stream of the upper Yellow River at the Mentang (MT) hydrological station. The bankfull stage was obtained from the bankfull-stage indicator (red dot in Figure 3a) and the bankfull width was estimated from the surveyed cross-section geometry during 2007–2017. The bankfull discharge that corresponded to bankfull stage was obtained from either the measured data or stage–discharge relation (Figure 3b).

Calculation of Bankfull Discharge Based on Flood Frequency

At many sections, the bankfull indicator is not available on mountainous river reaches, so bankfull discharges were determined based on flood frequency analysis. The annual maximum peak discharge of these cross sections was selected, and the Pearson III (P-III) curve was used to estimate the flood frequencies and corresponding discharges.

Two cases were used to determine the flood frequency and bankfull discharge. For the first case, there are more than two cross sections located within the same river reach. The morphology of each cross section was first depicted to see whether there is bankfull turning points (red point in Figure 3a). Then, the flood frequencies of all cross sections were estimated with P-III curves. For those cross sections that have no bankfull turning points, the flood frequencies of these cross sections were assumed to be the same as those cross sections with bankfull turning points. Lastly, the hydraulics (river widths and discharges) of the cross sections with no bankfull turning points under bankfull conditions were estimated through their shared flood frequency.

The second case occurs when there is only one cross section in the same river reach and when the cross section has no bankfull turning point. The flood frequency of the cross section was assumed to be the same as those cross sections with bankfull turning points within the same steam order. Then, the river width and discharge of the cross section were estimated under bankfull conditions through the shared flood frequency.

Referring to former studies of our research group [38,39] and comprehensively considering the remote sensing images coverage, we set up a criteria for hydrological data screening: (1) the completeness of the hydrological dataset (river width, discharge, and cross-section morphology); (2) have had relatively low anthropogenic influence (e.g., no hydropower station and artificial diversion 5 km upstream and downstream of the measured cross section, and located outside the backwater zone of a dam); (3) act as a natural riverway with perennial drainage; and (4) the quality and coverage of Sentinel-2 images under bankfull conditions. According to the above criterion, 22 hydrological stations (12 on the main stream and 10 on the tributaries) were ultimately selected from all 68 stations evaluated in this study (see Table S1). All 22 stations are located at single-thread river reaches though multi-thread reaches do exist in the upper Yellow River Basin. The bankfull discharges, calculation methods, and corresponding flood frequencies of the 22 hydrological stations are shown in Table 2.

2.3. Sentinel-2 Images Selection and DEM Processing

2.3.1. Sentinel-2 Images Selection under Bankfull Discharge

In order to be as close as possible to the time when the satellite passed through the hydrological stations, we chose the average discharge between 10 a.m. and 2 p.m. as the daily discharge (DD). To best represent the bankfull conditions, the dates on which the daily discharge falls within the bankfull discharge (BD × 1 ± 15%) interval were extracted from the in situ data of each of the 22 hydrological stations from June to October in 2017–2019. Then, Sentinel-2 multispectral instrument (MSI) images that can completely cover the contributing area of each hydrological station on the corresponding dates were selected. A website (https://scihub.copernicus.eu/userguide/, accessed on 21 February 2021) provides in-depth descriptions of the products and algorithms of Sentinel-2, as well as their performances. For individual hydrological stations with incomplete image cover or large amounts of cloud cover, the selection range of DD was expanded to BD × 1 ± 25% (Figure 4). Figure 4 shows the ratio relationship between the DD of the Sentinel-2 image cover and the BD of the 22 hydrological stations.

Figure 4 shows that 81.25% of the images have DD values within BD × 1 ± 15%. The images located in the range of BD × 1 ± 15% to BD × 1 ± 25% are mostly used to supplement the corner position of the control basins. The left side of the red line in the figure consists of images of the mainstream of the upper Yellow River, while the right side comprises images of the tributaries. The DD along the mainstream is closer to the BD than the tributaries.

2.3.2. River Network Extraction from DEM

Drainage network extraction plays an important role in geomorphologic analyses and hydrologic modeling studies, among other applications. Bai [40] and Wu et al. [41] used an enhanced flow enforcement method without elevation modification towards accurate and efficient drainage network extraction. In this study, the Drainage Network Extraction Tool (DNET) developed by Wu et al. [41] and Bai [40] was used to extract a total of seven orders of river networks from the SRTM 90-m DEM according to the minimum confluence area of 7.29 km².

In addition to the rivers, there are other ground features in the study area, such as roads, buildings, vegetation, and bare land, which directly affect the efficiency and accuracy of river extraction. River networks with orders of 3–7 were used to construct buffer zones for the river networks according to a fixed distance, and the Sentinel-2 images under bankfull discharge conditions were masked using buffer zones. Through this operation, all the information outside the buffer zone of the river networks can be removed to reduce the impact of other noise. In this study, based on the measured river widths from the hydrological data and Google Earth, the buffer distance of the river networks with orders of 3–4 was set to 1000 m, and the buffer distance of the river networks with orders of 5–7 was set to 3000 m.

3. Method of River Extraction

The underlying surface and suspended sediment concentrations of water bodies directly affect the river extraction accuracy and the data postprocessing workload [20]. Some problems exist when using the water body index threshold method to extract water bodies. When the threshold is too large, despite achieving basically complete water body extraction, too many noise points exist, and the data postprocessing workload is considerable; when the threshold is too small, there are fewer noise points, and the postprocessing workload is less intense, but the water body cannot be completely extracted. To improve the accuracy of the river extraction while also reducing the workload of data postprocessing, a machine learning (ML) random forest (RF) algorithm was used to extract the water bodies according to different underlying surface features and river-suspended sediment concentrations (Table 1). ML involves the use of data or past experiences to optimize the performance standards of computer programs. RF is a classification tree-based algorithm proposed by Breiman [42]. RF is essentially an extension of the traditional decision tree algorithm, and it improves the classification accuracy of models by combining multiple decision trees.

3.1. Training Samples Selection

Based on Sentinel-2 images under bankfull discharge in the control basin of 22 hydrological stations determined in Section 2.3, water and nonwater samples (including vegetation, residential land, roads, farmland, bare land, and snow) were manually selected from the land-cover images on the GEE platform. The first subregion has a total of 713 samples (400 water bodies, 313 nonwater bodies); the second subregion has a total of 409 samples (202 water bodies, 207 nonwater bodies); and the third subregion has 608 samples (302 water bodies, 306 nonwater bodies) (Figure 2).

3.2. Features Extraction

Considering the characteristics of high altitude, a complex underlying surface, and a high suspended sediment concentration in the study area, three types of features were extracted from the DEM and Sentinel-2 imagery. These features are listed as follows:

Basic information of the DEM and the band reflectivity of Sentinel-2 images (10 features) are provided, including elevation, aspect, slope, and hillshade derived from the DEM and the band reflectivities of B2, B3, B4, B8, B11, and B12 from the Sentinel-1 imagery.
The gray level cooccurrence matrix (GLCM) is employed to derive certain textural features (180 features). GEE provides a total of 18 matrices, of which 14 are from Haralick et al. [43] and 4 are from Conners et al. [44]. Please refer to these two papers for the meaning of each feature and their detailed calculation formulas, as this study will not explain these features in detail. For the 10 basic features obtained in the first step, the 18 texture features were extracted from the GEE platform.
The spectral indices of the Sentinel-2 images (49 features) were constructed based on the apparent reflectance of the B2, B3, B4, B8, B11, and B12 bands (see Table 3). Most of the spectral indices originate from the remote sensing index database (https://www.indexdatabase.de/, accessed on 21 February 2021).

3.3. ML-RF Features Selection and Water Extraction

In this study, we first tested for multicollinearity between variables and removed variables with a Pearson’s r > 0.85 to reduce the redundancy. However, the noise in the water body extraction results was much more than that of the extraction results without removing any variables, so we did not remove any variables in the end. The feature variables that substantially contributed to the model’s accuracy were sorted according to the mean decrease accuracy (MDA) method in the RF algorithm (http://blog.datadive.net/selecting-good-features-part-iii-random-forests/, accessed on 21 February 2021). The MDA method directly measures the impact of each feature on the accuracy of the model. The main idea of this method is to disrupt the order of the features and measure the impact of order changes on the accuracy of the model. The order of the feature variables sorted from top to bottom is shown in Figure 5. The first 6 features were selected to represent all 239 features to construct a feature subset for ML modeling.

The RF model was used to reorder the feature subsets of the three subregions according to the value of MDA and the impact of each feature on the Gini node impurity. The six features of the three subregions in order of importance are aweish, ewi, wi2015, B8, WDVI, and B11 in the first region; SRMIR_Red, B2, rndwi, B2_diss, CRI550, and Fe2 in the second region; and B8, B11, wi2015, WDVI, aweish, and GDVI in the third region.

The model-feature ranking shows that for the first region where the river-suspended sediment concentration is lowest, the underlying surface is mainly bare land (rocks, sandy land) with low-to-medium grassland coverage, and the top three contributing characteristics are the water body indices. For the second region, where the river-suspended sediment concentration is highest and the underlying surface features are mainly urban and rural land, arable land, and low-to-medium grassland coverage, the largest contribution is the water quality index-SRMIR_Red, followed by the blue band and water body index. This finding shows that the water quality index greatly contributes to this area with a high suspended sediment concentration and more bare yellow land. For the third region where the river suspended sediment concentration ranks in the middle and the underlying surface is mainly high-coverage grasses and shrubs, the two infrared bands (B8 and B11) contribute the most, which further supports the sensitive response of the infrared band to water and vegetation.

3.4. Model Evaluation

The overall accuracy (OA) and kappa coefficient were calculated from the confusion matrix to characterize the accuracy of the ML modeling classification. The kappa coefficient is a ratio that represents the error reduction between an evaluated classification and a completely random classification. In general, the minimum allowable discrimination accuracy of the kappa coefficient is 0.7 [57]. The formula is shown below:

K = \frac{N \cdot \sum_{i}^{r} x_{i i} - \sum (x_{i +} \cdot x_{+ i})}{N^{2} - \sum (x_{i +} \cdot x_{+ i})}

(1)

where K is the kappa coefficient, r is the number of rows in the error matrix, x_ii is the value on row i and column i (main diagonal), x_i+ and x_+i are the sum of the i-th row and the i-th column, respectively, and N is the total number of samples. In most cases, kappa statistics are used to evaluate the classification effect. Landis and Koch [58] determined that when the statistical kappa value is within the range of 0.60–0.80, the strength of agreement is substantial, and when the statistical kappa value is above 0.8, the strength of agreement is nearly perfect.

A total of 10 repeated models were executed, and the repeated 3-fold cross-validation results are shown in Table 4.

Combining the information in Table 4 with Landis and Koch’s [58] classification, the classification results of the three subregions are very good, and the classification OA is in the order of third > second > first; the kappa average value of the third subregion is >0.8.

4. Results

4.1. River Extraction Results

Although the drainage network from the DEM is used to constrain the image and remove most of the background noise, there is still strong background noise present in certain areas. The extracted river raster results are converted into vectors using ARCGIS 10.2, and then, the individual areas with noise are edited manually. The final editing results are shown in Figure 6 and Figure 7. Figure 6 shows the result of the extracted connected rivers above 30 m and the drainage network above order 3 from the DEM; Figure 7 shows the result of all the extracted rivers (connected + disconnected) and the drainage network above order 2.

Figure 6 shows that all rivers (excluding areas A and B) above order 4 and nearly half of the order 4 rivers were extracted. The effective connected width of the extracted rivers is greater than 30 m, which is 3 times the image resolution. Figure 7 also shows that the rivers extracted by the ML method basically cover all the river networks above order 3 and most of the order 3 rivers in the 10-m Sentinel-2 images under bankfull discharge. Among them, area C was not successfully extracted; thus, the original satellite image was checked, and the area contained a township road. In and around area A, in the northern part of area B, and on the right side of area C, some rivers of orders 3 and 4 were not successfully extracted. Based on a check of the satellite images, one reason that this extraction was unsuccessful was that some river sections were too narrow (river width < 20 m); the other reason was due to image losses caused by cloud removal processing; and the third explanation was that the extraction result was too noisy to distinguish the boundary of the river and directly delete it.

4.2. Estimation Accuracy of River Width

Three indicators, i.e., R², root mean square error (RMSE), and mean bias error (MBE) [20], were used to quantitatively evaluate the estimated river width. The boundary of the extracted river vectors in Figure 6 was considered to be the bank of the river. First, the river center lines were extracted from the final edited river results, and then, the perpendicular lines of the river center lines were established [26]. There are two intersections between the perpendicular line and the two banks of the river. For a single-thread river, the distance between the intersections of the two banks of the river is the river width. For a multiple-thread river, the river width is the sum of the width of each flow. For each measured cross section, the resolution of the remote sensing image is 10 m; therefore, 5 m is taken as the step length, and the average of three consecutive measurements is taken as the final river width. Figure 8 shows the linear regression between the estimated bankfull river width and the measured river width from 22 hydrological stations.

The in situ river widths of 22 hydrological stations were used to evaluate the estimated river widths extracted from remote sensing images (Figure 8). The estimated river width results are satisfactory. The R², RMSE, and MBE results are 0.991, 7.455 m, and −0.232 m, respectively (Figure 8). The RMSE is calculated within one pixel, and the overall river width is underestimated. The results indicate that the estimated river widths of the single channel of the mainstream reaches are basically within 300 m.

4.3. River Width Distribution along the Mainstream of the Upper Yellow River

The machine learning method was used to extract the rivers in the upper Yellow River from the Sentinel-2 images through the GEE platform. Excluding some small headwaters at the source of the Yellow River, the main stream of the upper Yellow River is completely extracted. Within a distance of 2000+ km from the source of the extracted main stream to the exit of Anningdu station (AND), 5 km was taken as the step length to extract the river widths along the main stream, excluding lakes and reservoirs. The results are shown in Figure 9.

Figure 9 shows the distribution characteristics of the estimated bankfull river width along the main stream of the Yellow River (Figure 9a), as well as the increase in the contributing area and discharge of each hydrological station (Figure 9b). The single-thread river reaches and multithread river reaches are staggered. There are four single-thread river reaches (gray area, Figure 9a) and three multithread river reaches (light blue area, Figure 9a), though all 12 hydrological stations are in single-thread reaches.

Based on the estimated bankfull river width along the main stream, the mean and standard deviation (SD) of the river widths of the two river types were calculated with single threads and multiple threads in different river reaches. The average river width of the single-thread river reaches shows a stronger linear relationship along the main stream, with an R² of 0.801 (Figure 9a). The contributing area and discharge also gradually increase along the mainstream, and there are positive correlations between the bankfull river width and the contributing area and discharge. The average river widths of the multithread river reaches vary greatly and have no obvious regularity.

The estimated river widths, contributing areas, and bankfull discharges of the hydrological stations were selected to study their relationships in two situations: (1) single-thread river reaches, excluding the hydrological stations affected by reservoirs (Figure 10a,b); and (2) all river reaches, excluding the hydrological stations affected by reservoirs (Figure 10c,d).

Figure 10b shows a regression curve between the estimated river widths and the bankfull discharges of the single-thread river reaches, representing a good downstream hydraulic geometry relationship, with an R² of 0.782. Since there is a positive correlation between the contributing areas and the bankfull discharges, a regression curve between the contributing area and river width is created, which is in the form of a power law, as shown in Figure 10a, and the R² is 0.630. In addition, the relationships between the river widths and the contributing areas and bankfull discharges of all hydrological stations less affected by reservoirs and human activities were determined, as shown in Figure 10c,d, with R² values of 0.462 and 0.662, respectively, which are slightly lower than those of the single-thread river reaches. The R² of river width versus bankfull discharge is greater than the R² of the river width versus contributing area relationship. This finding confirms the results of Wilkerson et al. [59], who stated that using the contributing area alone does not yield a reliable river width versus contributing area relationship.

5. Discussions

In this study, the main factors that affected the accuracy and completeness of the river extraction are summarized as follows. First, the spatial resolution of the image directly determines the effective river width that can be measured. The effective width of the extracted rivers is 3 times greater than the image resolution. For small and narrow rivers, higher-resolution images must be used. The second factor is the quality of the satellite imagery. The cloud and shadow cover on the image directly affect the effective extraction of rivers. In the future, more effective algorithms must be developed for cloud and shadow removal. Third, the suspended sediment concentration of rivers and the water quality environment should be considered. The river’s suspended sediment concentration is too high, and the water quality environment is complex, making the difference between the river’s spectral and textural characteristics and the background small, and the extraction results are too noisy. Suspended sediment concentration and water quality are important factors affecting the extraction results.

In addition, river width distribution under bankfull discharge was impossible to obtain from the sparsely distributed hydrological stations, otherwise a remote sensing technique needs to be used. The study of the morphologies of alternatively distributed single-thread and multithread rivers can benefit from the extracted bankfull river widths based on the method developed in this study. Results indicate that the estimated river widths can be used as substitutes for the in situ-measured river widths when analyzing downstream hydraulic geometry. The good downstream hydraulic geometry relationship shows the connection of channel geometry of single-thread river reaches along the river course, though they are separated by multithread reaches. Both the geological and geomorphic background and inflow water and sediment contribute to the formation of river morphology. Results of this study indicate that inflow water and sediment contribute more to the morphology shaping of single-thread river reaches. By understanding the variation in bankfull river widths along river reaches and across river networks, the effects of discharge and sediment on channel geometry can be predicted.

6. Conclusions

This work developed a method of extracting bankfull river widths on small rivers (width < 90 m) in mountainous areas based on remote sensing images and DEM, and preliminary explored the downstream hydraulic geometries of the main stream of the upper Yellow River. The main conclusions of this study are described as follows:

(1): The ML method exhibits good performance in the extraction of rivers in the upper Yellow River, and the extraction integrity can reach order 3 and above for the DEM drainage network. The mean overall accuracy of three subregions was above 0.87, and their mean kappa values were all above 0.75. The estimated R², RMSE, and MBE of the bankfull river width are 0.991, 7.455 m, and −0.232 m, respectively.
(2): Bankfull river widths of the mainstream were extracted with a step length of 5 km from the source to the exit. The average river widths of the single-thread sections showed a good linear relationship, with an R² value reaching 0.801. There are good power relationships between the river width and the bankfull discharge and contributing area, with R² values of 0.782 and 0.630, respectively.
(3): The effective connected river width was 30 m, which was 3 times the image resolution. The research results could enrich the river channel width database of the upper Yellow River and provide basic data for applications in hydrology, fluvial geomorphology, and stream ecology.
(4): The high spatial resolution of the bankfull river width dataset can be used to (1) compensate for the missing river width data between two traditional hydrological stations, and further analyze the channel geometries of alternatively distributed single-thread and multithread rivers; (2) analyze downstream hydraulic geometry and estimate bankfull discharge in river sections without hydrological data; (3) provide additional boundary conditions for distributed hydrological models to improve the simulation accuracy; and (4) quantify water carbon emissions [60].

The extraction of rivers below 30 m is relatively poor using the 10-m remote sensing images. In addition, due to the limitation of hydrological data and remote sensing images under bankfull discharge, only 22 hydrological stations were used in this study, and the research results had certain limitations. In the future, radar data will be combined with optical images with resolutions of 2 m or more to explore the automatic extraction of mountainous rivers with widths less than 30 m under complex terrains, to obtain the river width parameters of the whole river network. Simultaneously, the nature of the water body and the influence of substances in the water combined with the hydrological model will be considered to improve the accuracy of water body extraction in complex environments.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/rs13142650/s1, Table S1: Basic information of the cross sections located in the upper Yellow River Basin.

Author Contributions

Conceptualization, D.L. and B.W.; data curation, D.L., G.W. and C.Q.; funding acquisition, B.W.; investigation, G.W. and C.Q.; methodology, D.L.; software, D.L.; supervision, D.L. and B.W.; validation, G.W.; visualization, C.Q.; writing—review and editing, C.Q. and B.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of China, grant number 51639005 and 52009061, the National Key R&D Program of China, grant number 2017YFC0405202, and the Postdoctoral Innovation Talents Support Program of China, grant number BX20190177. The APC was funded by the Natural Science Foundation of China, grant number 51639005.

Data Availability Statement

Data provided by the Bureau of Hydrology at the Ministry of Water Resources of China were in the form of hard copy but not electronic copy, therefore, no link (URL or DOI) can be presented here. The other data and extraction results are available on request from the corresponding author.

Acknowledgments

The authors would like to thank Bowei Chen from the Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences for his guidance and help in compiling and modifying the program. And to acknowledge the Bureau of Hydrology at the Ministry of Water Resources of China for providing the in-situ measured hydrological data.

Conflicts of Interest

The authors declare no conflict of interest.

References

Oki, T.; Kanae, S.; Musiake, K. Global Hydrological Cycle and World Water Resources. Science 2006, 313, 1068–1072. [Google Scholar] [CrossRef] [Green Version]
Feng, D.; Gleason, C.J.; Yang, X.; Pavelsky, T.M. Comparing Discharge Estimates Made via the BAM Algorithm in High-Order Arctic Rivers Derived Solely from Optical CubeSat, Landsat, and Sentinel-2 Data. Water Resour. Res. 2019, 55, 7753–7771. [Google Scholar] [CrossRef]
Agouridis, C. Bankfull frequency in rivers. In Handbook of Engineering Hydrology, Modeling, Climate Change, and Variability; CRC Press: Boca Raton, FL, USA, 2014. [Google Scholar]
Copeland, R.R.; Biedenharn, D.S.; Fischenich, J.C. Channel-Forming Discharge; Army Corps of Engineers: Washington, DC, USA, 2000. [Google Scholar]
Faustini, J.M.; Kaufmann, P.R.; Herlihy, A.T. Downstream variation in bankfull width of wadeable streams across the conterminous United States. Geomorphology 2009, 108, 292–311. [Google Scholar] [CrossRef] [Green Version]
Jupp, D.L.B.; Kirk, J.T.O.; Harris, G.P. Detection, identification and mapping of cyanobacteria—Using remote sensing to measure the optical quality of turbid inland waters. Mar. Freshw. Res. 1994, 45, 801–828. [Google Scholar] [CrossRef]
Hu, D.Y.; Li, J.; Chen, Y.H.; Jiang, W.G. Water and settlement area extraction from single-band, single-polarization SAR images based on SVM method. J. Image Graph. 2008, 13, 257–263. (In Chinese) [Google Scholar]
Feng, L.; Hu, C.; Chen, X.; Cai, X.; Tian, L.; Gan, W. Assessment of inundation changes of Poyang Lake using MODIS observations between 2000 and 2010. Remote Sens. Environ. 2012, 121, 80–92. [Google Scholar] [CrossRef]
Feyisa, G.L.; Meilby, H.; Fensholt, R.; Proud, S.R. Automated Water Extraction Index: A new technique for surface water mapping using Landsat imagery. Remote Sens. Environ. 2014, 140, 23–35. [Google Scholar] [CrossRef]
Durand, M.; Gleason, C.; Garambols, P.; Bjerklie, D.; Smith, L.; Roux, H.; Rodriguez, E.; Bates, P.; Pavelsky, T.; Monnier, J. An inter comparison of remote sensing river discharge estimation algorithms from measurements of river height, width, and slope. Water Resour. Res. 2016, 52, 4527–4549. [Google Scholar] [CrossRef] [Green Version]
Yang, X.; Qin, Q.; Grussenmeyer, P.; Koehl, M. Urban surface water body detection with suppressed built-up noise based on water indices from Sentinel-2 MSI imagery. Remote Sens. Environ. 2018, 219, 259–270. [Google Scholar] [CrossRef]
Cui, X.; Guo, X.; Wang, Y.; Wang, X.; Zhu, W.; Shi, J.; Lin, C.; Gao, X. Application of remote sensing to water environmental processes under a changing climate. J. Hydrol. 2019, 574, 892–902. [Google Scholar] [CrossRef]
Li, D.; Wu, B.; Chen, B.; Xue, Y.; Zhang, Y. Review of water body information extraction based on satellite remote sensing. J. Tsinghua Univ. 2020, 60, 147–161. (In Chinese) [Google Scholar]
Xu, H.Q. Modification of normalized difference water index (NDWI) to enhance open water features in remotely sensed imagery. Int. J. Remote Sens. 2006, 27, 3025–3033. (In Chinese) [Google Scholar] [CrossRef]
Pekel, J.-F.; Cottam, A.; Gorelick, N.; Belward, A.S. High-resolution mapping of global surface water and its long-term changes. Nature 2016, 540, 418–422. [Google Scholar] [CrossRef]
Yang, X.; Pavelsky, T.M.; Allen, G.; Donchyts, G. RivWidthCloud: An Automated Google Earth Engine Algorithm for River Width Extraction from Remotely Sensed Imagery. IEEE Geosci. Remote Sens. Lett. 2019, 17, 217–221. [Google Scholar] [CrossRef]
Santoro, M.; Wegmüller, U.; Lamarche, C.; Bontemps, S.; Defourny, P.; Arino, O. Strengths and weaknesses of multi-year Envisat ASAR backscatter measurements to map permanent open water bodies at global scale. Remote Sens. Environ. 2015, 171, 185–201. [Google Scholar] [CrossRef]
Zou, Z.; Xiao, X.; Dong, J.; Qin, Y.; Doughty, R.B.; Menarguez, M.A.; Zhang, G.; Wang, J. Divergent trends of open-surface water body area in the contiguous United States from 1984 to 2016. Proc. Natl. Acad. Sci. USA 2018, 115, 3810–3815. [Google Scholar] [CrossRef] [Green Version]
Deng, Y.; Jiang, W.; Tang, Z.; Ling, Z.; Wu, Z. Long-Term Changes of Open-Surface Water Bodies in the Yangtze River Basin Based on the Google Earth Engine Cloud Platform. Remote Sens. 2019, 11, 2213. [Google Scholar] [CrossRef] [Green Version]
Li, D.; Wu, B.; Chen, B.; Qin, C.; Wang, Y.; Zhang, Y.; Xue, Y. Open-Surface River Extraction Based on Sentinel-2 MSI Imagery and DEM Data: Case Study of the Upper Yellow River. Remote Sens. 2020, 12, 2737. [Google Scholar] [CrossRef]
Wang, X.; Xiao, X.; Zou, Z.; Chen, B.; Ma, J.; Dong, J.; Doughty, R.B.; Zhong, Q.; Qin, Y.; Dai, S.; et al. Tracking annual changes of coastal tidal flats in China during 1986–2016 through analyses of Landsat images with Google Earth Engine. Remote Sens. Environ. 2020, 238, 110987. [Google Scholar] [CrossRef]
Chen, F.; Zhang, M.; Tian, B.; Li, Z. Extraction of Glacial Lake Outlines in Tibet Plateau Using Landsat 8 Imagery and Google Earth Engine. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 4002–4009. [Google Scholar] [CrossRef]
Richard, J.B.; Richard, D.W.; Trevor, B.H.; Brian, B.; Octria, A.P. Applications of Google Earth Engine in fluvial geomorphology for detecting river channel change. WIREs Water 2020, 8, e21496. [Google Scholar]
Ji, Q.; Liang, W.; Fu, B.; Zhang, W.; Yan, J.; Lv, Y.; Yue, C.; Jin, Z.; Lan, Z.; Li, S.; et al. Mapping land use/cover dynamics of the Yellow River basin from 1986 to 2018 supported by Google Earth Enfine. Remote Sens. 2021, 13, 1299. [Google Scholar] [CrossRef]
Allen, G.H.; Pavelsky, T.M. Patterns of river width and surface area revealed by the satellite-derived North American River Width data set. Geophys. Res. Lett. 2015, 42, 395–402. [Google Scholar] [CrossRef]
Allen, G.H.; Pavelsky, T.M. Global extent of rivers and streams. Science 2018, 361, 585–588. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Gleason, C.J.; Smith, L.C.; Lee, J. Retrieval of river discharge solely from satellite imagery and at-many-stations hydraulic geometry: Sensitivity to river form and optimization parameters. Water Resour. Res. 2014, 50, 9604–9619. [Google Scholar] [CrossRef]
Navratil, O.; Albert, M.-B.; Herouin, E.; Gresillon, J.-M. Determination of bankfull discharge magnitude and frequency: Comparison of methods on 16 gravel-bed river reaches. Earth Surf. Process. Landf. 2006, 31, 1345–1363. [Google Scholar] [CrossRef]
Yamazaki, D.; O’Loughlin, F.; Trigg, M.A.; Miller, Z.F.; Pavelsky, T.M.; Bates, P.D. Development of the Global Width Database for Large Rivers. Water Resour. Res. 2014, 50, 3467–3480. [Google Scholar] [CrossRef]
Pavelsky, T.M. Using width-based rating curves from spatially discontinuous satellite imagery to monitor river discharge. Hydrol. Process. 2014, 28, 3035–3040. [Google Scholar] [CrossRef]
Chapman, D. (Ed.) Water Quality Assessments: A Guide to the Use of Biota, Sediments and Water in Environmental Monitoring, 2nd ed.; CRC Press: Boca Raton, FL, USA, 1996. [Google Scholar]
Alsdorf, D.E.; Lettenmaier, D.P. Tracking fresh water from space. Science 2003, 301, 1491–1494. [Google Scholar] [CrossRef] [Green Version]
Sulistioadi, Y.B.; Tseng, K.-H.; Shum, C.K.; Hidayat, H.; Sumaryono, M.; Suhardiman, A.; Setiawan, F.; Sunarso, S. Satellite radar altimetry for monitoring small rivers and lakes in Indonesia. Hydrol. Earth Syst. Sci. 2015, 19, 341–359. [Google Scholar] [CrossRef] [Green Version]
Huang, Q.; Long, D.; Du, M.; Zeng, C.; Qiao, G.; Li, X.; Hou, A.; Hong, Y. Discharge estimation in high-mountain regions with improved methods using multisource remote sensing: A case study of the Upper Brahmaputra River. Remote Sens. Environ. 2018, 219, 115–134. [Google Scholar] [CrossRef]
Kebede, M.G.; Wang, L.; Li, X.; Hu, Z. Remote sensing-based river discharge estimation for a small river flowing over the high mountain regions of the Tibetan Plateau. Int. J. Remote Sens. 2019, 41, 3322–3345. [Google Scholar] [CrossRef]
Wang, L.; Sichangi, A.W.; Zeng, T.; Li, X.; Hu, Z.; Genanu, M. New methods designed to estimate the daily discharges of rivers in the Tibetan Plateau. Sci. Bull. 2019, 64, 418–421. [Google Scholar] [CrossRef] [Green Version]
Yellow River Water Resources Commission. Annual Hydrological Report: Upper Yellow River Basin (Hard Copy); Department of Hydrology, China Ministry of Water Resources (MWR): Beijing, China, 2017. (In Chinese)
Qin, C.; Wu, B.; Wang, G.; Wang, G. Spatial distributions of at-many-stations hydraulic geometry for mountain rivers originated from the Qinghai-Tibet Plateau. Water Resour. Res. 2021, 57, e2020WR029090. [Google Scholar] [CrossRef]
Qin, C.; Wu, B.; Wang, Y.; Fu, X.; Xue, Y.; Li, D.; Li, M.; Zhang, Y. Dynamic variability of at-a-station hydraulic-geometry for mountain rivers in the southeast Qinghai-Tibet Plateau: The cases of Yalong River and upper Jinsha River. Catena 2020, 194, 104723. [Google Scholar] [CrossRef]
Bai, R.; Li, T.; Huang, Y.; Li, J.; Wang, G. An efficient and comprehensive method for drainage network extraction from DEM with billions of pixels using a size-balanced binary search tree. Geomorphology 2015, 238, 56–67. [Google Scholar] [CrossRef]
Wu, T.; Li, J.; Li, T.; Sivakumar, B.; Zhang, G.; Wang, G. High-efficient extraction of drainage networks from digital elevation models constrained by enhanced flow enforcement from known river maps. Geomorphology 2019, 340, 184–201. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
Haralick, R.M.; Shanmugam, K.; Dinstein, I. Textural Features for Image Classification. IEEE Trans. Syst. Man Cybern. 1973, SMC-3, 610–621. [Google Scholar] [CrossRef] [Green Version]
Conners, R.W.; Trivedi, M.M.; Harlow, C.A. Segmentation of a high-resolution urban scene using texture operators. Comput. Vis. Graph. Image Process. 1984, 25, 273–310. [Google Scholar] [CrossRef]
McFeeters, S.K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
Xu, H.Q. A study on information extraction of water body with the modified normalized difference water index (MNDWI). J. Remote Sens. 2005, 9, 589–595. (In Chinese) [Google Scholar]
Ouma, Y.O.; Tateishi, R. A water index for rapid mapping of shoreline changes of five East African Rift Valley lakes: An empirical analysis using Landsat TM and ETM+ data. Int. J. Remote Sens. 2006, 27, 3153–3181. [Google Scholar] [CrossRef]
Yan, P.; Zhang, Y.; Zhang, Y. A study on information extraction of water system in semi-arid regions with the enhanced water index (EWI) and GIS based noise remove techniques. Remote Sens. Inf. 2007, 6, 62–67. (In Chinese) [Google Scholar]
Fisher, A.; Flood, N.; Danaher, T. Comparing Landsat water index methods for automated water classification in eastern Australia. Remote Sens. Environ. 2016, 175, 167–182. [Google Scholar] [CrossRef]
Cao, R.L.; Li, C.J.; Liu, L.Y.; Wang, J.H.; Yan, G.J. Extracting Miyun reservoir’s water area and monitoring its change based on a revised normalized different water index. Sci. Surv. Mapp. 2008, 33, 158–160. [Google Scholar]
Chen, W.Q.; Ding, J.L.; Li, Y.H.; Niu, Z.Y. Extraction of water information based on China-made GF-1 remote sense image. Resour. Sci. 2015, 37, 1166–1172. [Google Scholar]
Liu, S.T. Research on Water Body Extraction Method of Domestic Gao Fen Series Remote Sensing Images. Master’s Thesis, Lanzhou Jiaotong University, Lanzhou, China, 2019. [Google Scholar]
Zou, C.; Yang, X.; Dong, Z.; Wang, D. A fast water information extraction method based on GF-2 remote sensing image. J. Graph. 2019, 40, 99–104. [Google Scholar]
Ding, F. A new method for fast information extraction of water bodies using remotely sensed data. Remote Sens. Technol. Appl. 2009, 24, 167–171. [Google Scholar]
Zha, Y.; Ni, S.; Yang, S. An effective approach to automatically extract urban land-use from TM imagery. J. Remote Sens. 2003, 7, 37–40. [Google Scholar]
Rouse, J.W., Jr.; Haas, R.H.; Schell, J.A.; Deering, D.W. Monitoring vegetation systems in the great plains with ERTS. In Proceedings of the Third ERTS Symposium, Washington, DC, USA, 10–14 December 1973; NASA: Washington, DC, USA, 1974; pp. 309–317. [Google Scholar]
Lucas, I.F.; Frans, J.M. Accuracy assessment of satellite derived land-cover data: A review. Photogramm. Eng. Remote Sens. 1994, 60, 410–432. [Google Scholar]
Landis, J.R.; Koch, G.G. The Measurement of Observer Agreement for Categorical Data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wilkerson, G.V.; Kandel, D.R.; Perg, L.A.; Dietrich, W.E.; Wilcock, P.R.; Whiles, M.R. Continental-scale relationship between bankfull width and drainage area for single-thread alluvial channels. Water Resour. Res. 2014, 50, 919–936. [Google Scholar] [CrossRef]
Karlsson, J.; Serikova, S.; Vorobyev, S.N.; Rocher-Ros, G.; Denfeld, B.; Pokrovsky, O.S. Carbon emission from Western Siberian inland waters. Nat. Commun. 2021, 12, 825. [Google Scholar] [CrossRef] [PubMed]

Figure 1. The technical flowchart of river extraction under bankfull discharge.

Figure 2. Geographical location of the study area and distribution of the hydrological stations and training samples. Subregions are marked as “First”, “Second”, and “Third”. Hydrological stations are marked with yellow triangles. Water samples are marked with blue crosses, and nonwater samples are marked with green crosses.

Figure 3. Cross-section morphology of the year 2015 (a) and the water level-discharge rating curve (b) of a typical cross section at the Mentang hydrological station.

Figure 4. The ratios of the daily discharges to bankfull discharges.

Figure 5. Ranking of the importance of the feature variables in three subregions. (a–c) ranking of MDA for the first, second, and third region; (d–f) ranking of MDG for the first, second, and third region.

Figure 6. Comparisons between the extracted connected rivers with a width over 30 m and river networks above order 3 that were generated from a 90-m resolution DEM.

Figure 7. Comparisons between all the extracted rivers and drainage networks above order 2 that were generated from a 90-m resolution DEM.

Figure 8. Comparison between the estimated river width and in situ-measured data.

Figure 9. Distribution characteristics of the (a) bankfull river widths and (b) contributing area and bankfull discharge along the main stream of the upper Yellow River. BRW—Bankfull river width.

Figure 10. Regression relationships between the estimated river widths and the contributing areas and bankfull discharges. (a,b) The regressions of single-thread river reaches, excluding the hydrological stations affected by reservoirs; (c,d) the regressions of all river reaches, excluding the hydrological stations affected by reservoirs.

Table 1. Underlying surface types and water suspended sediment concentrations of the three subregions.

Regions	Elevation (m)	NHS ¹	LUCC2015 ²	SSC ³ (kg/m³)	CSC ⁴
First	2177–6295	4	32, 33, 61, 62, 63, 65, 66, 67	0.046–0.556	<20%
Second	1344–4453	4	12, 32, 33, 51, 52, 65	0.199–1.390	<5%
Third	1629–5334	14	12, 22, 31, 32, 33, 52, 64, 67	0.055–0.818	<5%

¹ NHS—Number of hydrological stations; ² LUCC—Land-use cover change; ³ SSC—Suspended sediment concentration; ⁴ CSC—Cloud and snow coverage; 12—Dryland, 22—Shrubland, 31/32/33—High/moderate/low coverage grassland, 51—Urban land, 52—Rural residential land, 61—Sandy land, 62—Gobi land, 63—Saline and alkaline land, 64—Wetland, 65—Bare land, 66—Bare rock land, and 67—Other unused land.

Table 2. Bankfull discharges, river widths, and flood frequencies of 22 hydrological stations.

Type	Station Name	Bankfull Discharge (m³/s)	Bankfull River Width (m)	Flood Frequency (%)	Method
Mainstream	Huangheyan4 (HHY4)	54.5	87.5	48.8	CSM ¹
	Jimai4 (JM4)	419.0	149	87.0	CSM
	Mentang (MT)	592.0	143	87.0	CSM
	Maqu2 (MQ2)	1098.0	269.8	95.2	Flood frequency
	Jungong (JG)	1208.5	179	95.2	Flood frequency
	Tangnaihai (TNH)	1385.2	150.5	96.2	Flood frequency
	Guide2 (GD2)	1460.4	200	84.0	Flood frequency
	Xunhua3 (XH3)	1680.0	127.5	74.1	CSM
	Xiaochuan (XC)	1660.0	146	83.3	CSM
	Shangquan6 (SQ6)	1510.0	231.4	87.0	Flood frequency
	Lanzhou (LZ)	1880.0	206	87.0	CSM
	Anningdu (AND)	1720.0	162.5	87.0	Flood frequency
Tributary	Qingshui (QS)	22.0	29.1	49.5	CSM
	Jiuzhi (JZ)	54.7	46.6	89.6	CSM
	Huangyuan (HY)	48.9	34	38.8	CSM
	Shuangcheng (SC)	79.5	42.1	78.9	CSM
	Luqu (LQ)	155.7	38.9	20.6	Flood frequency
	Tangke (TK)	168.2	242	91.6	Flood frequency
	Minxian4 (MX4)	408.4	171.2	50.9	Flood frequency
	Xining (XN)	161.1	30.5	42.7	Flood frequency
	Minhe3 (MH3)	264.7	34.8	42.7	Flood frequency
	Tongren (TR)	98.3	34.4	69.9	Flood frequency

¹ CSM—Cross-section morphology.

Table 3. Spectral indices of the Sentinel-2 images.

Spectral Indices	Formula	Reference
Normalized Difference Water Index	NDWI = (B3 − B8)/(B3 + B8)	[45]
Modified Normalized Difference Water Index	MNDWI = (B3 − B11)/(B3 + B11)	[46]
Normalized Difference Water Index 3	NDWI3 = (B8 − B11)/(B8 + B11)	[47]
Automated Water Extraction Index	① AWEIsh = B2 + 2.5 × B3 − 1.5 × (B8 + B12) − 0.25 × B11	[9]
Automated Water Extraction Index	② AWEInsh = 4 × (B3 − B12) − (0.25 × B8 + 2.75 × B11)	[9]
Enhanced Water Index	EWI = (B3 − B8 − B12)/(B3 + B8 + B12)	[48]
Water Index 2015	WI2015 = 1.7204 + 171 × B3 + 3 × B4 − 70 × B8 − 45 × B11 − 71 × B12	[49]
Revised Normalized Difference Water Index	RNDWI = (B12 − B4)/(B12 + B4)	[50]
Shadow Water Index	SWI = B2 + B3 − B8	[51]
Enhanced Shadow Water Index	ESWI = (B2 + B3)/(B8 + B8)	[52]
New Comprehensive Water Index	NCWI = (7 × B3 − 2 × B2 − 5 × B8)/(7 × B3 + 2 × B2 + 5 × B8)	[53]
New Water Index	NWI = ((B2 − (B8 + B11 + B12))/(B2 + (B8 + B11 + B12))) × 100	[54]
Normalized Difference Building-up Index	NDBI = (B12 − B8)/(B12 + B8)	[55]
Normalized Difference Vegetation Index	NDVI = (B8 − B4)/(B8 + B4)	[56]
Green Normalized Difference Vegetation Index	GNDVI = (B8 − B3)/(B8 + B3)	https://www.indexdatabase.de/, accessed on 21 February 2021
Ratio Vegetation Index	RVI = B8/B4
Enhanced Vegetation Index	EVI = 2.5 × (B8−B4)/((B8 + 6.0 × B4−7.5 × B2) + 1.0)
Difference Vegetation Index	DVI = B8 − B4
Green Difference Vegetation Index	GDVI = B8 − B3
Weighted Difference Vegetation Index	WDVI = B8 − 0.460 × B4
Renormalized Difference Vegetation Index	RDVI = (B8 − B4)/(B8 + B4) × 0.5
Pan Normalized Difference Vegetation Index	PNDVI = (B8 − (B3 + B4 + B2))/(B8 + (B3 + B4 + B2))
Red-Blue Normalized Difference Vegetation Index	RBNDVI = (B8 − (B4 + B2))/(B8 + (B4 + B2))
Blue-Normalized Difference Vegetation Index	BNDVI = (B8 − B2)/(B8 + B2)
Blue-Wide Dynamic Range Vegetation Index	BWDRVI = (0.1 × B8 − B2)/(0.1 × B8 + B2)
Simple Ratio Red/NIR Ratio Vegetation-Index	SRRed_NIR = B4/B8
Simple Ratio MIR/Red Eisenhydroxid-Index	SRMIR_Red = B12/B4
Soil-Adjusted Vegetation Index mir	SAVImir = (B8 − B12) × (1.0 + 0.401)/(B8 + B12 + 0.401)
Adjusted Transformed Soil-Adjusted Vegetation Index	ATSAVI = 1.22 × (B8 − 1.22 × B4 − 0.03)/(1.22 × B8 + B4 − 1.22 × 0.03 + 0.08 × (1.0 + 1.22 × 2.0))
Transformed Soil Adjusted Vegetation Index	TSAVI = (0.743 × (B8 − 0.743 × B4 − 0.323))/(B4 + 0.743 × (B8 − 0.323) + 0.413 × (1.0 + 0.743 × 2.0))
	PRWI = (B3 + B8)/(B3 − B8)
Soil Composition Index	SCI = (B11 − B8)/(B11 + B8)
Ratio Drought Index	RDI = B12/B8
Moisture Stress Index 2	MSI2 = B11/B8
Tasselled Cap-wetness	WET = 0.1509 × B2 + 0.1973 × B3 + 0.3279 × B4 + 0.3406 × B8−0.7112 × B11−0.4572 × B12
Normalized Burn Ratio	NBR = (B8 − B12)/(B8 + B12)
Simple Ratio 520/670	SR520_670 = B2/B4
Simple Ratio 550/800	SR550_800 = B3/B8
Simple Ratio 800/2170	SR800_2170 = B8/B12
Simple Ratio 800/550	SR800_550 = B8/B3
Simple Ratio 833/1649 MSIhyper	SR833_1649 = B8/B11
Difference 678/500	D678_500 = B4 − B2
Visible Atmospherically Resistant Index Green	VARIgreen = (B3 − B4)/(B3 + B4 − B2)
Iron Oxide	IO = B4/B2
Ferric iron, Fe²⁺	Fe2 = B12/B8 + B3/B4
Ferric iron, Fe³⁺	Fe3 = B4/B3
Shape Index	IF = (2.0 × B4 − B3 − B2)/(B3 − B2)
Coloration Index	CI = (B4 − B2)/B4
Redness Index	RI = (B4 − B3)/(B4 + B3)
Color Rendering Index 550	CRI550 = B2 × (−1.0) − B3 × (−1.0)
Alteration	Alteration = B11/B12

B2—Blue; B3—Green; B4—Red; B8—NIR; B11—MIR; B12—SWIR.

Table 4. The results of the 3-fold cross-validation.

Region ID	OA Min	OA Max	OA Mean	OA SD	Kappa Min	Kappa Max	Kappa Mean	Kappa SD
First	0.8654	0.8942	0.8788	0.0101	0.7292	0.7823	0.7530	0.0192
Second	0.8559	0.928	0.8993	0.0245	0.7095	0.8554	0.7978	0.0496
Third	0.8895	0.9379	0.8998	0.0189	0.7791	0.8749	0.8049	0.0292

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, D.; Wang, G.; Qin, C.; Wu, B. River Extraction under Bankfull Discharge Conditions Based on Sentinel-2 Imagery and DEM Data. Remote Sens. 2021, 13, 2650. https://doi.org/10.3390/rs13142650

AMA Style

Li D, Wang G, Qin C, Wu B. River Extraction under Bankfull Discharge Conditions Based on Sentinel-2 Imagery and DEM Data. Remote Sensing. 2021; 13(14):2650. https://doi.org/10.3390/rs13142650

Chicago/Turabian Style

Li, Dan, Ge Wang, Chao Qin, and Baosheng Wu. 2021. "River Extraction under Bankfull Discharge Conditions Based on Sentinel-2 Imagery and DEM Data" Remote Sensing 13, no. 14: 2650. https://doi.org/10.3390/rs13142650

APA Style

Li, D., Wang, G., Qin, C., & Wu, B. (2021). River Extraction under Bankfull Discharge Conditions Based on Sentinel-2 Imagery and DEM Data. Remote Sensing, 13(14), 2650. https://doi.org/10.3390/rs13142650

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu