Search Results (55)

Search Parameters:
Keywords = PA-UNet

21 pages, 10271 KiB  
Article
HSP-UNet: An Accuracy and Efficient Segmentation Method for Carbon Traces of Surface Discharge in the Oil-Immersed Transformer
by Hongxin Ji, Xinghua Liu, Peilin Han, Liqing Liu and Chun He
Sensors 2024, 24(19), 6498; https://doi.org/10.3390/s24196498 - 9 Oct 2024
Viewed by 347
Abstract
Restricted by a metal-enclosed structure, the internal defects of large transformers are difficult to visually detect. In this paper, a micro-robot is used to visually inspect the interior of a transformer. For the micro-robot to successfully detect the discharge level and insulation degradation trend in the transformer, it is essential to segment the carbon trace accurately and rapidly from the complex background. However, the complex edge features and significant size differences of carbon traces pose a serious challenge for accurate segmentation. To this end, we propose the Hadamard product-Spatial coordinate attention-PixelShuffle UNet (HSP-UNet), an architecture specifically designed for carbon trace segmentation. To address the pixel over-concentration and weak contrast of carbon trace images, the Adaptive Histogram Equalization (AHE) algorithm is used for image enhancement. To fuse carbon trace features at different scales effectively and reduce model complexity, a novel grouped Hadamard Product Attention (HPA) module is designed to replace the original convolution module of the UNet. Meanwhile, to improve the activation intensity and segmentation completeness of carbon traces, the Spatial Coordinate Attention (SCA) mechanism is designed to replace the original skip connection. Furthermore, the PixelShuffle up-sampling module is used to improve the parsing of complex boundaries. Compared with UNet, UNet++, UNeXt, MALUNet, and EGE-UNet, HSP-UNet outperformed all of these state-of-the-art methods on both carbon trace datasets. For dendritic carbon traces, HSP-UNet improved the Mean Intersection over Union (MIoU), Pixel Accuracy (PA), and Class Pixel Accuracy (CPA) of the benchmark UNet by 2.13, 1.24, and 4.68 percentage points, respectively. For clustered carbon traces, HSP-UNet improved MIoU, PA, and CPA by 0.98, 0.65, and 0.83 percentage points, respectively. The validation results also showed that HSP-UNet is highly lightweight, with only 0.061 M parameters and 0.066 GFLOPs. This study could contribute to the accurate segmentation of discharge carbon traces and the assessment of the insulation condition of oil-immersed transformers.
(This article belongs to the Section Sensors and Robotics)
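The PixelShuffle up-sampling credited above with better parsing of complex boundaries rearranges channels into spatial resolution rather than interpolating. A minimal PyTorch sketch of such an up-sampling block follows; the module name and channel sizes are illustrative assumptions, not the authors' configuration.

```python
import torch
import torch.nn as nn

class PixelShuffleUp(nn.Module):
    """Up-sample by rearranging channels into space (sub-pixel convolution)."""
    def __init__(self, in_channels: int, out_channels: int, scale: int = 2):
        super().__init__()
        # 1x1 conv expands channels so PixelShuffle can trade them for resolution.
        self.expand = nn.Conv2d(in_channels, out_channels * scale ** 2, kernel_size=1)
        self.shuffle = nn.PixelShuffle(scale)  # (C*r^2, H, W) -> (C, H*r, W*r)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.shuffle(self.expand(x))

# Example: double the spatial resolution of a 64-channel feature map.
feat = torch.randn(1, 64, 32, 32)
up = PixelShuffleUp(in_channels=64, out_channels=32, scale=2)
print(up(feat).shape)  # torch.Size([1, 32, 64, 64])
```

Because each output pixel is produced from learned channel weights rather than fixed interpolation, sub-pixel up-sampling tends to preserve sharper boundaries than bilinear resizing.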
Figure 1. Surface discharge and carbon traces of different parts inside the transformer.
Figure 2. Significant contrast, size differences, and complex edges of the samples.
Figure 3. The micro-robot for transformer internal inspection.
Figure 4. Test platform for carbon trace image acquisition.
Figure 5. Examples of two kinds of discharge carbon traces.
Figure 6. Comparison of carbon trace image with and without the AHE.
Figure 7. Network structure of the proposed HSP-UNet.
Figure 8. Structure of the grouped HPA module.
Figure 9. Structure of the CA module.
Figure 10. Structure of the SCA.
Figure 11. Segmentation comparison of the dendritic carbon traces.
Figure 12. Segmentation comparison of the clustered carbon traces.
Figure 13. Segmentation performance with samples in different light conditions.
Figure 14. Segmentation performance with samples of different sizes.
Figure 15. Grad-CAM comparison of the HSP-UNet ablation test.
19 pages, 7665 KiB  
Article
Chestnut Burr Segmentation for Yield Estimation Using UAV-Based Imagery and Deep Learning
by Gabriel A. Carneiro, Joaquim Santos, Joaquim J. Sousa, António Cunha and Luís Pádua
Drones 2024, 8(10), 541; https://doi.org/10.3390/drones8100541 - 1 Oct 2024
Viewed by 587
Abstract
Precision agriculture (PA) has advanced agricultural practices, offering new opportunities for crop management and yield optimization. The use of unmanned aerial vehicles (UAVs) in PA enables high-resolution data acquisition, which has been adopted across different agricultural sectors. However, its application for decision support in chestnut plantations remains under-represented. This study presents the initial development of a methodology for segmenting chestnut burrs from UAV-based imagery to estimate productivity from point cloud data. Deep learning (DL) architectures, including U-Net, LinkNet, and PSPNet, were employed for chestnut burr segmentation in UAV images captured at a 30 m flight height, with YOLOv8m trained for comparison. Two datasets were used to train and evaluate the models: one newly introduced in this study and an existing dataset. U-Net demonstrated the best performance, achieving an F1-score of 0.56 and a counting accuracy of 0.71 on the proposed dataset when trained on a combination of both datasets. The primary challenge encountered was that burrs often grow in clusters, leading to unified regions in segmentation, making object detection potentially more suitable for counting. Nevertheless, the results show that DL architectures can generate masks for point cloud segmentation, supporting precise chestnut tree production estimation in future studies.
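The reported F1-score can be computed directly from predicted and reference binary burr masks. A minimal NumPy sketch of the standard definition, assuming same-shaped 0/1 masks (the paper's exact evaluation protocol may differ):

```python
import numpy as np

def f1_score(pred: np.ndarray, target: np.ndarray, eps: float = 1e-7) -> float:
    """F1 (Dice) score between two binary masks of identical shape."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    tp = np.logical_and(pred, target).sum()
    fp = np.logical_and(pred, ~target).sum()
    fn = np.logical_and(~pred, target).sum()
    precision = tp / (tp + fp + eps)
    recall = tp / (tp + fn + eps)
    return float(2 * precision * recall / (precision + recall + eps))

# Toy example: a prediction that misses part of the annotated burr region.
target = np.zeros((8, 8), dtype=np.uint8); target[2:6, 2:6] = 1
pred = np.zeros_like(target); pred[2:6, 2:4] = 1
print(round(f1_score(pred, target), 3))  # 0.667
```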
Figure 1. Methodological pipeline for chestnut yield estimation from UAV imagery, including data acquisition and processing, imagery segmentation, and point cloud processing.
Figure 2. Example of a UAV image on the chestnut grove, and the image split into 48 patches.
Figure 3. Examples of masks obtained using the threshold approach. Each row represents a sample: the original image (a), the resulting mask (b), and the overlapping visualization (c). Threshold method applied to chestnut trees with phytosanitary issues (first row), and to healthy chestnut trees (second row).
Figure 4. Examples of the transformation applied to the proposed dataset to make it suitable for training object detection models. Red bounding boxes represent areas of chestnut burrs.
Figure 5. General overview of the architectures of the selected segmentation models (LinkNet, U-Net, and PSPNet).
Figure 6. Segmentation examples on Dataset 1 for each segmentation model trained on Dataset 1 and by merging both datasets.
Figure 7. Segmentation examples on Dataset 2 for each segmentation model trained on Dataset 2 and by merging both datasets.
Figure 8. Example of occluded chestnut burr (highlighted in the red box) that was not annotated in Dataset 2 and the segmentation results in the different models.
19 pages, 14422 KiB  
Article
YOLO-SegNet: A Method for Individual Street Tree Segmentation Based on the Improved YOLOv8 and the SegFormer Network
by Tingting Yang, Suyin Zhou, Aijun Xu, Junhua Ye and Jianxin Yin
Agriculture 2024, 14(9), 1620; https://doi.org/10.3390/agriculture14091620 - 15 Sep 2024
Viewed by 687
Abstract
In urban forest management, individual street tree segmentation is a fundamental and especially critical step for obtaining tree phenotypes. Most existing tree image segmentation models have been evaluated on smaller datasets and lack experimental verification on larger, publicly available datasets. Therefore, this paper, based on a large, publicly available urban street tree dataset, proposes YOLO-SegNet for individual street tree segmentation. In the first-stage street tree object detection task, the BiFormer attention mechanism was introduced into the YOLOv8 network to increase contextual information extraction and improve the ability of the network to detect multiscale and multishaped targets. In the second-stage street tree segmentation task, the SegFormer network was used to obtain street tree edge information more efficiently. The experimental results indicate that the proposed YOLO-SegNet method, which combines YOLOv8+BiFormer and SegFormer, achieved a 92.0% mean intersection over union (mIoU), 95.9% mean pixel accuracy (mPA), and 97.4% accuracy on a large, publicly available urban street tree dataset. Compared with the fully convolutional neural network (FCN), lite-reduced atrous spatial pyramid pooling (LR-ASPP), pyramid scene parsing network (PSPNet), UNet, DeepLabv3+, and HRNet, the mIoU of YOLO-SegNet increased by 10.5, 9.7, 5.0, 6.8, 4.5, and 2.7 percentage points, respectively. The proposed method can effectively support smart agroforestry development.
(This article belongs to the Section Digital Agriculture)
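The mIoU and mPA values quoted above follow from the per-class confusion matrix. A short NumPy sketch of the standard definitions (the averaging details in the paper may differ):

```python
import numpy as np

def confusion_matrix(pred: np.ndarray, target: np.ndarray, num_classes: int) -> np.ndarray:
    """Rows index the ground-truth class, columns the predicted class."""
    idx = target.astype(int) * num_classes + pred.astype(int)
    return np.bincount(idx.ravel(), minlength=num_classes ** 2).reshape(num_classes, num_classes)

def miou_mpa(cm: np.ndarray):
    tp = np.diag(cm).astype(float)
    iou = tp / (cm.sum(axis=0) + cm.sum(axis=1) - tp)   # per-class IoU
    pa = tp / cm.sum(axis=1)                            # per-class pixel accuracy
    return iou.mean(), pa.mean()

# Toy two-class example (background = 0, tree = 1).
target = np.array([[0, 0, 1, 1], [0, 1, 1, 1]])
pred   = np.array([[0, 1, 1, 1], [0, 1, 1, 0]])
cm = confusion_matrix(pred, target, num_classes=2)
print(miou_mpa(cm))  # (~0.583, ~0.733)
```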
Figure 1. (A) is the number distribution of street tree images; (B) is the street tree image annotation.
Figure 2. Examples of street tree object detection and instance segmentation annotated images for different tree species.
Figure 3. YOLO-SegNet model. The CBS is the basic module, including the Conv2d layer, BatchNorm2d layer, and Sigmoid Linear Unit (SiLU) layer. The function of the CBS module is to introduce a cross-stage partial connection to improve the feature expression ability and information transfer efficiency. The role of the Spatial Pyramid Pooling Fast (SPPF) module is to fuse larger-scale global information to improve the performance of object detection. The bottleneck block can reduce the computational complexity and the number of parameters.
Figure 4. (A) The overall architecture of BiFormer; (B) details of a BiFormer block.
Figure 5. (a) Vanilla attention. (b–d) Local window [40,42], axial stripe [39], and dilated window [41,42]. (e) Deformable attention [43]. (f) Bilevel routing attention, BRA [6].
Figure 6. Gathering key–value pairs in the top k related windows.
Figure 7. (A,B) are the loss function curves of the object detection network on the train and validation sets, respectively; (C,D) are the loss function curves of tree classification on the train and validation sets, respectively; (E–H) are the change curves of the four segmentation indicator values on the validation set, respectively.
Figure 8. (A) Thermal map examples of YOLOv8 series models and YOLOv8m+BiFormer in the training process; (B) example results of the different object detection models on the test set.
Figure 9. (A) The training loss function curves of the segmentation models without the object detection module. (B) The training loss function curves of the segmentation models with the object detection module.
Figure 10. Performance of different segmentation models on the validation and test sets: (A1,A2) the segmentation results on the validation set; (B1,B2) the segmentation results on the test set.
Figure 11. Results of the different segmentation models on the test set.
14 pages, 5108 KiB  
Article
Soldering Defect Segmentation Method for PCB on Improved UNet
by Zhongke Li and Xiaofang Liu
Appl. Sci. 2024, 14(16), 7370; https://doi.org/10.3390/app14167370 - 21 Aug 2024
Viewed by 408
Abstract
Despite being indispensable devices in the electronic manufacturing industry, printed circuit boards (PCBs) may develop various soldering defects in the production process, which seriously affect the product’s quality. Due to the substantial background interference in the soldering defect image and the small and irregular shapes of the defects, the accurate segmentation of soldering defects is a challenging task. To address this issue, a method to improve the encoder–decoder network structure of UNet is proposed for PCB soldering defect segmentation. To enhance the feature extraction capabilities of the encoder and focus more on deeper features, VGG16 is employed as the network encoder. Moreover, a hybrid attention module called the DHAM, which combines channel attention and dynamic spatial attention, is proposed to reduce the background interference in images and direct the model’s focus more toward defect areas. Additionally, based on GSConv, the RGSM is introduced and applied in the decoder to enhance the model’s feature fusion capabilities and improve the segmentation accuracy. The experiments demonstrate that the proposed method can effectively improve the segmentation accuracy for PCB soldering defects, achieving an mIoU of 81.74% and mPA of 87.33%, while maintaining a relatively low number of model parameters at only 22.13 M and achieving an FPS of 30.16, thus meeting the real-time detection speed requirements.
(This article belongs to the Special Issue Advances in Computer Vision and Semantic Segmentation, 2nd Edition)
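The hybrid channel-plus-spatial attention idea described above can be sketched with standard building blocks. The PyTorch module below is an illustrative CBAM-style stand-in, not the paper's DHAM:

```python
import torch
import torch.nn as nn

class HybridAttention(nn.Module):
    """Channel attention followed by spatial attention (CBAM-style sketch)."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.channel_mlp = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, 1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1),
            nn.Sigmoid(),
        )
        # 7x7 conv over channel-pooled maps gives a per-pixel spatial weight.
        self.spatial = nn.Sequential(nn.Conv2d(2, 1, kernel_size=7, padding=3), nn.Sigmoid())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = x * self.channel_mlp(x)                       # reweight channels
        pooled = torch.cat([x.mean(dim=1, keepdim=True),
                            x.max(dim=1, keepdim=True).values], dim=1)
        return x * self.spatial(pooled)                   # reweight spatial positions

x = torch.randn(2, 64, 32, 32)
print(HybridAttention(64)(x).shape)  # torch.Size([2, 64, 32, 32])
```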
Figure 1. The improved UNet network architecture.
Figure 2. The DHAM structure.
Figure 3. The GSConv structure.
Figure 4. The RGSM structure.
Figure 5. Schematic of soldering defects. The first row is the images without defects; the second row is the images with defects. (a) MS. (b) MP. (c) SS. (d) OD. (e) SB.
Figure 6. Segmentation results for each method. (a) Original figure. (b) Ground truth. (c) DeepLab3plus. (d) SegFormer-B1. (e) PSPNet-ResNet50. (f) UNet. (g) HRNet-W32. (h) Our method.
23 pages, 25042 KiB  
Article
Segmentation Network for Multi-Shape Tea Bud Leaves Based on Attention and Path Feature Aggregation
by Tianci Chen, Haoxin Li, Jinhong Lv, Jiazheng Chen and Weibin Wu
Agriculture 2024, 14(8), 1388; https://doi.org/10.3390/agriculture14081388 - 17 Aug 2024
Viewed by 457
Abstract
Accurately detecting tea bud leaves is crucial for the automation of tea picking robots. However, challenges arise due to tea stem occlusion and overlapping of buds and leaves, which present one bud–one leaf targets of varied shapes in the field of view and make precise segmentation of tea bud leaves difficult. To improve the segmentation accuracy of one bud–one leaf targets with different shapes and fine granularity, this study proposes a novel semantic segmentation model for tea bud leaves. The method designs a hierarchical Transformer block based on a self-attention mechanism in the encoding network, which is beneficial for capturing long-range dependencies between features and enhancing the representation of common features. Then, a multi-path feature aggregation module is designed to effectively merge the feature outputs of encoder blocks with decoder outputs, thereby alleviating the loss of fine-grained features caused by downsampling. Furthermore, a refined polarized attention mechanism is employed after the aggregation module to perform polarized filtering on features in the channel and spatial dimensions, enhancing the output of fine-grained features. The experimental results demonstrate that the proposed Unet-Enhanced model performs well on one bud–one leaf targets with different shapes, with a mean intersection over union (mIoU) of 91.18% and a mean pixel accuracy (mPA) of 95.10%. The semantic segmentation network can accurately segment tea bud leaves, providing a decision-making basis for the spatial positioning of tea picking robots.
(This article belongs to the Section Digital Agriculture)
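The hierarchical Transformer block in the encoder rests on self-attention over flattened feature maps. A minimal PyTorch sketch of that core operation, with illustrative sizes rather than the paper's exact block:

```python
import torch
import torch.nn as nn

class TokenSelfAttention(nn.Module):
    """Self-attention over a feature map flattened to a token sequence."""
    def __init__(self, channels: int, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(channels, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(channels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        tokens = x.flatten(2).transpose(1, 2)          # (B, H*W, C)
        attn_out, _ = self.attn(tokens, tokens, tokens)
        tokens = self.norm(tokens + attn_out)          # residual connection + norm
        return tokens.transpose(1, 2).reshape(b, c, h, w)

feat = torch.randn(1, 64, 16, 16)
print(TokenSelfAttention(64)(feat).shape)  # torch.Size([1, 64, 16, 16])
```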
Figure 1. Tea garden environment and label schematics.
Figure 2. Segmentation network architecture. Note: Conv1 × 1: convolution operation with kernel 1 × 1; Conv3 × 3: convolution operation with kernel 3 × 3; BN: BatchNormalization; Upsampling2D: feature upsample; ReLU: activation function; Linear: linear transformation; Interpolate: bilinear interpolation.
Figure 3. Computation of Transformer block.
Figure 4. Path feature aggregation module.
Figure 5. Polarized attention mechanism module.
Figure 6. Results of training and testing: (a) Loss curve for the training dataset; (b) Statistics of the mIoU for the testing dataset.
Figure 7. Comparison of segmentation performance of different models.
Figure 8. Comparison of segmentation results for large-size tea bud leaves.
Figure 9. Comparison of segmentation results for multi-target and fine-grained tea bud leaves.
Figure 10. Comparison of segmentation results of different network models. (A) Original image. (B) Ground truth. (C) DeepLabv3+. (D) PSPNet. (E) Hrnet. (F) Segformer. (G) Unet-Enhanced.
Figure 11. Segmentation effect of tea bud leaves with different shapes and fine-grained features. (a) mainly “tea_I”. (b) mainly “tea_V”. (c) mainly “tea_Y”.
Figure 12. Failure cases of Unet-Enhanced.
Figure 13. Shallow feature visualization.
Figure 14. Deep feature visualization.
Figure 15. Unet heat map. (a–f) are the test results of different samples.
Figure 16. Unet-Enhanced heat map. (a–f) are the test results of different samples.
18 pages, 7039 KiB  
Article
Two-Stage Detection Algorithm for Plum Leaf Disease and Severity Assessment Based on Deep Learning
by Caihua Yao, Ziqi Yang, Peifeng Li, Yuxia Liang, Yamin Fan, Jinwen Luo, Chengmei Jiang and Jiong Mu
Agronomy 2024, 14(7), 1589; https://doi.org/10.3390/agronomy14071589 - 21 Jul 2024
Cited by 2 | Viewed by 879
Abstract
Crop diseases significantly impact crop yields, and promoting specialized control of crop diseases is crucial for ensuring agricultural production stability. Disease identification primarily relies on human visual inspection, which is inefficient, inaccurate, and subjective. This study focused on plum red spot (Polystigma rubrum), proposing a two-stage detection algorithm based on deep learning and assessing the severity of the disease through the lesion coverage rate. The specific contributions are as follows: We utilized the object detection model YOLOv8 to isolate leaves and eliminate the influence of complex backgrounds. We used an improved U-Net network to segment leaves and lesions. We combined Dice Loss with Focal Loss to address the poor training performance caused by the pixel ratio imbalance between leaves and disease spots. For inconsistencies in the size and shape of leaves and lesions, we utilized ODConv and MSCA so that the model could focus on features at different scales. After verification, the accuracy rate of leaf recognition is 95.3%, and the mIoU, mPA, mPrecision, and mRecall of the leaf disease segmentation model are 90.93%, 95.21%, 95.17%, and 95.21%, respectively. This research provides an effective solution for the detection and severity assessment of plum leaf red spot disease under complex backgrounds.
(This article belongs to the Special Issue The Applications of Deep Learning in Smart Agriculture)
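Combining Dice Loss with Focal Loss for the leaf/lesion pixel imbalance can be expressed as a weighted sum of the two terms. A hedged PyTorch sketch for binary segmentation; the weighting and exact formulation used in the paper may differ:

```python
import torch
import torch.nn.functional as F

def dice_focal_loss(logits: torch.Tensor, target: torch.Tensor,
                    alpha: float = 0.5, gamma: float = 2.0, eps: float = 1e-6) -> torch.Tensor:
    """Weighted sum of soft Dice loss and binary focal loss (illustrative weights)."""
    prob = torch.sigmoid(logits)
    # Soft Dice computed over the whole batch.
    inter = (prob * target).sum()
    dice = 1 - (2 * inter + eps) / (prob.sum() + target.sum() + eps)
    # Focal loss: down-weight easy pixels by (1 - p_t)^gamma.
    bce = F.binary_cross_entropy_with_logits(logits, target, reduction="none")
    p_t = prob * target + (1 - prob) * (1 - target)
    focal = ((1 - p_t) ** gamma * bce).mean()
    return alpha * dice + (1 - alpha) * focal

logits = torch.randn(2, 1, 64, 64)
target = (torch.rand(2, 1, 64, 64) > 0.9).float()   # sparse lesion pixels
print(dice_focal_loss(logits, target).item())
```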
Figure 1. Plum leaf dataset under natural conditions.
Figure 2. Comparison of plum leaf dataset before and after data augmentation.
Figure 3. Red spot-diseased leaves after detection.
Figure 4. Flow chart of disease detection.
Figure 5. YOLOV8 structure diagram.
Figure 6. MOC_UNet Network Architecture.
Figure 7. Schematic diagram of ODConv.
Figure 8. MSCA structure diagram.
Figure 9. YOLOv8 results: (a) Precision; (b) Recall; (c) mAP@0.5.
Figure 10. Effectiveness of YOLOv8 on the detection of plum leaves.
Figure 11. Comparison of the prediction effect of different segmentation models: (a) Original images; (b) Label images; (c) PSPNet; (d) DeepLabV3+; (e) Segformer; (f) HRNetv2; (g) U-Net; (h) MOC_UNet.
Figure 12. Regression of predicted lesion coverage with true values for different models: (a) PSPNet; (b) DeepLabV3+; (c) HRNetv2; (d) Segformer; (e) U-Net; (f) MOC_UNet.
24 pages, 10938 KiB  
Article
Segmentation and Coverage Measurement of Maize Canopy Images for Variable-Rate Fertilization Using the MCAC-Unet Model
by Hailiang Gong, Litong Xiao and Xi Wang
Agronomy 2024, 14(7), 1565; https://doi.org/10.3390/agronomy14071565 - 18 Jul 2024
Viewed by 498
Abstract
Excessive fertilizer use has led to environmental pollution and reduced crop yields, underscoring the importance of research into variable-rate fertilization (VRF) based on digital image technology in precision agriculture. Current methods, which rely on spectral sensors for monitoring and prescription mapping, face significant technical challenges, high costs, and operational complexities, limiting their widespread adoption. This study presents an automated, intelligent, and precise approach to maize canopy image segmentation using a multi-scale attention Unet model to enhance VRF decision making, reduce fertilization costs, and improve accuracy. A dataset of maize canopy images under various lighting and growth conditions was collected and subjected to data augmentation and normalization preprocessing. The MCAC-Unet model, built upon the MobilenetV3 backbone network and integrating the convolutional block attention module (CBAM), atrous spatial pyramid pooling (ASPP) multi-scale feature fusion, and content-aware reassembly of features (CARAFE) adaptive upsampling modules, achieved a mean intersection over union (mIOU) of 87.51% and a mean pixel accuracy (mPA) of 93.85% in maize canopy image segmentation. Coverage measurements at a height of 1.1 m showed a relative error ranging from 3.12% to 6.82%, averaging 4.43%, with a determination coefficient of 0.911, meeting practical requirements. The proposed model and measurement system effectively address the challenges in maize canopy segmentation and coverage assessment, providing robust support for crop monitoring and VRF decision making in complex environments.
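Coverage here is the fraction of canopy pixels in the segmented image, and the quoted relative error compares that fraction with a reference measurement. A minimal NumPy sketch with illustrative values:

```python
import numpy as np

def canopy_coverage(mask: np.ndarray) -> float:
    """Fraction of pixels labelled as maize canopy in a binary mask."""
    return float(mask.astype(bool).mean())

def relative_error(measured: float, reference: float) -> float:
    return abs(measured - reference) / reference * 100.0

# Toy example: predicted mask covers 30% of the image, reference coverage is 28.5%.
mask = np.zeros((100, 100), dtype=np.uint8)
mask[:30, :] = 1
print(canopy_coverage(mask))                                   # 0.3
print(round(relative_error(canopy_coverage(mask), 0.285), 2))  # 5.26 (%)
```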
Figure 1. Image acquisition schematic.
Figure 2. Low-light conditions.
Figure 3. High-light conditions.
Figure 4. Image annotation.
Figure 5. Original and processed images.
Figure 6. The structure of the MCAC-Unet network model.
Figure 7. The structure of the Unet network model.
Figure 8. The structure of the MobileNetV3 network.
Figure 9. Depthwise separable convolutions.
Figure 10. The inverted residual structure with linear bottleneck.
Figure 11. The structure of CBAM.
Figure 12. The structure of the atrous spatial pyramid pooling.
Figure 13. The structure of the CARAFE module.
Figure 14. Model training curves of different backbone networks.
Figure 15. Segmentation results of the improved backbone network. (a) Weak light with abundant crop residues and weeds; (b) Overlapping crop leaves; (c) Unobstructed leaves with minimal crop residues and weeds under normal conditions; (d) Overlapping crop leaves with the presence of weeds; (e) Strong light with abundant weeds.
Figure 16. Model training curves.
Figure 17. Segmentation results of the improved network. (a) Weak light with abundant crop residues and weeds; (b) Overlapping crop leaves; (c) Unobstructed leaves with minimal crop residues and weeds under normal conditions; (d) Overlapping crop leaves with the presence of weeds; (e) Strong light with abundant weeds.
Figure 18. Measurement results at different heights.
17 pages, 4157 KiB  
Article
Segmentation of Apparent Multi-Defect Images of Concrete Bridges Based on PID Encoder and Multi-Feature Fusion
by Yanna Liao, Chaoyang Huang and Yafang Yin
Buildings 2024, 14(5), 1463; https://doi.org/10.3390/buildings14051463 - 17 May 2024
Cited by 1 | Viewed by 713
Abstract
To address the issue of insufficient deep contextual information mining in the semantic segmentation of multiple defects in concrete bridges, caused by the diversity in texture, shape, and scale of the defects as well as significant differences in the background, we propose the Concrete Bridge Apparent Multi-Defect Segmentation Network (PID-MHENet) based on a PID encoder and multi-feature fusion. PID-MHENet consists of a PID encoder, skip connections, and a decoder. The PID encoder adopts a multi-branch structure, including an integral branch and a proportional branch with a “thick and long” design principle and a differential branch with a “thin and short” design principle. The PID Aggregation Enhancement (PAE) combines the detail information of the proportional branch and the semantic information of the differential branch to enhance the fusion of contextual information and, at the same time, introduces self-learning parameters, which can effectively extract information on defect boundary details, texture, and background differences. The Multi-Feature Fusion Enhancement Decoding Block (MFEDB) in the decoding stage enhances and globally fuses the different feature maps introduced by the three-channel skip connection, which improves the segmentation accuracy of the network for backgrounds with similar appearance and for micro-defects. The experimental results show that the mean pixel accuracy (mPA) and mean intersection over union (mIoU) of PID-MHENet on the concrete bridge multi-defect semantic segmentation dataset improved by 5.17% and 5.46%, respectively, compared to the UNet network.
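The PAE step fuses detail features from one branch with semantic features from another and introduces self-learning parameters. The PyTorch sketch below shows one simple way to mix two same-shaped feature maps with a learnable weight; it is an illustrative stand-in, not the published PAE module:

```python
import torch
import torch.nn as nn

class LearnableFusion(nn.Module):
    """Fuse two feature maps with a learnable mixing coefficient."""
    def __init__(self):
        super().__init__()
        self.alpha = nn.Parameter(torch.tensor(0.5))  # self-learning fusion weight

    def forward(self, detail: torch.Tensor, semantic: torch.Tensor) -> torch.Tensor:
        w = torch.sigmoid(self.alpha)                 # keep the weight in (0, 1)
        return w * detail + (1 - w) * semantic

detail = torch.randn(1, 64, 32, 32)
semantic = torch.randn(1, 64, 32, 32)
print(LearnableFusion()(detail, semantic).shape)  # torch.Size([1, 64, 32, 32])
```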
Figure 1. Design conceptualization.
Figure 2. PID-MHENet network structure. The PID-MHENet network consists of a PID encoder and a decoder. The PID encoder consists of a proportional branch (light green part in the figure), an integral branch (light purple part in the figure), and a differential branch (light blue part in the figure).
Figure 3. PAE module structure. The notation within the figure includes “⊕” for element-wise addition, and “⊗” for element-wise multiplication.
Figure 4. MFEDB module structure. The notation within the figure includes “⊕” for element-wise addition, “⊗” for element-wise multiplication, and “©” for feature map concatenation. EMA and CAE represent the corresponding modules.
Figure 5. Upsampling decoder block structure. Within the figure, the notation “©” indicates feature map concatenation.
Figure 6. CAE module structure. The notation within the figure includes “⊕” for element-wise addition, “⊗” for element-wise multiplication, and “⊙” for matrix multiplication.
Figure 7. Defect pictures and labels.
Figure 8. Comparison of confusion matrix visualizations. (a) Experiment 1 confusion matrix; (b) Experiment 2 confusion matrix; (c) Experiment 3 confusion matrix; (d) Experiment 4 confusion matrix.
Figure 9. mIoU curve and loss curve. (a) mIoU comparison curve; (b) loss comparison curve.
Figure 10. Comparison of experimental visualization results.
20 pages, 4630 KiB  
Article
U-Net with Coordinate Attention and VGGNet: A Grape Image Segmentation Algorithm Based on Fusion Pyramid Pooling and the Dual-Attention Mechanism
by Xiaomei Yi, Yue Zhou, Peng Wu, Guoying Wang, Lufeng Mo, Musenge Chola, Xinyun Fu and Pengxiang Qian
Agronomy 2024, 14(5), 925; https://doi.org/10.3390/agronomy14050925 - 28 Apr 2024
Viewed by 950
Abstract
Currently, the classification of grapevine black rot disease relies on assessing the percentage of affected spots in the total area, with a primary focus on accurately segmenting these spots in images. Particularly challenging are cases in which lesion areas are small and boundaries are ill-defined, hampering precise segmentation. In our study, we introduce an enhanced U-Net network tailored for segmenting black rot spots on grape leaves. Leveraging VGG as the U-Net’s backbone, we strategically position the atrous spatial pyramid pooling (ASPP) module at the base of the U-Net to serve as a link between the encoder and decoder. Additionally, channel and spatial dual-attention modules are integrated into the decoder, alongside a feature pyramid network aimed at fusing diverse levels of feature maps to enhance the segmentation of diseased regions. Our model outperforms traditional plant disease semantic segmentation approaches like DeeplabV3+, U-Net, and PSPNet, achieving impressive pixel accuracy (PA) and mean intersection over union (MIoU) scores of 94.33% and 91.09%, respectively. Demonstrating strong performance across various levels of spot segmentation, our method showcases its efficacy in enhancing the segmentation accuracy of black rot spots on grapevines.
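The ASPP module placed at the base of the U-Net gathers context at several dilation rates in parallel. A compact PyTorch sketch of a standard ASPP block, with illustrative rates and channel sizes:

```python
import torch
import torch.nn as nn

class ASPP(nn.Module):
    """Atrous spatial pyramid pooling: parallel dilated convolutions, then fusion."""
    def __init__(self, in_ch: int, out_ch: int, rates=(1, 6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(in_ch, out_ch, 3, padding=r, dilation=r, bias=False),
                nn.BatchNorm2d(out_ch),
                nn.ReLU(inplace=True),
            )
            for r in rates
        ])
        self.project = nn.Conv2d(out_ch * len(rates), out_ch, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.project(torch.cat([b(x) for b in self.branches], dim=1))

feat = torch.randn(1, 512, 32, 32)
print(ASPP(512, 256)(feat).shape)  # torch.Size([1, 256, 32, 32])
```

Matching the padding to each dilation rate keeps all branch outputs at the input resolution, so they can be concatenated and projected back to a single feature map.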
Figure 1. Image annotation status: (a) original image; (b) image marking results. Black represents the background, and red represents the lesions.
Figure 2. Data augmentation: (a) original image; (b) flipped and added noise; (c) flipped and reduced brightness; (d) added noise and reduced brightness; (e) flipped and shifted and reduced brightness; (f) flipped and shifted.
Figure 3. U-Net structure.
Figure 4. CVU-Net network structure. The orange color block represents the location in which the ASPP module is added, and the yellow color block represents the location in which the attention mechanism is added.
Figure 5. Backbone feature extraction network structure: (a) backbone network feature extraction model; (b) backbone feature extraction partial implementation approach.
Figure 6. SENet structure.
Figure 7. CA structure.
Figure 8. Enhancement of the structure of the part of the feature extraction network. (a) Enhanced feature extraction partial model; (b) enhancement of the feature extraction component implementation approach.
Figure 9. ASPP structure.
Figure 10. Model average intersection ratio versus learning rate and number of iterations.
Figure 11. Segmentation effect of different algorithms: (a) original image; (b) ground truth; (c) U-Net; (d) PSPNet; (e) DeeplabV3+; (f) CVU-Net. The green boxes represent areas where there is a large difference between the different methods.
Figure 12. Comparison of segmentation accuracy of each model for graded lesions.
18 pages, 6470 KiB  
Article
Enhanced Tropical Cyclone Precipitation Prediction in the Northwest Pacific Using Deep Learning Models and Ensemble Techniques
by Lunkai He, Qinglan Li, Jiali Zhang, Xiaowei Deng, Zhijian Wu, Yaoming Wang, Pak-Wai Chan and Na Li
Water 2024, 16(5), 671; https://doi.org/10.3390/w16050671 - 25 Feb 2024
Viewed by 1468
Abstract
This study focuses on optimizing precipitation forecast induced by tropical cyclones (TCs) in the Northwest Pacific region, with lead times ranging from 6 to 72 h. The research employs deep learning models, such as U-Net, UNet3+, SE-UNet, and SE-UNet3+, which utilize precipitation forecast data from the Global Forecast System (GFS) and real-time GFS environmental background data using a U-Net structure. To comprehensively make use of the precipitation forecasts from these models, we additionally use probabilistic matching (PM) and simple averaging (AVR) in rainfall prediction. The precipitation data from the Global Precipitation Measurement (GPM) Mission serves as the rainfall observation. The results demonstrate that the root mean squared errors (RMSEs) of U-Net, UNet3+, SE-UNet, SE-UNet3+, AVR, and PM are lowered by 8.7%, 10.1%, 9.7%, 10.0%, 11.4%, and 11.5%, respectively, when compared with the RMSE of the GFS TC precipitation forecasts, while the mean absolute errors are reduced by 9.6%, 11.3%, 9.0%, 12.0%, 12.8%, and 13.0%, respectively. Furthermore, the neural network models improve the precipitation threat scores (TSs). On average, the TSs of U-Net, UNet3+, SE-UNet, SE-UNet3+, AVR, and PM are raised by 12.8%, 21.3%, 19.3%, 20.7%, 22.5%, and 22.9%, respectively, compared with the GFS model. Notably, AVR and PM outperform all other individual models, with PM’s performance slightly better than AVR’s. The most important feature variables in optimizing TC precipitation forecast in the Northwest Pacific region based on the UNet-based neural network include GFS precipitation forecast data, land and sea masks, latitudinal winds at 500 hPa, and vertical winds at 500 hPa.
(This article belongs to the Section Hydrology)
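The threat score (TS) counts hits against hits plus misses plus false alarms at a given rainfall threshold, and the AVR ensemble is a plain mean of the member forecasts. A NumPy sketch of both, with illustrative arrays and threshold:

```python
import numpy as np

def threat_score(forecast: np.ndarray, observed: np.ndarray, threshold: float) -> float:
    """TS = hits / (hits + misses + false alarms) at a rainfall threshold (mm/day)."""
    f = forecast >= threshold
    o = observed >= threshold
    hits = np.logical_and(f, o).sum()
    misses = np.logical_and(~f, o).sum()
    false_alarms = np.logical_and(f, ~o).sum()
    return hits / (hits + misses + false_alarms)

# Simple-averaging (AVR) ensemble of several model forecasts on the same grid.
members = [np.random.gamma(2.0, 10.0, size=(50, 50)) for _ in range(4)]
avr = np.mean(members, axis=0)
obs = np.random.gamma(2.0, 10.0, size=(50, 50))
print(round(threat_score(avr, obs, threshold=25.0), 3))
```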
Figure 1. TC point sample positions are denoted by “×” for model training and “+” for model testing. The colors of symbols “×” and “+”, ranging from light to dark, represent the TC intensity from weak to strong. The red line refers to the track of TC Ma-on.
Figure 2. Structures of (a) U-Net and SE-UNet and (b) UNet3+ and SE-UNet3+.
Figure 3. Boxplots of RMSEs for different models with various lead times. The box plot shows the median (line inside the box) and the upper and lower quartiles (top and bottom of the box), while the whiskers extend to the minimum and maximum non-outlier values. Outliers are indicated by dots beyond the whiskers.
Figure 4. RMSEs of different models for various TC levels: (a) all TC points, (b) TD, (c) TS, (d) STS, (e) TY, and (f) SSTY.
Figure 5. The spatial distribution of precipitation prediction RMSE (mm) by PM and GFS models with different lead times: (a) PM with 24 h, (b) PM with 48 h, (c) PM with 72 h, (d) GFS with 24 h, (e) GFS with 48 h, (f) GFS with 72 h. The spatial distribution of the RMSE (mm) difference in precipitation prediction between GFS model and PM model with different lead times: (g) 24 h, (h) 48 h, and (i) 72 h.
Figure 6. Boxplots of TSs for precipitation prediction at different precipitation thresholds: (a) 10 mm/day, (b) 25 mm/day, (c) 50 mm/day, and (d) 100 mm/day.
Figure 7. TSs in precipitation prediction by different models with different lead times for various precipitation thresholds: (a) 10 mm/day, (b) 25 mm/day, (c) 50 mm/day, and (d) 100 mm/day.
Figure 8. (a) RMSE and (b) MAE for precipitation prediction for TC Ma-on by different models.
Figure 9. Comparison between the accumulated precipitation forecasts (mm) by PM and GFS models with different lead times, and the precipitation observations by GPM for TC Ma-on at 113° E, 20.5° N: precipitation forecasts by PM with different lead times of (a) 24 h, (d) 48 h, (g) 72 h; precipitation forecasts by GFS with different lead times of (b) 24 h, (e) 48 h, (h) 72 h; accumulated precipitation observation by GPM within different periods of (c) 24 h, (f) 48 h, (i) 72 h. The red star denotes the TC Ma-on location.
Figure 10. The first 10 significant features for 24 h accumulated precipitation prediction by the models of (a) U-Net, (b) SE-UNet, (c) UNet3+, and (d) SE-UNet3+.
15 pages, 2064 KiB  
Article
Portrait Semantic Segmentation Method Based on Dual Modal Information Complementarity
by Guang Feng and Chong Tang
Appl. Sci. 2024, 14(4), 1439; https://doi.org/10.3390/app14041439 - 9 Feb 2024
Viewed by 779
Abstract
Semantic segmentation of human images is a research hotspot in the field of computer vision. At present, semantic segmentation models based on U-net generally lack the ability to capture the spatial information of images. At the same time, semantic incompatibility exists because the feature maps of the encoder and decoder are directly connected in the skip connection stage. In addition, in low-light scenes such as at night, false segmentation and reduced segmentation accuracy are likely to occur. To solve the above problems, a portrait semantic segmentation method based on dual-modal information complementarity is proposed. The encoder adopts a double-branch structure and uses an SK-ASSP module that can adaptively adjust the convolution weights of different receptive fields to extract features in the RGB and grayscale image modes, respectively, and carries out cross-modal information complementarity and feature fusion. A hybrid attention mechanism is used in the skip connection phase to capture both the channel and coordinate context information of the image. Experiments on a human matting dataset show that the PA and MIoU coefficients of this model reach 96.58% and 94.48%, respectively, which is better than the U-net benchmark model and other mainstream semantic segmentation models.
(This article belongs to the Section Computing and Artificial Intelligence)
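The dual-branch encoder takes the RGB image and its grayscale counterpart as complementary modalities. The sketch below prepares the two inputs and fuses their stem features by concatenation; the stems and fusion are illustrative, not the paper's SK-ASSP design:

```python
import torch
import torch.nn as nn

class DualModalStem(nn.Module):
    """Separate stems for RGB and grayscale inputs, fused by concatenation."""
    def __init__(self, out_ch: int = 32):
        super().__init__()
        self.rgb_stem = nn.Conv2d(3, out_ch, kernel_size=3, padding=1)
        self.gray_stem = nn.Conv2d(1, out_ch, kernel_size=3, padding=1)
        self.fuse = nn.Conv2d(2 * out_ch, out_ch, kernel_size=1)

    def forward(self, rgb: torch.Tensor) -> torch.Tensor:
        # Luminance conversion (ITU-R BT.601 weights) gives the grayscale modality.
        gray = 0.299 * rgb[:, 0:1] + 0.587 * rgb[:, 1:2] + 0.114 * rgb[:, 2:3]
        return self.fuse(torch.cat([self.rgb_stem(rgb), self.gray_stem(gray)], dim=1))

img = torch.rand(1, 3, 128, 128)
print(DualModalStem()(img).shape)  # torch.Size([1, 32, 128, 128])
```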
Figure 1. The network structure of U-net.
Figure 2. The network structure of the method in this article.
Figure 3. The structure of the feature SK-ASSP module.
Figure 4. Structure of Cross-modal complementarity module.
Figure 5. Hierarchical attention mechanism module.
Figure 6. Samples of data enhancement in this paper. (a) original images, (b) color adjustment images, (c) background replacement images.
Figure 7. Segmentation results of different models. (a) is the original image; (b–f) are the segmentation effects of U-net, LinkNet, PortraitNet, Deeplab v3+, and Trans UNet; and (g) is the segmentation effect of this paper’s segmentation model.
16 pages, 3202 KiB  
Article
Machine Learning-Based Estimation of Tropical Cyclone Intensity from Advanced Technology Microwave Sounder Using a U-Net Algorithm
by Zichao Liang, Yong-Keun Lee, Christopher Grassotti, Lin Lin and Quanhua Liu
Remote Sens. 2024, 16(1), 77; https://doi.org/10.3390/rs16010077 - 24 Dec 2023
Viewed by 1399
Abstract
A U-Net algorithm was used to retrieve surface pressure and wind speed over the ocean within tropical cyclones (TCs) and their neighboring areas using NOAA-20 Advanced Technology Microwave Sounder (ATMS) reprocessed Sensor Data Record (SDR) brightness temperatures (TBs) and geolocation information. For TC locations, International Best Track Archive for Climate Stewardship (IBTrACS) data have been used over the North Atlantic Ocean and West Pacific Ocean between 2018 and 2021. The European Centre for Medium-Range Weather Forecasts (ECMWF) Reanalysis v5 (ERA5) surface pressure and wind speed were employed as reference labels. Preliminary results demonstrated that the visualizations for wind speed and pressure matched the prediction and ERA5 location. The residual biases and standard deviations between the predicted and reference labels were about 0.15 m/s and 1.95 m/s, respectively, for wind speed and 0.48 hPa and 2.67 hPa, respectively, for surface pressure, after applying cloud screening for each ATMS pixel. This indicates that the U-Net model is effective for surface wind speed and surface pressure estimates over general ocean conditions.
(This article belongs to the Special Issue Advances in Remote Sensing and Atmospheric Optics)
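The quoted residual bias and standard deviation are simply the mean and spread of prediction-minus-reference differences over screened pixels. A short NumPy sketch with illustrative arrays:

```python
import numpy as np

def residual_stats(prediction: np.ndarray, reference: np.ndarray, valid: np.ndarray):
    """Bias (mean residual) and standard deviation over cloud-screened pixels."""
    residual = (prediction - reference)[valid]
    return residual.mean(), residual.std()

# Toy example: predicted vs. reference wind speed (m/s) with a screening mask.
rng = np.random.default_rng(0)
reference = rng.uniform(0, 40, size=(96, 96))
prediction = reference + rng.normal(0.15, 1.95, size=reference.shape)
valid = rng.random(reference.shape) > 0.2          # True where the pixel passes screening
bias, sd = residual_stats(prediction, reference, valid)
print(round(bias, 2), round(sd, 2))                # close to 0.15 and 1.95
```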
Figure 1. U-Net Architecture in this study. Detailed information is described in Section 2 Methodology.
Figure 2. Flowchart of the data preprocessing.
Figure 3. Loss curve for U-Net training and validation loss converging over 500 epochs.
Figure 4. Single sample residual histograms (U-Net prediction minus ERA5) for (a,b) surface wind speed and surface pressure residuals, respectively, for the sample valid on 10 October 2018 at 06 UTC, while (c,d) contain similar residuals but for the sample valid on 14 September 2018 at 06 UTC.
Figure 5. U-Net prediction and ERA5 surface wind speed maps. (a,b) represent ERA5 and U-Net predicted wind speed, respectively, of the sample valid on 10 October 2018 at 06 UTC (Leslie), while (c,d) represent ERA5 and U-Net predicted wind speed, respectively, of the sample valid on 14 September 2018 at 06 UTC (Joyce in the middle and Helene on the right-side).
Figure 6. U-Net prediction and ERA5 surface pressure maps. (a,b) represent ERA5 and U-Net predicted surface pressure, respectively, of the sample valid on 10 October 2018 at 06 UTC, while (c,d) represent ERA5 and U-Net predicted surface pressure, respectively, of the sample valid on 14 September 2018 at 06 UTC.
Figure 7. Scatterplots of U-Net prediction vs. ERA5 for (a) wind speed (m/s) and (b) surface pressure (hPa) across all 27 test samples. The pixels included in this analysis were selected from within a 350 km radius circle centered on the TC. The data distribution changes from dense to sparse as the color shifts from yellow to blue. (R: Pearson correlation coefficients; SD: standard deviation; N: number of selected pixels).
18 pages, 4806 KiB  
Article
Extracting Citrus in Southern China (Guangxi Region) Based on the Improved DeepLabV3+ Network
by Hao Li, Jia Zhang, Jia Wang, Zhongke Feng, Boyi Liang, Nina Xiong, Junping Zhang, Xiaoting Sun, Yibing Li and Shuqi Lin
Remote Sens. 2023, 15(23), 5614; https://doi.org/10.3390/rs15235614 - 3 Dec 2023
Cited by 1 | Viewed by 1691
Abstract
China is one of the countries with the largest citrus cultivation areas, and its citrus industry has received significant attention due to its substantial economic benefits. Traditional manual forestry surveys and remote sensing image classification tasks are labor-intensive and time-consuming, resulting in low efficiency. Remote sensing technology holds great potential for obtaining spatial information on citrus orchards on a large scale. This study proposes a lightweight model for citrus plantation extraction that combines the DeepLabV3+ model with the convolutional block attention module (CBAM) attention mechanism, with a focus on the phenological growth characteristics of citrus in the Guangxi region. The objective is to address issues such as inaccurate extraction of citrus edges in high-resolution images, misclassification and omissions caused by intra-class differences, as well as the large number of network parameters and long training time found in classical semantic segmentation models. To reduce parameter count and improve training speed, the MobileNetV2 lightweight network is used as a replacement for the Xception backbone network in DeepLabV3+. Additionally, the CBAM is introduced to extract citrus features more accurately and efficiently. Moreover, in consideration of the growth characteristics of citrus, this study augments the feature input with additional channels to better capture and utilize key phenological features of citrus, thereby enhancing the accuracy of citrus recognition. The results demonstrate that the improved DeepLabV3+ model exhibits high reliability in citrus recognition and extraction, achieving an overall accuracy (OA) of 96.23%, a mean pixel accuracy (mPA) of 83.79%, and a mean intersection over union (mIoU) of 85.40%. These metrics represent an improvement of 11.16%, 14.88%, and 14.98%, respectively, compared to the original DeepLabV3+ model. Furthermore, when compared to classical semantic segmentation models, such as UNet and PSPNet, the proposed model achieves higher recognition accuracy. Additionally, the improved DeepLabV3+ model demonstrates a significant reduction in both parameters and training time. Generalization experiments conducted in Nanning, Guangxi Province, further validate the model’s strong generalization capabilities. Overall, this study emphasizes extraction accuracy, reduction in parameter count, adherence to timeliness requirements, and facilitation of rapid and accurate extraction of citrus plantation areas, presenting promising application prospects.
(This article belongs to the Section Remote Sensing in Agriculture and Vegetation)
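Augmenting the feature input with additional channels amounts to widening the first convolution of the backbone. A hedged PyTorch sketch of that adjustment; the backbone and the extra bands are illustrative assumptions:

```python
import torch
import torch.nn as nn

def widen_first_conv(conv: nn.Conv2d, extra_channels: int) -> nn.Conv2d:
    """Return a copy of `conv` that accepts extra input channels, keeping existing weights."""
    new_conv = nn.Conv2d(conv.in_channels + extra_channels, conv.out_channels,
                         kernel_size=conv.kernel_size, stride=conv.stride,
                         padding=conv.padding, bias=conv.bias is not None)
    with torch.no_grad():
        new_conv.weight[:, :conv.in_channels] = conv.weight       # reuse the RGB weights
        new_conv.weight[:, conv.in_channels:] = 0.0               # start extra bands at zero
        if conv.bias is not None:
            new_conv.bias.copy_(conv.bias)
    return new_conv

first = nn.Conv2d(3, 32, kernel_size=3, stride=2, padding=1)
wider = widen_first_conv(first, extra_channels=2)      # e.g. hypothetical NIR + phenology index bands
print(wider(torch.randn(1, 5, 64, 64)).shape)          # torch.Size([1, 32, 32, 32])
```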
Figure 1. Study area. (a) Geographic location of the study area. (b) Main study area, i.e., Yangshuo County, Guangxi Province. (c,d) show the labeled areas of citrus samples (marked by yellow and green blocks). The images used are GF-2 images with pseudo-color components (R = near-infrared, G = red, B = green).
Figure 2. Structure of improved DeepLabV3+ model.
Figure 3. Structure of CBAM: (a) Channel attention module; (b) Spatial attention module; (c) CBAM.
Figure 4. Comparison of extraction accuracy of various models for citrus.
Figure 5. Citrus extraction results using four different models, where the black area is the background area, the gray is the citrus sample labeled area, and the white is the citrus area extracted by the models. Among the three special plots selected, plot (a) contains roads and water, plot (b) contains complex and fragmentary citrus planting areas, and plot (c) contains concentrated citrus planting areas.
Figure 6. Results of model testing in Nanning City.
19 pages, 6565 KiB  
Article
Recurrent Residual Deformable Conv Unit and Multi-Head with Channel Self-Attention Based on U-Net for Building Extraction from Remote Sensing Images
by Wenling Yu, Bo Liu, Hua Liu and Guohua Gou
Remote Sens. 2023, 15(20), 5048; https://doi.org/10.3390/rs15205048 - 20 Oct 2023
Cited by 4 | Viewed by 1243
Abstract
Considering the challenges associated with accurately identifying building shape features and distinguishing between building and non-building features during the extraction of buildings from remote sensing images using deep learning, we propose a novel method for building extraction based on U-Net, incorporating a recurrent residual deformable convolution unit (RDCU) module and augmented multi-head self-attention (AMSA). By replacing conventional convolution modules with an RDCU, which adopts a deformable convolutional neural network within a residual network structure, the proposed method enhances the module’s capacity to learn intricate details such as building shapes. Furthermore, AMSA is introduced into the skip connection function to enhance feature expression and positions through content–position enhancement operations and content–content enhancement operations. Moreover, AMSA integrates an additional fusion channel attention mechanism to aid in identifying cross-channel feature expression differences. For the Massachusetts dataset, the proposed method achieves an Intersection over Union (IoU) score of 89.99%, a Pixel Accuracy (PA) score of 93.62%, and a Recall score of 89.22%. For the WHU Satellite dataset I, the proposed method achieves an IoU score of 86.47%, a PA score of 92.45%, and a Recall score of 91.62%. For the INRIA dataset, the proposed method achieves an IoU score of 80.47%, a PA score of 90.15%, and a Recall score of 85.42%.
(This article belongs to the Section Remote Sensing Image Processing)
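The RDCU wraps deformable convolution inside a recurrent residual structure; the sketch below shows only the deformable-convolution core, using torchvision's DeformConv2d with illustrative sizes rather than the authors' full module:

```python
import torch
import torch.nn as nn
from torchvision.ops import DeformConv2d

class DeformableBlock(nn.Module):
    """3x3 deformable convolution whose sampling offsets are predicted from the input."""
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        # Two offsets (dx, dy) per kernel position: 2 * 3 * 3 = 18 channels.
        self.offset_pred = nn.Conv2d(in_ch, 18, kernel_size=3, padding=1)
        self.deform = DeformConv2d(in_ch, out_ch, kernel_size=3, padding=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.deform(x, self.offset_pred(x))

feat = torch.randn(1, 64, 32, 32)
print(DeformableBlock(64, 128)(feat).shape)  # torch.Size([1, 128, 32, 32])
```

Letting the offsets deform the sampling grid is what allows the kernel to follow irregular building outlines instead of a fixed square neighborhood.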
Show Figures

Figure 1

Figure 1
<p>The framework of the proposed method. The encoder–decoder uses RDCU module for feature extraction. The skip connection part is fused after passing through AMSA module, and outputs building segmentation mask after the classifier.</p>
Full article ">Figure 2
<p>Different variants of the convolutional units including (<b>a</b>) the forward convolutional unit, (<b>b</b>) the ResNet block, and (<b>c</b>) the RDCU.</p>
Full article ">Figure 3
<p>The submodule of RDCU. The DCnov in the figure is denoted as deformable convolution.</p>
Full article ">Figure 4
<p>The structure of AMSA. Attentional weights are calculated at the channel and spatial dimensions.</p>
Full article ">Figure 5
<p>An example of the Massachusetts building dataset. (<b>a</b>) is the example of the original image, and (<b>b</b>) is the label of (<b>a</b>).</p>
Full article ">Figure 6
<p>Building images from four different regions in the Massachusetts building dataset: (<b>a</b>) a building image from Venice, (<b>b</b>) a building image from New York, (<b>c</b>) the building image from Los Angeles, and (<b>d</b>) a building image from Cairo.</p>
Full article ">Figure 7
<p>Building images from five different regions in the Massachusetts building dataset: (<b>a</b>) a building image from Austin, (<b>b</b>) a building image from Chicago, (<b>c</b>) a building image from Los Kisap, (<b>d</b>) a building image from Tyrol, and (<b>e</b>) a building image from Vienna.</p>
Full article ">Figure 8
<p>The local results of building detection using different methods in the Massachusetts dataset.</p>
Full article ">Figure 8 Cont.
<p>The local results of building detection using different methods in the Massachusetts dataset.</p>
Full article ">Figure 9
<p>The local results of building detection using different methods in the WHU Satellite dataset I.</p>
Figure 9 Cont.">
Figure 10">
Figure 10
<p>The local results of building detection using different methods in the INRIA dataset.</p>
Figure 10 Cont.">
Figure 11">
Figure 11
<p>WHU Aerial imagery dataset: ① is the training area, ② is the validation area, and ③ and ④ are the test areas.</p>
Figure 12">
Figure 12
<p>Samples of building extraction results using different models with the WHU Aerial imagery dataset (ablation study).</p>
">
27 pages, 13192 KiB  
Article
An Efficient and Automated Image Preprocessing Using Semantic Segmentation for Improving the 3D Reconstruction of Soybean Plants at the Vegetative Stage
by Yongzhe Sun, Linxiao Miao, Ziming Zhao, Tong Pan, Xueying Wang, Yixin Guo, Dawei Xin, Qingshan Chen and Rongsheng Zhu
Agronomy 2023, 13(9), 2388; https://doi.org/10.3390/agronomy13092388 - 14 Sep 2023
Cited by 2 | Viewed by 1546
Abstract
The investigation of plant phenotypes through 3D modeling has emerged as a significant field in the study of automated plant phenotype acquisition. In 3D model construction, conventional image preprocessing methods exhibit low efficiency, which increases the difficulty of model construction. [...] Read more.
The investigation of plant phenotypes through 3D modeling has emerged as a significant field in the study of automated plant phenotype acquisition. In 3D model construction, conventional image preprocessing methods exhibit low efficiency, which increases the difficulty of model construction. In order to ensure the accuracy of the 3D model while reducing the difficulty of image preprocessing and improving the speed of 3D reconstruction, deep learning semantic segmentation technology was used in the present study to preprocess original images of soybean plants. Additionally, control experiments involving soybean plants of different varieties and different growth periods were conducted. Models based on manual image preprocessing and models based on image segmentation were established, and point cloud matching, distance calculation and model matching degree calculation were carried out. In this study, the DeepLabv3+, Unet, PSPnet and HRnet networks were used to conduct semantic segmentation of the original images of soybean plants in the vegetative stage (V), and the Unet network achieved the best test performance, with mIoU, mPA, mPrecision and mRecall reaching 0.9919, 0.9953, 0.9965 and 0.9953, respectively. At the same time, by comparing the distance results and matching accuracy between the models and the reference models, it can be concluded that semantic segmentation effectively addresses the challenges of image preprocessing and long reconstruction times, greatly improves robustness to noisy input and ensures the accuracy of the model. Semantic segmentation thus plays a crucial role as a fundamental component in enabling efficient and automated image preprocessing for the 3D reconstruction of soybean plants during the vegetative stage. In the future, semantic segmentation will provide a solution for the preprocessing of 3D reconstruction for other crops. Full article
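The preprocessing step can be pictured as masking each multi-view image with the network's predicted plant mask before it enters the 3D reconstruction pipeline. The sketch below assumes OpenCV and a binary mask image; the file paths, the `mask_plant` helper, and the white-background choice are illustrative, not taken from the paper.

```python
import cv2
import numpy as np

def mask_plant(image_path: str, mask_path: str, out_path: str) -> None:
    """Keep only plant pixels in a multi-view image, using a predicted binary mask."""
    image = cv2.imread(image_path)                      # BGR image of the potted plant
    mask = cv2.imread(mask_path, cv2.IMREAD_GRAYSCALE)  # predicted mask (plant pixels > 0)
    plant = mask > 0
    cleaned = np.full_like(image, 255)                  # uniform white background
    cleaned[plant] = image[plant]                       # copy plant pixels only
    cv2.imwrite(out_path, cleaned)

# Hypothetical batch usage over one plant's image sequence:
# for i, (img, msk) in enumerate(zip(sorted(image_files), sorted(mask_files))):
#     mask_plant(img, msk, f"view_{i:03d}.png")
```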
(This article belongs to the Section Precision and Digital Agriculture)
Show Figures

Figure 1

Figure 1
<p>An overview of the proposed method.</p>
Figure 2">
Figure 2
<p>Soybean 3D reconstruction image acquisition. (<b>a</b>) Soybean image acquisition platform; (<b>b</b>) image acquisition method flowchart.</p>
Figure 3">
Figure 3
<p>Semantic segmentation model architecture. (<b>a</b>) DeepLabv3+; (<b>b</b>) Unet; (<b>c</b>) PSPnet; (<b>d</b>) HRNet.</p>
Figure 3 Cont.">
Figure 4">
Figure 4
<p>The process of 3D reconstruction of soybean plants.</p>
Figure 5">
Figure 5
<p>The process of model comparison.</p>
Figure 6">
Figure 6
<p>The training loss and train mIoU variation curves during the training process of the four models. (<b>a</b>) DeepLabv3+; (<b>b</b>) Unet; (<b>c</b>) PSPnet; (<b>d</b>) HRNet.</p>
Figure 6 Cont.">
Figure 7">
Figure 7
<p>Histograms of the approximate distances between the comparison model and the reference model of the DN251 soybean plant at different stages. (<b>a</b>) V1 stage; (<b>b</b>) V2 stage; (<b>c</b>) V3 stage; (<b>d</b>) V4 stage; (<b>e</b>) V5 stage. (Left: Cloud/Cloud distance. Right: Cloud/Mesh distance.)</p>
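The Cloud/Cloud distances summarized in these histograms are, in essence, nearest-neighbour distances from each point of the compared cloud to the reference cloud. A minimal SciPy sketch of that idea is given below; the authors' actual tooling and the Cloud/Mesh variant are not reproduced here.

```python
import numpy as np
from scipy.spatial import cKDTree

def cloud_to_cloud_distances(compared: np.ndarray, reference: np.ndarray) -> np.ndarray:
    """Nearest-neighbour distance from every point of `compared` to the `reference` cloud.

    Both inputs are (N, 3) arrays of XYZ coordinates.
    """
    tree = cKDTree(reference)                 # spatial index over the reference cloud
    distances, _ = tree.query(compared, k=1)  # Euclidean distance to the closest point
    return distances

# d = cloud_to_cloud_distances(np.random.rand(1000, 3), np.random.rand(2000, 3))
# print(d.mean(), np.percentile(d, 95))       # summary statistics behind a histogram
```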
Figure 7 Cont.">
Figure 8">
Figure 8
<p>The schematic of the point cloud model of HN51 soybean plants at different stages using both methods. (<b>a</b>) V1 stage; (<b>b</b>) V2 stage; (<b>c</b>) V3 stage; (<b>d</b>) V4 stage; (<b>e</b>) V5 stage. (Left is soybean 3D point cloud models based on segmentation image. Right is soybean 3D point cloud models based on manually preprocessed image).</p>
Figure 8 Cont.">
Figure 9">
Figure 9
<p>Local schematic diagram of HN 51 soybean plant point cloud model in different stages using the two methods. (<b>a</b>) V1 stage; (<b>b</b>) V2 stage; (<b>c</b>) V3 stage; (<b>d</b>) V4 stage; (<b>e</b>) V5 stage. (Left is soybean 3D point cloud models based on segmentation image. Right is soybean 3D point cloud models based on manually preprocessed image).</p>
Figure 9 Cont.">
Figure 10">
Figure 10
<p>Point cloud diagrams of DN 252 and HN 48 soybean plants at the V5 stage, obtained using the two methods. (<b>a</b>) DN 252 soybean plant at the V5 stage; (<b>b</b>) HN 48 soybean plant at the V5 stage.</p>
Figure 10 Cont.">
Figure A1">
Figure A1
<p>The confusion matrix diagrams of true versus predicted values on the training set. (<b>a</b>) DeepLabv3+; (<b>b</b>) Unet; (<b>c</b>) PSPnet; (<b>d</b>) HRNet.</p>
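From a confusion matrix like the ones shown here, the reported mIoU, mPA, mPrecision, and mRecall can be derived with the usual class-averaged formulas. The sketch below assumes NumPy and the common convention in which mean pixel accuracy equals mean per-class recall; the authors' exact evaluation code is not given.

```python
import numpy as np

def mean_segmentation_metrics(cm: np.ndarray):
    """Class-averaged metrics from a confusion matrix.

    cm[i, j] counts pixels whose true class is i and predicted class is j.
    """
    tp = np.diag(cm).astype(float)
    fp = cm.sum(axis=0) - tp          # predicted as the class but actually another class
    fn = cm.sum(axis=1) - tp          # belonging to the class but predicted as another class
    eps = 1e-10

    iou = tp / (tp + fp + fn + eps)
    precision = tp / (tp + fp + eps)
    recall = tp / (tp + fn + eps)     # per-class accuracy, i.e. the usual per-class PA

    miou = iou.mean()
    mpa = recall.mean()               # mean pixel accuracy (mean per-class recall)
    mprecision = precision.mean()
    mrecall = recall.mean()
    return miou, mpa, mprecision, mrecall

# cm = np.array([[980, 20], [15, 985]])   # hypothetical 2-class (background, plant) counts
# print(mean_segmentation_metrics(cm))
```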
Figure A2">
Figure A2
<p>Test results of the four models DeepLabv3+, Unet, PSPnet and HRnet. (<b>a</b>) DeepLabv3+; (<b>b</b>) Unet; (<b>c</b>) PSPnet; (<b>d</b>) HRNet.</p>
Figure A2 Cont.">
">