Search Results (1,334)

Search Parameters:
Keywords = F-CNN

20 pages, 8420 KiB  
Article
CRAUnet++: A New Convolutional Neural Network for Land Surface Water Extraction from Sentinel-2 Imagery by Combining RWI with Improved Unet++
by Nan Li, Xiaohua Xu, Shifeng Huang, Yayong Sun, Jianwei Ma, He Zhu and Mengcheng Hu
Remote Sens. 2024, 16(18), 3391; https://doi.org/10.3390/rs16183391 - 12 Sep 2024
Abstract
Accurately mapping surface water bodies with remote sensing is of great significance for water resources management, flood monitoring, and drought monitoring. Many researchers have studied deep learning image recognition algorithms based on convolutional neural networks, and a variety of CNN variants have been proposed for extracting water bodies from remote sensing images. However, because of the shallow convolutional layers employed and the underutilization of water spectral information, most CNN-based water body extraction methods for remote sensing images are limited in accuracy. In this study, we propose a novel automatic surface water extraction method based on a convolutional neural network (CRAUnet++) for Sentinel-2 images. The proposed method has three parts: (1) substituting the feature extractor of the original Unet++ with ResNet34 to increase the network's depth and capacity; (2) embedding the Spatial and Channel 'Squeeze and Excitation' (SCSE) module into the up-sampling stage of the network to suppress background features and amplify water body features; (3) adding the vegetation red edge-based water index (RWI) to the input data to maximize the use of the water spectral information in Sentinel-2 images without increasing the data processing time. To verify the performance and accuracy of the proposed algorithm, an ablation experiment under four different strategies and a comparison experiment against RWI, FCN, SegNet, Unet, and DeepLab v3+ were conducted on Sentinel-2 images of Poyang Lake. The results show that the precision, recall, F1, and IoU of CRAUnet++ are 95.99%, 96.41%, 96.19%, and 92.67%, respectively. CRAUnet++ performs well in extracting various types of water bodies and suppressing noise because it introduces the SCSE attention mechanism and incorporates surface water spectral features from RWI, exceeding the other five algorithms. The results demonstrate that CRAUnet++ is valid and reliable for extracting surface water bodies from Sentinel-2 images.
(This article belongs to the Section AI Remote Sensing)
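For readers who want a concrete picture of the SCSE re-weighting mentioned in the abstract, the following is a minimal PyTorch sketch of a generic Spatial and Channel 'Squeeze and Excitation' block; the channel count and reduction ratio are assumptions, and this is not the authors' implementation.

```python
import torch
import torch.nn as nn

class SCSE(nn.Module):
    """Generic Spatial and Channel 'Squeeze and Excitation' block (sketch)."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        # Channel squeeze-and-excitation: global pooling -> bottleneck -> sigmoid gate
        self.cse = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
            nn.Sigmoid(),
        )
        # Spatial squeeze-and-excitation: 1x1 conv -> per-pixel sigmoid gate
        self.sse = nn.Sequential(nn.Conv2d(channels, 1, kernel_size=1), nn.Sigmoid())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Amplify informative channels and spatial positions, then combine both paths
        return x * self.cse(x) + x * self.sse(x)

# Example: re-weight a decoder feature map in the up-sampling stage
feats = torch.randn(2, 64, 128, 128)
print(SCSE(64)(feats).shape)  # torch.Size([2, 64, 128, 128])
```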
Figure 1: The geographic location of the study area and the selected 4 Sentinel-2 Multi-Spectral Instrument (MSI) images.
Figure 2: Flowchart of Sentinel-2 MSI image preprocessing. The first row shows the process of generating training sample data used as input to the model, and the second row shows the process of generating water body labels used to calculate model losses.
Figure 3: Examples of data augmentation. (a) The original image and corresponding label; (b) the original image flipped up and down; (c) the original image flipped left and right; (d) the original image rotated by 180°.
Figure 4: Reflectance of water, soil, and vegetation at different wavelengths [52].
Figure 5: The structure of CRAUnet++. RGB+RWI indicates a combination of red, green, blue, and vegetation red edge-based water index (RWI) images.
Figure 6: The structure of the ResNet34 feature extractor used in CRAUnet++. It consists of two BasicBlocks, represented by different colors in the figure. The first convolutional layer of the green BasicBlock has a stride of 1, so its input and output feature maps are the same size; the first convolutional layer of the yellow BasicBlock has a stride of 2 and downsamples the input feature map by a factor of two.
Figure 7: The structure of the Spatial and Channel 'Squeeze and Excitation' (SCSE) module. The first row weights the input data along the spatial dimension, and the second row weights it along the channel dimension.
Figure 8: Sentinel-2 images under different band combinations: (a) true color composite image, (b) false color composite image, and (c) vegetation red edge-based water index (RWI) image.
Figure 9: The trend of accuracy and loss values on the training and testing sets (the upper graph shows accuracy, the lower graph shows loss; the red line represents the testing set, and the blue line the training set).
Figure 10: Visualization of water extraction results for the ablation studies: (a) images; (b) labels; (c) Baseline; (d) Baseline + RWI; (e) Baseline + RWI + ResNet34; (f) Baseline + RWI + ResNet34 + SCSE. Black denotes the background, and white denotes water bodies.
Figure 11: Visualization results of CRAUnet++, RWI, and CNN-based semantic segmentation networks on the Sentinel-2 dataset: (a) images; (b) labels; (c) RWI; (d) FCN; (e) SegNet; (f) Unet; (g) DeepLab v3+; (h) CRAUnet++. Black denotes the background, and white denotes water bodies.
15 pages, 6817 KiB  
Article
A Fully Connected Neural Network (FCNN) Model to Simulate Karst Spring Flowrates in the Umbria Region (Central Italy)
by Francesco Maria De Filippi, Matteo Ginesi and Giuseppe Sappa
Water 2024, 16(18), 2580; https://doi.org/10.3390/w16182580 - 12 Sep 2024
Abstract
In recent decades, climate change has led to increasingly frequent drought events in the Mediterranean area, creating an urgent need for more sustainable management of the groundwater resources exploited for drinking and agricultural purposes. One of the most challenging issues is providing reliable simulations and forecasts of karst spring discharges, since the limited data available on them, as well as the complexity of the hydrological processes in their feeding aquifers, is often a major problem for water service managers and researchers. To plan sustainable water resource exploitation that can cope with future shortages, groundwater availability should be assessed by continuously monitoring spring discharge throughout the hydrological year, using the collected data to better understand past behaviour and, where possible, forecast future behaviour under severe drought. The aim of this paper is to understand the factors that govern different spring discharge patterns in response to rainfall inputs and to present a model, based on artificial neural network (ANN) training and cross-correlation analyses, for evaluating the discharge of several karst springs in the Umbria region (Central Italy). The model is a fully connected neural network (FCNN) and has been used both for filling gaps in the spring discharge time series and for simulating the response of six springs to seasonal rainfall patterns from a 20-year continuous daily record collected and provided by the Regional Environmental Protection Agency (ARPA) of the Umbria region.
(This article belongs to the Special Issue Recent Advances in Karstic Hydrogeology, 2nd Edition)
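As a rough illustration of the kind of fully connected regression network described above, the sketch below maps a window of recent rainfall to a spring discharge value; the window length, layer sizes, and optimizer settings are assumptions, not the configuration used in the paper.

```python
import torch
import torch.nn as nn

# Hypothetical setup: 30 daily rainfall values in, one discharge value (L/s) out
n_inputs = 30

model = nn.Sequential(
    nn.Linear(n_inputs, 64), nn.ReLU(),
    nn.Linear(64, 32), nn.ReLU(),
    nn.Linear(32, 1),            # predicted spring discharge
)
loss_fn = nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# One training step on a dummy batch (replace with real rainfall/discharge series)
rain = torch.randn(16, n_inputs)
discharge = torch.randn(16, 1)
optimizer.zero_grad()
loss = loss_fn(model(rain), discharge)
loss.backward()
optimizer.step()
```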
Figure 1: Simplified hydrogeological map (IMP: impermeable, SP: semi-permeable, HP: highly permeable).
Figure 2: Conceptual structure of the FCNN for filling gaps within the discharge time series.
Figure 3: Karst spring discharge time series: (a) raw data with gaps and (b) post-processed data after filling gaps.
Figure 4: Conceptual structure of the FCNN for simulating karst spring discharge behavior.
Figure 5: Final plots of the loss function and simulated vs. measured discharge values for the entire dataset of the six selected springs (Python 3.12.3). Red dotted lines in the scatter plots define the 90% confidence interval.
Figure 6: Comparison between measured (orange) and simulated (blue) spring flowrates (in L/s) in the 2000–2024 time series: (a) Rasiglia; (b) Nocera; (c) San Giovenale.
Figure 7: Comparison between measured (orange) and simulated (blue) spring flowrates (in L/s) in the 2000–2024 time series: (a) Lupa; (b) Bagnara; (c) Acquabianca.
26 pages, 1895 KiB  
Article
Enhanced Ischemic Stroke Lesion Segmentation in MRI Using Attention U-Net with Generalized Dice Focal Loss
by Beatriz P. Garcia-Salgado, Jose A. Almaraz-Damian, Oscar Cervantes-Chavarria, Volodymyr Ponomaryov, Rogelio Reyes-Reyes, Clara Cruz-Ramos and Sergiy Sadovnychiy
Appl. Sci. 2024, 14(18), 8183; https://doi.org/10.3390/app14188183 - 11 Sep 2024
Abstract
Ischemic stroke lesion segmentation in MRI images presents significant challenges, particularly due to the class imbalance between foreground and background pixels. Several approaches have been developed to achieve higher F1-Scores in stroke lesion segmentation under this challenge. These strategies include convolutional neural networks (CNNs) and models with very large numbers of parameters, which can only be trained on specialized computational architectures explicitly oriented to data processing. This paper proposes a lightweight model based on the U-Net architecture that combines an attention module with the Generalized Dice Focal loss function to enhance segmentation accuracy in the class-imbalanced setting characteristic of stroke lesions in MRI images. The study also analyzes segmentation performance according to the pixel size of stroke lesions, giving insights into the loss function behavior on the public ISLES 2015 and ISLES 2022 MRI datasets. The proposed model can effectively segment small stroke lesions with F1-Scores over 0.7, particularly in FLAIR, DWI, and T2 sequences. Furthermore, the model shows reasonable convergence with its 7.9 million parameters at 200 epochs, making it suitable for practical implementation on mid- and high-end general-purpose graphics processing units.
(This article belongs to the Special Issue Advances in Computer Vision and Semantic Segmentation, 2nd Edition)
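The weighted Generalized Dice Focal loss idea can be sketched for a binary lesion mask as below; the 0.7/0.3 weighting mirrors one configuration shown in the paper's figures, but the formulation here (binary masks, smoothing constant, focal gamma of 2) is a generic sketch rather than the authors' exact definition.

```python
import torch
import torch.nn.functional as F

def generalized_dice_focal_loss(logits, target, w_gdl=0.3, w_fl=0.7, gamma=2.0, eps=1e-6):
    """Weighted sum of a generalized Dice term and a focal term for binary masks (sketch)."""
    prob = torch.sigmoid(logits)
    # Generalized Dice: class weights inversely proportional to squared class volume
    flat_p = torch.stack([1 - prob, prob], dim=1).flatten(2)    # (N, 2, H*W)
    flat_t = torch.stack([1 - target, target], dim=1).flatten(2)
    w = 1.0 / (flat_t.sum(-1) ** 2 + eps)
    inter = (w * (flat_p * flat_t).sum(-1)).sum(-1)
    union = (w * (flat_p + flat_t).sum(-1)).sum(-1)
    gdl = 1.0 - 2.0 * inter / (union + eps)
    # Focal term: down-weight easy pixels via (1 - p_t)^gamma
    bce = F.binary_cross_entropy_with_logits(logits, target, reduction="none")
    p_t = prob * target + (1 - prob) * (1 - target)
    fl = ((1 - p_t) ** gamma * bce).mean(dim=(1, 2))
    return (w_gdl * gdl + w_fl * fl).mean()

# Dummy example: batch of 2 single-channel 64x64 lesion masks
logits = torch.randn(2, 64, 64)
target = (torch.rand(2, 64, 64) > 0.95).float()
print(generalized_dice_focal_loss(logits, target))
```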
Figure 1: Scheme of the proposed model.
Figure 2: Distribution of segmentation mask sizes (in pixels) with annotation of the first quartile (Q1), median (Q2), and third quartile (Q3): (a) ISLES 2015, (b) ISLES 2022.
Figure 3: F1-Scores resulting from changing the key hyperparameters λ_FL (FL) and λ_GDL (GDL). The combination leading to the best results is highlighted in orange. (a) Experiments performed on FLAIR sequences of ISLES 2015. (b) Experiments performed on the DWI modality of ISLES 2022.
Figure 4: Learning curves comparison: (a) Proposed model, (b) SGD, (c) W/O A.M., (d) CBAM, (e) Dice Loss, (f) Focal Loss.
Figure 5: Visual comparison of the model versions' results: ground truth masks are displayed in the first column (a,g,m,s); results of the Proposed model in the second column (b,h,n,t), the Dice Loss model in the third column (c,i,o,u), the Focal Loss model in the fourth column (d,j,p,v), the W/O A.M. model in the fifth column (e,k,q,w), and the CBAM model in the sixth column (f,l,r,x).
Figure 6: Violin plot of the proposed model's results on FLAIR images using the axial view, where the dot localizes the median and the white line represents the mean: (a) IoU scores by mask-size category, (b) F1-Scores by mask-size category.
Figure 7: Performance of the proposed model in segmenting small lesions on different MRI modalities using the ISLES 2015 dataset (dot and white line represent the median and mean values): (a) IoU scores by MRI modality, (b) F1-Scores by MRI modality.
Figure 8: Overall performance of the proposed model on different MRI modalities using the ISLES 2015 dataset (dot and white line represent the median and mean values): (a) IoU scores in the coronal plane, (b) IoU scores in the sagittal plane.
Figure 9: Examples of segmented FLAIR images in the coronal plane by the proposed method (second row) and their corresponding ground truth masks (first row) for mask categories Small (a,e), Medium Down (b,f), Medium Up (c,g), and Large (d,h).
Figure 10: Examples of segmented FLAIR images in the sagittal plane by the proposed method (second row) and their corresponding ground truth masks (first row) for mask categories Small (a,e), Medium Down (b,f), Medium Up (c,g), and Large (d,h).
Figure 11: Violin plot of the proposed model's results on DWI and ADC images using the axial view, where the dot localizes the median and the white line represents the mean: (a) F1-Scores by mask-size category using DWI and configuration A (FL = 0.7, GDL = 0.3), (b) F1-Scores by mask-size category using DWI and configuration B (FL = 0.9, GDL = 0.1), (c) F1-Scores by mask-size category using ADC and configuration A (FL = 0.7, GDL = 0.3), (d) F1-Scores by mask-size category using ADC and configuration B (FL = 0.9, GDL = 0.1).
Figure 12: Violin plot of the non-segmented images' mask size in pixels. The mean value is marked as a white line.
Figure 13: Examples of ground truth masks of DWI images in the axial plane (first row) and the segmentation results by the proposed method using λ_GDL = 0.3, λ_FL = 0.7 (second row) and λ_GDL = 0.1, λ_FL = 0.9 (third row) for mask categories Small (a,e,i), Medium Down (b,f,j), Medium Up (c,g,k), and Large (d,h,l).
23 pages, 30954 KiB  
Article
A Deep CNN-Based Salinity and Freshwater Fish Identification and Classification Using Deep Learning and Machine Learning
by Wahidur Rahman, Mohammad Motiur Rahman, Md Ariful Islam Mozumder, Rashadul Islam Sumon, Samia Allaoua Chelloug, Rana Othman Alnashwan and Mohammed Saleh Ali Muthanna
Sustainability 2024, 16(18), 7933; https://doi.org/10.3390/su16187933 - 11 Sep 2024
Abstract
For the oversight and safeguarding of aquatic environments, it is necessary to ascertain the quantity of fish, their size, and their distribution. Many deep learning (DL), artificial intelligence (AI), and machine learning (ML) techniques have been developed to monitor and safeguard fish species. Still, previous work has had limitations, such as small datasets, only binary categorization, or the use of a single technique (ML or DL). The proposed work therefore develops an architecture that addresses these limitations. Both DL and ML techniques are used in the suggested framework to identify and categorize multiple classes of salinity and freshwater fish species. Two datasets of fish images covering thirteen fish species were employed in the current research. Seven CNN architectures were implemented to extract the important features of the fish images, and seven ML classifiers were then used to identify the binary class (freshwater or salinity) of the fish species. Following that, multiclass classification of the thirteen fish species was evaluated with the ML algorithms, where the present model identifies the specific freshwater or salinity species. Several assessments of the experimental data are provided to address the primary goals of the study. The results indicate that the DenseNet121, EfficientNetB0, ResNet50, VGG16, and VGG19 CNN architectures with the SVC ML technique achieved 100% accuracy, F1-score, precision, and recall for binary classification (freshwater/salinity) of fish images. Additionally, the ResNet50 CNN architecture with the SVC ML technique achieved 98.06% and 100% accuracy for multiclass classification (freshwater and salinity fish species) of fish images. The proposed pipeline can thus be very effective for fish identification and classification in sustainable fish management.
(This article belongs to the Special Issue Sustainable Engineering Applications of Artificial Intelligence)
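The overall pipeline of deep feature extraction followed by a classical ML classifier can be sketched as follows; the specific backbone call (torchvision's ResNet50, here without downloaded weights to keep the sketch offline) and the SVC settings are illustrative assumptions and are not claimed to match the authors' configuration.

```python
import torch
import torchvision.models as models
from sklearn.svm import SVC

# CNN backbone as a fixed feature extractor; pretrained weights would normally be loaded
backbone = models.resnet50(weights=None)
backbone.fc = torch.nn.Identity()   # drop the classification head, keep 2048-d features
backbone.eval()

images = torch.randn(8, 3, 224, 224)   # stand-in for preprocessed fish photos
labels = [0, 1, 0, 1, 1, 0, 1, 0]      # dummy labels: 0 = freshwater, 1 = salinity

with torch.no_grad():
    features = backbone(images).numpy()   # (8, 2048) deep feature vectors

# Classical ML classifier on top of the deep features
clf = SVC(kernel="rbf")
clf.fit(features, labels)
print(clf.predict(features[:2]))
```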
Figure 1: Overall system illustration.
Figure 2: Learning curve of binary classification.
Figure 3: Confusion matrix of binary classification.
Figure 4: Bar chart of the accuracy, recall, precision, and F1-score of binary classification.
Figure 5: The learning curve of multiclass classification (freshwater fish).
Figure 6: Confusion matrix of multiclass classification (freshwater fish).
Figure 7: ROC of multiclass classification (freshwater fish).
Figure 8: Accuracy, recall, precision, and F1-score of multiclass freshwater fish classification.
Figure 9: Learning curve of multiclass classification (salinity fish).
Figure 10: Confusion matrix of multiclass classification (salinity fish).
Figure 11: ROC of multiclass classification (salinity fish).
Figure 12: Accuracy, recall, precision, and F1-score of multiclass salinity fish classification.
21 pages, 20841 KiB  
Article
Snow Detection in Gaofen-1 Multi-Spectral Images Based on Swin-Transformer and U-Shaped Dual-Branch Encoder Structure Network with Geographic Information
by Yue Wu, Chunxiang Shi, Runping Shen, Xiang Gu, Ruian Tie, Lingling Ge and Shuai Sun
Remote Sens. 2024, 16(17), 3327; https://doi.org/10.3390/rs16173327 - 8 Sep 2024
Abstract
Snow detection is imperative in remote sensing for various applications, including climate change monitoring, water resources management, and disaster warning. Recognizing the limitations of current deep learning algorithms in cloud and snow boundary segmentation, as well as issues such as the loss of detailed snow information and the omission of mountainous snow, this paper presents a novel snow detection network based on a Swin-Transformer and U-shaped dual-branch encoder structure with geographic information (SD-GeoSTUNet), aiming to address the above issues. The SD-GeoSTUNet incorporates a CNN branch and a Swin-Transformer branch to extract features in parallel, and a Feature Aggregation Module (FAM) is designed to aggregate detail features from the two branches. Simultaneously, an Edge-enhanced Convolution (EeConv) is introduced to promote snow boundary contour extraction in the CNN branch. In particular, auxiliary geographic information, including altitude, longitude, latitude, slope, and aspect, is encoded in the Swin-Transformer branch to enhance snow detection in mountainous regions. Experiments conducted on Levir_CS, a large-scale cloud and snow dataset derived from Gaofen-1, demonstrate that SD-GeoSTUNet achieves optimal performance, with values of 78.08%, 85.07%, and 92.89% for IoU_s, F1_s, and MPA, respectively, leading to superior cloud and snow boundary segmentation and thin cloud and snow detection. Further, ablation experiments reveal that integrating slope and aspect information effectively alleviates the omission of snow in mountainous areas and yields the best visual results under complex terrain. The proposed model can be applied to remote sensing data with geographic information to achieve more accurate snow extraction, which is conducive to promoting hydrological and agricultural research across different geospatial settings.
(This article belongs to the Section Environmental Remote Sensing)
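A toy version of the dual-branch fusion idea (CNN features and transformer features merged by an aggregation module) is sketched below; the real FAM, EeConv, and geographic-information encoding of SD-GeoSTUNet are not reproduced here, and all layer sizes are assumptions.

```python
import torch
import torch.nn as nn

class SimpleFAM(nn.Module):
    """Toy feature-aggregation module: concatenate two branches, fuse with a 1x1 conv."""
    def __init__(self, c_cnn: int, c_trans: int, c_out: int):
        super().__init__()
        self.fuse = nn.Sequential(
            nn.Conv2d(c_cnn + c_trans, c_out, kernel_size=1),
            nn.BatchNorm2d(c_out),
            nn.ReLU(inplace=True),
        )

    def forward(self, f_cnn: torch.Tensor, f_trans: torch.Tensor) -> torch.Tensor:
        # Assumes both branches produce feature maps at the same spatial resolution
        return self.fuse(torch.cat([f_cnn, f_trans], dim=1))

# Dummy feature maps from a CNN branch and a transformer branch
f_cnn = torch.randn(1, 64, 56, 56)
f_trans = torch.randn(1, 96, 56, 56)
print(SimpleFAM(64, 96, 128)(f_cnn, f_trans).shape)  # torch.Size([1, 128, 56, 56])
```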
Figure 1: Overview of the proposed SD-GeoSTUNet, with the two encoders hereinafter referred to as the CNN branch and Swin-T branch, respectively. (a) Overview of the Decoder Layer. (b) Overview of the Feature Aggregation Module (FAM). (c) Overview of the Residual Layer.
Figure 2: (a) Overview of Edge-enhanced Convolution (EeConv), including a vanilla convolution, a central difference convolution (CDC), an angular difference convolution (ADC), a horizontal difference convolution (HDC), and a vertical difference convolution (VDC). (b) The principle of vertical difference convolution (VDC).
Figure 3: The global distribution of the images in the experiment. Base map from Cartopy.
Figure 4: Segmentation results of cloud and snow under different module combinations. (a) RGB true color image, (b) Label, (c) Concatenation, (d) FAM, (e) FAM + EeConv. In (b–e), black, blue, and white pixels represent background, cloud, and snow, respectively. The scene in the first row is centered at 109.6°E, 48.8°N, dated 17 October 2017; the scene in the second row is centered at 86.6°E, 28.5°N, dated 30 November 2016.
Figure 5: Detection results of snow in mountain regions under different geographic-information feature combination ablation experiments, with enlarged views marked by blue, green, and red boxes. (a) RGB true color image. (b) Label. (c) Experiment 1 detection results. (d) Experiment 2 detection results. (e) Experiment 3 detection results. In (b–e), black and white pixels represent the background and snow, respectively. The scene is centered at 128.9°E, 44.7°N, dated 27 January 2018.
Figure 6: Detection results for coexisting cloud and snow (excluding the background) and the enlarged views (marked with the red box in the figure). (a) RGB true color image. (b) Label. (c) PSPNet. (d) Segformer. (e) U-Net. (f) CDNetV2. (g) GeoInfoNet. (h) SD-GeoSTUNet. In (b–h), black, blue, and white pixels represent the background, cloud, and snow, respectively. The scene in the first row is centered at 104.2°E, 31.3°N, dated 29 March 2018; the scene in the third row is centered at 104.6°E, 33.0°N, dated 29 March 2018.
Figure 7: Detection results for pure snow (excluding the background) and the enlarged views (marked with the red box in the figure). (a) RGB true color image. (b) Label. (c) PSPNet. (d) Segformer. (e) U-Net. (f) CDNetV2. (g) GeoInfoNet. (h) SD-GeoSTUNet. In (b–h), black, blue, and white pixels represent the background, cloud, and snow, respectively. The scene in the first row is centered at 128.6°E, 51.3°N, dated 4 January 2016; the scene in the third row is centered at 133.7°E, 50.9°N, dated 4 February 2018.
Figure 8: Detection results for pure cloud (excluding the background) and the enlarged views (marked with the red box in the figure). (a) RGB true color image. (b) Label. (c) PSPNet. (d) Segformer. (e) U-Net. (f) CDNetV2. (g) GeoInfoNet. (h) SD-GeoSTUNet. In (b–h), black, blue, and white pixels represent the background, cloud, and snow, respectively. This scene is centered at 4.1°W, 54.2°N, dated 13 July 2016.
15 pages, 4574 KiB  
Article
Student Behavior Recognition in Classroom Based on Deep Learning
by Qingzheng Jia and Jialiang He
Appl. Sci. 2024, 14(17), 7981; https://doi.org/10.3390/app14177981 - 6 Sep 2024
Abstract
With the widespread application of information technology in education, the real-time detection of student behavior in the classroom has become a key issue in improving teaching quality. This paper proposes a Student Behavior Detection (SBD) model that combines YOLOv5, the Contextual Attention (CA) mechanism, and OpenPose, aiming to achieve efficient and accurate behavior recognition in complex classroom environments. By integrating YOLOv5 with the CA attention mechanism to enhance feature extraction, the model's recognition performance in complex backgrounds, such as those with occlusion, is significantly improved. In addition, the feature map generated by the improved YOLOv5 is used to replace VGG-19 in OpenPose, which effectively improves the accuracy of student posture recognition. The experimental results demonstrate that the proposed model achieves a maximum mAP of 82.1% in complex classroom environments, surpassing Faster R-CNN by 5.2 percentage points and YOLOv5 by 4.6 percentage points. The F1 score and R value of this model also show clear advantages over the other two traditional methods. This model offers an effective solution for intelligent classroom behavior analysis and the optimization of educational management.
(This article belongs to the Special Issue Intelligent Techniques, Platforms and Applications of E-learning)
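As a generic illustration of inserting an attention block into a detection backbone's feature maps, the sketch below uses a simple channel gate; the exact CA variant used in the paper is not detailed in this listing, so this block is only a stand-in, with assumed channel counts and reduction ratio.

```python
import torch
import torch.nn as nn

class ChannelGate(nn.Module):
    """Stand-in attention block: re-weights backbone channels before the detection head."""
    def __init__(self, channels: int, reduction: int = 8):
        super().__init__()
        self.gate = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, 1), nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, 1), nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x * self.gate(x)

# Wrap a (dummy) backbone stage output before it reaches the detection neck/head
backbone_feat = torch.randn(1, 256, 40, 40)
print(ChannelGate(256)(backbone_feat).shape)  # torch.Size([1, 256, 40, 40])
```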
Figure 1: Detection model diagram.
Figure 2: Framework of Student Behavior Detection.
Figure 3: YOLOv5 network structure.
Figure 4: CA attention mechanism.
Figure 5: YOLOv5-A network structure.
Figure 6: OpenPose network structure.
Figure 7: Detection flow chart.
Figure 8: Precision curve of the YOLOv5 model.
Figure 9: Precision curve of the SBD model.
Figure 10: PR curve of the SBD model.
Figure 11: Actual detection results of SBD.
25 pages, 10917 KiB  
Article
Promoting Sustainable Development of Coal Mines: CNN Model Optimization for Identification of Microseismic Signals Induced by Hydraulic Fracturing in Coal Seams
by Nan Li, Yunpeng Zhang, Xiaosong Zhou, Lihong Sun, Xiaokai Huang, Jincheng Qiu, Yan Li and Xiaoran Wang
Sustainability 2024, 16(17), 7592; https://doi.org/10.3390/su16177592 - 2 Sep 2024
Abstract
Borehole hydraulic fracturing in coal seams can prevent dynamic coal mine disasters and promote the sustainability of the mining industry, and microseismic signal recognition is a prerequisite and foundation for the microseismic monitoring technology that evaluates the effectiveness of hydraulic fracturing. This study constructed ultra-lightweight CNN models specifically designed to identify microseismic waveforms induced by borehole hydraulic fracturing in coal seams, namely Ul-Inception28, Ul-ResNet12, Ul-MobileNet17, and Ul-TripleConv8. The three best-performing models were selected to create both a probability-averaging ensemble CNN model and a voting ensemble CNN model. Additionally, an automatic threshold adjustment strategy for CNN identification was introduced. The relationships between feature map entropy, training data volume, and model performance were also analyzed. The results indicated that our in-house models surpassed the performance of the InceptionV3, ResNet50, and MobileNetV3 models from the TensorFlow Keras library. Notably, the voting ensemble CNN model achieved an improvement of at least 0.0452 in F1 score compared to the individual models. The automatic threshold adjustment strategy refined the identification threshold to a precision of 26 decimal places. However, a continuous zero-entropy value in the feature maps of various channels was found to detract from the model's generalization performance. Moreover, the expanded training dataset, derived from thousands of waveforms, proved better suited to CNN models comprising hundreds of thousands of parameters. The findings of this research contribute to the prevention of dynamic coal mine disasters, potentially reducing casualties and economic losses and promoting the sustainable progress of the coal mining industry.
(This article belongs to the Section Hazards and Sustainability)
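The two ensembling strategies described above, probability averaging and majority voting over several CNN classifiers, can be sketched with plain NumPy; the per-model probabilities below are dummy values, and the 0.5 threshold is only a starting point for the automatic threshold adjustment the paper describes.

```python
import numpy as np

# Dummy per-model probabilities that each waveform is a microseismic event
# (rows: 3 CNN models, columns: 5 candidate waveforms)
probs = np.array([
    [0.91, 0.12, 0.55, 0.48, 0.97],
    [0.88, 0.05, 0.61, 0.52, 0.97],
    [0.95, 0.20, 0.40, 0.45, 0.93],
])
threshold = 0.5   # starting identification threshold (the paper tunes this automatically)

# Probability-averaging ensemble: average model outputs, then apply the threshold
avg_decision = probs.mean(axis=0) > threshold

# Voting ensemble: each model votes with its own threshold; majority wins
votes = (probs > threshold).sum(axis=0)
vote_decision = votes >= 2   # at least 2 of 3 models agree

print(avg_decision, vote_decision)
```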
Figure 1: Schematic diagram of different convolution methods.
Figure 2: Structure of the Ul-Inception28 model.
Figure 3: Structure of the Ul-ResNet12 model.
Figure 4: Structure of the Ul-MobileNet17 model.
Figure 5: Structure of the Ul-TripleConv8 model.
Figure 6: Flowchart of the automatic adjustment strategy for the identification threshold.
Figure 7: Flowchart of microseismic waveform identification by the probability-averaged CNN model.
Figure 8: Flowchart of microseismic waveform identification by the voting ensemble CNN model.
Figure 9: Training and testing accuracy and loss functions of different CNN models.
Figure 10: Ul-MobileNet automatic threshold adjustment identification process.
Figure 11: Examples of identified event results. (a) Microseismic event 1; (b) microseismic event 2.
Figure 12: Time-domain images of three microseismic waveforms. (a) Background noise; (b) weak microseismic waveform; (c) high signal-to-noise ratio microseismic waveform.
Figure 13: Feature maps of the first channel in the first convolutional layer of Ul-Inception28 for different images. (a) Background noise; (b) high signal-to-noise ratio microseismic waveform.
Figure 14: Feature maps of the first channel in different convolutional layers of Ul-Inception28. (a) The 1st convolutional layer; (b) the 10th convolutional layer; (c) the 19th convolutional layer; (d) the 28th convolutional layer.
Figure 15: Feature maps of the first channel in different convolutional layers of Ul-ResNet12. (a) The 1st convolutional layer; (b) the 5th convolutional layer; (c) the 9th convolutional layer; (d) the 12th convolutional layer.
Figure 16: Feature maps of the first channel in different convolutional layers of Ul-MobileNet17. (a) The 1st convolutional layer; (b) the 6th convolutional layer; (c) the 11th convolutional layer; (d) the 17th convolutional layer.
Figure 17: Feature maps of the first channel in different convolutional layers of Ul-TripleConv8. (a) The first convolutional layer; (b) the third convolutional layer; (c) the fifth convolutional layer; (d) the eighth convolutional layer.
Figure 18: Feature maps of different channels in the last convolutional layer of Ul-MobileNet17. (a) The 64th channel; (b) the 128th channel; (c) the 192nd channel; (d) the 256th channel.
Figure 19: Feature maps of different channels in the last convolutional layer of MobileNetV3. (a) The 64th channel; (b) the 128th channel; (c) the 192nd channel; (d) the 256th channel.
Figure 20: Entropy values of feature maps from all channels of all convolutional layers across different models. (a) InceptionV3; (b) ResNet50; (c) MobileNetV3; (d) Ul-TripleConv8; (e) Ul-ResNet12; (f) Ul-MobileNet17; (g) Ul-Inception28.
Figure 21: F1 scores of microseismic waveform recognition for different models trained on various datasets.
25 pages, 12480 KiB  
Article
EFS-Former: An Efficient Network for Fruit Tree Leaf Disease Segmentation and Severity Assessment
by Donghui Jiang, Miao Sun, Shulong Li, Zhicheng Yang and Liying Cao
Agronomy 2024, 14(9), 1992; https://doi.org/10.3390/agronomy14091992 - 2 Sep 2024
Abstract
Fruit is a major source of vitamins, minerals, and dietary fiber in people's daily lives. Leaf diseases caused by climate change and other factors have significantly reduced fruit production, and deep learning methods for segmenting leaf diseases can effectively mitigate this issue. However, challenges such as leaf folding, jagged edges, and light shading make edge feature extraction difficult and affect segmentation accuracy. To address these problems, this paper proposes a method based on EFS-Former. The expanded local detail (ELD) module extends the model's receptive field through expanded convolutions, better handling fine spots and effectively reducing information loss. H-attention reduces computational redundancy by superimposing multi-layer convolutions, significantly improving feature filtering. The parallel fusion architecture exploits the different feature extraction ranges of the convolutional neural network (CNN) and Transformer encoders, achieving comprehensive feature extraction and effectively fusing detailed and semantic information in the channel and spatial dimensions within the feature fusion module (FFM). Experiments show that, compared to DeepLabV3+, this method achieves 10.78%, 9.51%, 0.72%, and 8.00% higher scores for mean intersection over union (mIoU), mean pixel accuracy (mPA), accuracy (Acc), and F_score, respectively, while having 1.78 M fewer total parameters and 0.32 G fewer floating point operations (FLOPs). It is also effective at estimating the disease stage by calculating the ratio of leaf area occupied by diseased spots. The method's overall performance, evaluated with the mIoU, mPA, Acc, and F_score metrics, reaches 88.60%, 93.49%, 98.60%, and 95.90%, respectively. In summary, this study offers an efficient and accurate method for fruit tree leaf spot segmentation, providing a solid foundation for the precise analysis of fruit tree leaves and spots and supporting precision pesticide spraying in smart agriculture.
(This article belongs to the Special Issue The Applications of Deep Learning in Smart Agriculture)
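The severity estimate mentioned in the abstract, the ratio of spot area to leaf area, reduces to a pixel count over the predicted segmentation mask; the class indices in the sketch below (0 = background, 1 = healthy leaf, 2 = spot) are assumptions.

```python
import numpy as np

def lesion_ratio(mask: np.ndarray, leaf_cls: int = 1, spot_cls: int = 2) -> float:
    """Fraction of the leaf surface covered by disease spots, from a class-index mask."""
    spot = np.count_nonzero(mask == spot_cls)
    leaf = np.count_nonzero(mask == leaf_cls) + spot   # spots lie on the leaf
    return spot / leaf if leaf else 0.0

# Dummy 4x4 prediction: mostly healthy leaf with a few spot pixels
mask = np.array([[0, 1, 1, 1],
                 [1, 1, 2, 1],
                 [1, 2, 2, 1],
                 [0, 1, 1, 0]])
print(f"{lesion_ratio(mask):.2%}")  # about 23% of the leaf area is diseased
```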
Figure 1: Original and annotated images: (a) apple spotted leaf drop disease, (b) early stage of grape brown spot disease, (c) late stage of grape brown spot disease, (d) early stage of grape black rot disease, (e) late stage of grape black rot disease, and (f) pomegranate cercospora spot.
Figure 2: Original image and enhanced images: (a) original image, (b) reduced brightness and flipped, (c) randomly flipped, (d) random zoom, (e) white mask block added, (f) panning and increased brightness.
Figure 3: The overall architecture of EFS-Former, including the main parallel-fusion structure, CNN encoder, feature fusion module, and improved Transformer encoder.
Figure 4: ELD module architecture diagram.
Figure 5: Seg-Block architecture diagram.
Figure 6: Overall architecture of the FFM.
Figure 7: Result images generated by different models for four disease conditions of three fruits: (a) apple spotted leaf drop disease, (b) grape brown spot disease, (c) grape black rot disease, and (d) pomegranate cercospora spot.
Figure 8: Visualization of segmentation results for different methods. (a) Apple spotted leaf drop disease, (b,c) late and early stages of grape black rot disease, (d,e) early and late stages of grape brown spot disease, and (f) pomegranate cercospora spot.
Figure 9: Effective attention results of different models for indoor and outdoor leaves. (a) Grape brown spot disease, (b) apple spotted leaf drop disease, and (c) pomegranate cercospora spot.
Figure 10: Results of different models for effective attention to leaf spots indoors and outdoors. (a) Grape brown spot disease, (b) apple spotted leaf drop disease, and (c) pomegranate cercospora spot.
19 pages, 2702 KiB  
Article
Modeling and Forecasting Ionospheric foF2 Variation Based on CNN-BiLSTM-TPA during Low- and High-Solar Activity Years
by Baoyi Xu, Wenqiang Huang, Peng Ren, Yi Li and Zheng Xiang
Remote Sens. 2024, 16(17), 3249; https://doi.org/10.3390/rs16173249 - 2 Sep 2024
Abstract
The transmission of high-frequency signals over long distances depends on the ionosphere's reflective properties, and the selection of operating frequencies is closely tied to variations in the ionosphere. Accurate prediction of the ionospheric critical frequency foF2 and other parameters at low latitudes is of great significance for understanding ionospheric changes in high-frequency communications. Deep learning algorithms currently show significant advantages in capturing the characteristics of the ionosphere. In this paper, a state-of-the-art hybrid neural network is combined with a temporal pattern attention mechanism to predict variations in foF2 during high- and low-solar-activity years. Convolutional neural networks (CNNs) and bidirectional long short-term memory (BiLSTM), which are capable of extracting spatiotemporal features of ionospheric variations, are incorporated into the hybrid network. The foF2 data used for training and testing come from three observatories, at Brisbane (27°53′S, 152°92′E), Darwin (12°45′S, 130°95′E), and Townsville (19°63′S, 146°85′E), in 2000, 2008, 2009 and 2014 (the peak or trough years of solar activity in solar cycles 23 and 24), acquired with the advanced Australian Digital Ionospheric Sounder. The results show that the proposed model accurately captures the changes in ionospheric foF2 characteristics and outperforms the International Reference Ionosphere 2020 (IRI-2020) and BiLSTM ionospheric prediction models.
(This article belongs to the Special Issue Ionosphere Monitoring with Remote Sensing (3rd Edition))
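A stripped-down version of the hybrid model (1-D convolution over the input window, a bidirectional LSTM, and a regression head) is sketched below; the temporal pattern attention step is omitted for brevity, and all dimensions and driver variables are illustrative assumptions.

```python
import torch
import torch.nn as nn

class CNNBiLSTM(nn.Module):
    """Sketch: Conv1d feature extraction + BiLSTM over time + linear head for foF2."""
    def __init__(self, n_features: int, hidden: int = 32):
        super().__init__()
        self.conv = nn.Conv1d(n_features, 16, kernel_size=3, padding=1)
        self.lstm = nn.LSTM(16, hidden, batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, 1)

    def forward(self, x):                              # x: (batch, time, features)
        z = torch.relu(self.conv(x.transpose(1, 2)))   # -> (batch, 16, time)
        out, _ = self.lstm(z.transpose(1, 2))          # -> (batch, time, 2*hidden)
        return self.head(out[:, -1])                   # predict foF2 at the next step

# Dummy batch: 24-hour windows of 5 drivers (e.g., Dst, Ap, F10.7, IMF Bz, SSN)
x = torch.randn(8, 24, 5)
print(CNNBiLSTM(n_features=5)(x).shape)  # torch.Size([8, 1])
```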
Figure 1: Time series of hourly data in 2009 and 2014: (a,b) geomagnetic index Dst; (c,d) geomagnetic index Ap; (e,f) solar activity index F10.7; (g,h) geomagnetic index IMF Bz; (i,j) solar activity index SSN.
Figure 2: Temporal pattern attention mechanism.
Figure 3: Structure diagram of the hybrid neural network.
Figure 4: (a–l) Comparison of the ionospheric prediction models' performance on test samples in 2009.
Figure 5: (a–l) Comparison of the ionospheric prediction models' performance on test samples in 2014.
Figure 6: Cumulative distributions of sample errors at Brisbane for 2008 and 2009: (a,d) IRI-2020; (b,e) BiLSTM-foF2; (c,f) proposed model.
Figure 7: Cumulative distributions of sample errors at Darwin for 2008 and 2009: (a,d) IRI-2020; (b,e) BiLSTM-foF2; (c,f) proposed model.
Figure 8: Cumulative distributions of sample errors at Townsville for 2008 and 2009: (a,d) IRI-2020; (b,e) BiLSTM-foF2; (c,f) proposed model.
Figure 9: Histograms and normal-distribution curves of prediction errors at Brisbane for 2000 and 2014: (a,d) IRI-2020; (b,e) BiLSTM-foF2; (c,f) proposed model.
Figure 10: Histograms and normal-distribution curves of prediction errors at Darwin for 2000 and 2014: (a,d) IRI-2020; (b,e) BiLSTM-foF2; (c,f) proposed model.
Figure 11: Histograms and normal-distribution curves of prediction errors at Townsville for 2000 and 2014: (a,d) IRI-2020; (b,e) BiLSTM-foF2; (c,f) proposed model.
37 pages, 76788 KiB  
Article
Machine Learning-Based Remote Sensing Inversion of Non-Photosynthetic/Photosynthetic Vegetation Coverage in Desertified Areas and Its Response to Drought Analysis
by Zichen Guo, Shulin Liu, Kun Feng, Wenping Kang and Xiang Chen
Remote Sens. 2024, 16(17), 3226; https://doi.org/10.3390/rs16173226 - 31 Aug 2024
Abstract
Determining the responses of non-photosynthetic vegetation (NPV) and photosynthetic vegetation (PV) communities to climate change is crucial for illustrating the sensitivity and sustainability of these ecosystems. In this study, we evaluated the accuracy of inverting NPV and PV from Landsat imagery with random forest (RF), backpropagation neural network (BPNN), and fully connected neural network (FCNN) models. Additionally, we inverted MODIS NPV and PV time-series data using spectral unmixing. On this basis, we analyzed the responses of NPV and PV to precipitation and drought across different ecological regions. The main conclusions are as follows: (1) In NPV remote sensing inversion, the softmax activation function demonstrates greater advantages than the ReLU activation function; specifically, using softmax increases the R2 value by approximately 0.35. (2) Compared with a five-layer FCNN with 128 neurons and a three-layer BPNN with 12 neurons, a random forest model with over 50 trees and 5 leaf nodes provides better inversion results for NPV and PV (R2_RF-NPV = 0.843, R2_RF-PV = 0.861). (3) Long-term drought or heavy rainfall events can affect the utilization of precipitation by NPV and PV, and there is a high correlation between extreme precipitation events following prolonged drought and an increase in PV coverage. (4) Under long-term drought conditions, the vegetation in the study area responded to precipitation during the previous winter and growing season. This study illustrates the response of semi-arid ecosystems to drought and wetting events, thereby offering a data basis for evaluating the effects of afforestation projects.
(This article belongs to the Special Issue Machine Learning for Spatiotemporal Remote Sensing Data (2nd Edition))
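The random forest configuration reported as performing best (more than 50 trees, 5 leaf nodes) maps naturally onto scikit-learn; interpreting "5 leaf nodes" as a minimum leaf size of 5 is an assumption, and the dummy reflectance features below are purely illustrative.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Dummy training set: Landsat band reflectances -> NPV fractional coverage
rng = np.random.default_rng(0)
X = rng.random((200, 6))        # e.g., six Landsat surface-reflectance bands
y = rng.random(200)             # NPV coverage fraction from interpreted samples

# "More than 50 trees, 5 leaf nodes": interpreted here as min_samples_leaf=5 (an assumption)
rf = RandomForestRegressor(n_estimators=60, min_samples_leaf=5, random_state=0)
rf.fit(X, y)
print(rf.predict(X[:3]))
```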
Figure 1: (a) Desertification types in the study area; (b) annual mean monthly precipitation in the study area; (c) location of the study area in a semi-arid region of China.
Figure 2: Technical workflow diagram of this study.
Figure 3: Box plot of non-photosynthetic vegetation coverage for different desertification types and degrees.
Figure 4: Box plot of photosynthetic vegetation coverage for different desertification types and degrees.
Figure 5: (a) Response of non-photosynthetic vegetation to annual precipitation. (b) Response of photosynthetic vegetation to annual precipitation. (c) Response of non-photosynthetic vegetation to annual mean temperature. (d) Response of photosynthetic vegetation to annual mean temperature.
Figure 6: Time-lagged response of NPV and PV to monthly precipitation for mobile dune, coppice dune, and Gobi desertification during dry years (a–c) and wet years (d–f). MBD: mobile dune desertification; CD: coppice dune desertification; GD: Gobi desertification; MD: mild; MOD: moderate; SD: severe desertification. K is the response degree of NPV and PV to precipitation for each desertification type and degree.
Figure 7: Time-delay correlation (R²) of NPV and PV with monthly precipitation for mobile dune, coppice dune, and Gobi desertification during dry years (a–c) and wet years (d–f). Abbreviations as in Figure 6; R² is the correlation.
Figure 8: Response of NPV and PV to SPEI for mobile dune, coppice dune, and Gobi desertification in 2000 (a–c), 2005 (d–f), 2010 (g–i), and 2015 (j–l). Abbreviations as in Figure 6; K is the response degree.
Figure 9: Correlation of NPV and PV with SPEI for mobile dune, coppice dune, and Gobi desertification in 2000 (a–c), 2005 (d–f), 2010 (g–i), and 2015 (j–l). Abbreviations as in Figure 6; R² is the correlation.
Figure 10: (a) Local NPV cover from MODIS in 2019; (b) SWIR67 index in 2019; (c) local NPV from MODIS in 2018.
Figure 11: Spectra of photosynthetic and non-photosynthetic components of the major vegetation types and major soil types in the study area.
Figure A1: (a) Parameter selection process for the RF models. (b) Error distribution of the random forest models. (c) Parameter selection process for the BPNN. (d) Parameter selection process for the FCNN.
Figure A2: Regression relationships between Landsat 8 and Landsat 5 bands: (a) B2 vs. B1, (b) B3 vs. B2, (c) B4 vs. B3, (d) B5 vs. B4, (e) B6 vs. B5, (f) B7 vs. B6.
Figure A3: (a) Non-photosynthetic and (b) photosynthetic vegetation coverage at the end of the growing season in 2000.
Figure A4: (a) Non-photosynthetic and (b) photosynthetic vegetation coverage at the end of the growing season in 2005.
Figure A5: (a) Non-photosynthetic and (b) photosynthetic vegetation coverage at the end of the growing season in 2010.
Figure A6: (a) Non-photosynthetic and (b) photosynthetic vegetation coverage at the end of the growing season in 2015.
Figure A7: Response degree of non-photosynthetic vegetation at the end of the growing season to total precipitation over (a) 1–9, (b) 10–21, (c) 22–33, (d) 34–45, and (e) 46–57 months.
Figure A8: Response degree of photosynthetic vegetation at the end of the growing season to total precipitation over (a) 0–9, (b) 9–21, (c) 21–33, (d) 33–45, and (e) 45–57 months.
Figure A9: Response of non-photosynthetic vegetation at the end of the growing season to mean temperature over (a) 0–9, (b) 9–21, (c) 21–33, (d) 33–45, and (e) 45–57 months.
Figure A10: Response of photosynthetic vegetation at the end of the growing season to mean temperature over (a) 0–9, (b) 9–21, (c) 21–33, (d) 33–45, and (e) 45–57 months.
Figure A11: Time-delay responses of non-photosynthetic and photosynthetic vegetation coverage to monthly precipitation for different desertification types and degrees. Abbreviations as in Figure 6; K is the response degree of NPV and PV to precipitation for each desertification type and degree.
Full article ">Figure A12
<p>Time-delay correlation (R<sup>2</sup>) of non-photosynthetic and photosynthetic vegetation cover with monthly precipitation in different desertification types and degrees. Figure note: MBD represents mobile dune desertification; CD represents coppice dune desertification; GD represents Gobi desertification; MD represents mild desertification; MOD represents moderate desertification; and SD represents severe desertification. R<sup>2</sup> is the correlation.</p>
Full article ">Figure A13
<p>(<b>a</b>) Spatial distribution map of 1-month SPEI series in September 2000. (<b>b</b>) Spatial distribution map of 3-month SPEI series in September 2000. (<b>c</b>) Spatial distribution map of 9-month SPEI series in September 2000. (<b>d</b>) Spatial distribution map of 12-month SPEI series in September 2000.</p>
Full article ">Figure A14
<p>(<b>a</b>) Spatial distribution map of 1-month SPEI series in September 2005. (<b>b</b>) Spatial distribution map of 3-month SPEI series in September 2005. (<b>c</b>) Spatial distribution map of 9-month SPEI series in September 2005. (<b>d</b>) Spatial distribution map of 12-month SPEI series in September 2005.</p>
Full article ">Figure A15
<p>(<b>a</b>) Spatial distribution map of 1-month SPEI series in September 2010. (<b>b</b>) Spatial distribution map of 3-month SPEI series in September 2010. (<b>c</b>) Spatial distribution map of 9-month SPEI series in September 2010. (<b>d</b>) Spatial distribution map of 12-month SPEI series in September 2010.</p>
Full article ">Figure A16
<p>(<b>a</b>) Spatial distribution map of 1-month SPEI series in September 2015. (<b>b</b>) Spatial distribution map of 3-month SPEI series in September 2015. (<b>c</b>) Spatial distribution map of 9-month SPEI series in September 2015. (<b>d</b>) Spatial distribution map of 12-month SPEI series in September 2015.</p>
Full article ">
32 pages, 10548 KiB  
Article
GAN-SkipNet: A Solution for Data Imbalance in Cardiac Arrhythmia Detection Using Electrocardiogram Signals from a Benchmark Dataset
by Hari Mohan Rai, Joon Yoo and Serhii Dashkevych
Mathematics 2024, 12(17), 2693; https://doi.org/10.3390/math12172693 - 29 Aug 2024
Cited by 1 | Viewed by 313
Abstract
Electrocardiography (ECG) plays a pivotal role in monitoring cardiac health, yet the manual analysis of ECG signals is challenging due to the complex task of identifying and categorizing various waveforms and morphologies within the data. Additionally, ECG datasets often suffer from a significant [...] Read more.
Electrocardiography (ECG) plays a pivotal role in monitoring cardiac health, yet the manual analysis of ECG signals is challenging due to the complex task of identifying and categorizing various waveforms and morphologies within the data. Additionally, ECG datasets often suffer from a significant class imbalance issue, which can lead to inaccuracies in detecting minority class samples. To address these challenges and enhance the effectiveness and efficiency of cardiac arrhythmia detection from imbalanced ECG datasets, this study proposes a novel approach. This research leverages the MIT-BIH arrhythmia dataset, encompassing a total of 109,446 ECG beats distributed across five classes following the Association for the Advancement of Medical Instrumentation (AAMI) standard. Given the dataset’s inherent class imbalance, a 1D generative adversarial network (GAN) model is introduced, incorporating the Bi-LSTM model to synthetically generate the two minority signal classes, which represent a mere 0.73% fusion (F) and 2.54% supraventricular (S) of the data. The generated signals are rigorously evaluated for similarity to real ECG data using three key metrics: mean squared error (MSE), structural similarity index (SSIM), and Pearson correlation coefficient (r). In addition to addressing data imbalance, the work presents three deep learning models tailored for ECG classification: SkipCNN (a convolutional neural network with skip connections), SkipCNN+LSTM, and SkipCNN+LSTM+Attention mechanisms. To further enhance efficiency and accuracy, the test dataset is rigorously assessed using an ensemble model, which consistently outperforms the individual models. The performance evaluation employs standard metrics such as precision, recall, and F1-score, along with their average, macro average, and weighted average counterparts. Notably, the SkipCNN+LSTM model emerges as the most promising, achieving remarkable precision, recall, and F1-scores of 99.3%, which were further elevated to an impressive 99.60% through ensemble techniques. Consequently, with this innovative combination of data balancing techniques, the GAN-SkipNet model not only resolves the challenges posed by imbalanced data but also provides a robust and reliable solution for cardiac arrhythmia detection. This model stands poised for clinical applications, offering the potential to be deployed in hospitals for real-time cardiac arrhythmia detection, thereby benefiting patients and healthcare practitioners alike. Full article
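The abstract above describes a 1D GAN whose generator incorporates a Bi-LSTM to synthesize the minority S and F beat classes (length-187 segments). The listing does not include the authors' code, so the following PyTorch sketch is only a minimal illustration of that idea; the layer sizes, the Tanh output range, and the training step are assumptions rather than the paper's implementation.

```python
# Minimal sketch of a 1D GAN with a Bi-LSTM generator for minority-class ECG beats.
# Beat length (187), hidden sizes, and the training schedule are illustrative assumptions.
import torch
import torch.nn as nn

BEAT_LEN, LATENT = 187, 64

class Generator(nn.Module):
    def __init__(self):
        super().__init__()
        # The Bi-LSTM consumes the latent vector as a length-1 sequence, then a
        # linear head maps the hidden state to a full 187-sample beat.
        self.lstm = nn.LSTM(LATENT, 128, batch_first=True, bidirectional=True)
        self.head = nn.Sequential(nn.Linear(2 * 128, BEAT_LEN), nn.Tanh())

    def forward(self, z):                      # z: (batch, LATENT)
        out, _ = self.lstm(z.unsqueeze(1))     # (batch, 1, 256)
        return self.head(out.squeeze(1))       # (batch, BEAT_LEN)

class Discriminator(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(BEAT_LEN, 128), nn.LeakyReLU(0.2),
            nn.Linear(128, 1), nn.Sigmoid())

    def forward(self, x):                      # x: (batch, BEAT_LEN)
        return self.net(x)

def train_step(G, D, real, opt_g, opt_d, bce=nn.BCELoss()):
    """One adversarial update on a batch of real minority-class beats."""
    batch = real.size(0)
    ones, zeros = torch.ones(batch, 1), torch.zeros(batch, 1)
    fake = G(torch.randn(batch, LATENT))
    # Discriminator: push real beats toward 1 and synthetic beats toward 0.
    opt_d.zero_grad()
    d_loss = bce(D(real), ones) + bce(D(fake.detach()), zeros)
    d_loss.backward(); opt_d.step()
    # Generator: try to make the discriminator output 1 for synthetic beats.
    opt_g.zero_grad()
    g_loss = bce(D(fake), ones)
    g_loss.backward(); opt_g.step()
    return d_loss.item(), g_loss.item()
```

In practice, optimizers such as `torch.optim.Adam(G.parameters(), lr=2e-4)` and `torch.optim.Adam(D.parameters(), lr=2e-4)` would drive `train_step` over batches of real S- or F-class beats until the synthetic signals converge.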
Figure 1: Sample ECG beats in the AAMI classes. Zero padding was applied to standardize all segment lengths to 187.
Figure 2: (a) Schema of the Bi-LSTM architecture (left) and (b) schema of the attention model architecture (right).
Figure 3: Block diagram of the proposed methodology for the classification of cardiac arrhythmia.
Figure 4: Proposed GAN architecture for minority ECG data augmentation.
Figure 5: Layer-wise architectures of the proposed ECG classification deep models.
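Figure 5 refers to the layer-wise architectures of the SkipCNN-family classifiers. As a rough, hypothetical illustration of a 1D convolutional block with a skip connection feeding a 5-class (AAMI) head, consider the following PyTorch sketch; the channel counts, kernel sizes, and depth are assumptions, not the paper's configuration.

```python
# Hypothetical 1D convolutional block with a skip (residual) connection, in the
# spirit of the "SkipCNN" naming; all hyperparameters are assumed for illustration.
import torch
import torch.nn as nn

class SkipBlock1D(nn.Module):
    def __init__(self, channels=32, kernel_size=5):
        super().__init__()
        pad = kernel_size // 2                      # keep the temporal length unchanged
        self.conv1 = nn.Conv1d(channels, channels, kernel_size, padding=pad)
        self.conv2 = nn.Conv1d(channels, channels, kernel_size, padding=pad)
        self.relu = nn.ReLU()

    def forward(self, x):                           # x: (batch, channels, time)
        y = self.relu(self.conv1(x))
        y = self.conv2(y)
        return self.relu(x + y)                     # skip connection around the two convs

class SkipCNNClassifier(nn.Module):
    """Stacks an input projection, two skip blocks, and a 5-class head."""
    def __init__(self, beat_len=187, n_classes=5):
        super().__init__()
        self.stem = nn.Conv1d(1, 32, kernel_size=5, padding=2)
        self.blocks = nn.Sequential(SkipBlock1D(32), SkipBlock1D(32))
        self.pool = nn.AdaptiveAvgPool1d(1)
        self.fc = nn.Linear(32, n_classes)

    def forward(self, x):                            # x: (batch, 1, beat_len)
        h = self.blocks(self.stem(x))
        return self.fc(self.pool(h).squeeze(-1))     # class logits
```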
Figure 6">
Figure 6: Generation of ECG beats of the S class by the GAN model. Each graph overlays synthetic ECG signals with the same fixed length (187 samples) as the pre-processed input ECG signals; the graphs run left to right and top to bottom in the temporal sequence of generation, at intervals of 900 training epochs. Compared with the beginning of training (top left), the synthetic signals show greater convergence toward the end of training (bottom right).
Figure 7: Real (left) and GAN-generated synthetic (right) ECG beats of the S class. The synthetic ECG beat appears visually realistic.
Figure 8: Discriminator training error (which quantifies differences between the real and synthetic ECG signals) and generator training error over 3000 epochs during ECG beat generation of the S class using the GAN model. The discriminator error decreases substantially with training, whereas the generator error remains largely flat.
Figure 9: Generation of ECG beats of the F class by the GAN model, presented as in Figure 6: overlaid synthetic signals plotted at intervals of 900 training epochs, with greater convergence toward the end of training.
Figure 10: Real (left) and GAN-generated synthetic (right) ECG beats of the F class. The synthetic ECG beat appears visually realistic.
Figure 11: Discriminator and generator training error over 3000 epochs during ECG beat generation of the F class using the GAN model. The discriminator error decreases substantially with training, whereas the generator error rises initially, then decreases and remains flat.
Figure 12: Randomly selected real (left) and synthetic (right) ECG signals of the S class, with the corresponding similarity scores MSE, SSIM, and r.
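Figures 12 and 13 report MSE, SSIM, and Pearson r between paired real and synthetic beats. One way such scores can be computed for 1D signals with NumPy, SciPy, and scikit-image is sketched below; this is a generic illustration, not the authors' evaluation code.

```python
# Sketch: similarity scores between a real and a synthetic 1D ECG beat.
import numpy as np
from scipy.stats import pearsonr
from skimage.metrics import structural_similarity

def similarity_scores(real, synthetic):
    real = np.asarray(real, dtype=float)
    synthetic = np.asarray(synthetic, dtype=float)
    mse = float(np.mean((real - synthetic) ** 2))
    # structural_similarity accepts 1D arrays; data_range is the span of the signals.
    data_range = max(real.max(), synthetic.max()) - min(real.min(), synthetic.min())
    ssim = structural_similarity(real, synthetic, data_range=data_range)
    r, _ = pearsonr(real, synthetic)
    return mse, ssim, r

if __name__ == "__main__":
    t = np.linspace(0, 1, 187)
    real_beat = np.sin(2 * np.pi * 3 * t)
    synthetic_beat = real_beat + 0.05 * np.random.randn(187)   # stand-in for a GAN output
    print(similarity_scores(real_beat, synthetic_beat))
```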
Figure 13">
Figure 13: Randomly selected real (left) and synthetic (right) ECG signals of the F class, with the corresponding similarity scores MSE, SSIM, and r.
Figure 14: Loss function curves (left) and performance metric curves (right) during 100 epochs of training with the SkipCNN model.
Figure 15: SkipCNN classification performance for individual ECG beat classes and across all arrhythmia classes in the test dataset.
Figure 16: Loss function curves (left) and performance metric curves (right) during 100 epochs of training with the SkipCNN+LSTM model.
Figure 17: SkipCNN+LSTM classification performance for individual ECG beat classes and across all arrhythmia classes in the test dataset.
Figure 18: Loss function curves (left) and performance metric curves (right) during 100 epochs of training with the SkipCNN+LSTM+Attention model.
Figure 19: Arrhythmia detection outcomes in terms of performance metrics using the proposed SkipCNN+LSTM+Attention model.
Figure 20: Confusion matrix of arrhythmia detection using the ensemble model.
20 pages, 11706 KiB  
Article
Precision Medicine for Apical Lesions and Peri-Endo Combined Lesions Based on Transfer Learning Using Periapical Radiographs
by Pei-Yi Wu, Yi-Cheng Mao, Yuan-Jin Lin, Xin-Hua Li, Li-Tzu Ku, Kuo-Chen Li, Chiung-An Chen, Tsung-Yi Chen, Shih-Lun Chen, Wei-Chen Tu and Patricia Angela R. Abu
Bioengineering 2024, 11(9), 877; https://doi.org/10.3390/bioengineering11090877 - 29 Aug 2024
Viewed by 415
Abstract
An apical lesion is caused by bacteria invading the tooth apex through caries. Periodontal disease is caused by plaque accumulation. Peri-endo combined lesions include both diseases and significantly affect dental prognosis. The lack of clear symptoms in the early stages of onset makes [...] Read more.
An apical lesion is caused by bacteria invading the tooth apex through caries. Periodontal disease is caused by plaque accumulation. Peri-endo combined lesions include both diseases and significantly affect dental prognosis. The lack of clear symptoms in the early stages of onset makes diagnosis challenging, and delayed treatment can lead to the spread of symptoms. Early infection detection is crucial for preventing complications. The periapical radiographs (PAs) used as the database were provided by Chang Gung Memorial Medical Center, Taoyuan, Taiwan, with permission from the Institutional Review Board (IRB): 02002030B0. The tooth apex image enhancement method is a new technology in PA detection. This image enhancement method is used with convolutional neural networks (CNNs) to classify apical lesions, peri-endo combined lesions, and asymptomatic cases, and to compare the results with You Only Look Once-v8-Oriented Bounding Box (YOLOv8-OBB) disease detection. The contributions lie in the utilization of database augmentation and adaptive histogram equalization on individual tooth images, achieving the highest comprehensive validation accuracy of 95.23% with the ConvNextv2 model. Furthermore, the CNN outperformed YOLOv8 in identifying apical lesions, achieving an F1-score of 92.45%. For the classification of peri-endo combined lesions, the CNN attained the highest F1-score of 96.49%, whereas YOLOv8 scored 88.49%. Full article
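The abstract and the figure list below mention a Gaussian high-pass step and adaptive histogram equalization applied to individual tooth crops. The OpenCV sketch that follows shows one common way to implement these two enhancements; the kernel size, clip limit, tile grid, blending weight, and file names are illustrative assumptions rather than the paper's settings.

```python
# Sketch of two enhancement steps on a grayscale periapical tooth crop:
# (1) high-pass detail = original minus a Gaussian-blurred copy,
# (2) CLAHE (contrast-limited adaptive histogram equalization).
# Kernel size, clip limit, tile grid, and blending weight are assumed values.
import cv2
import numpy as np

def enhance_tooth_crop(gray: np.ndarray) -> np.ndarray:
    # Gaussian high-pass: subtracting a heavily blurred copy keeps fine structure.
    blurred = cv2.GaussianBlur(gray, (31, 31), 0)
    highpass = cv2.subtract(gray, blurred)

    # Adaptive histogram equalization on the original crop.
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
    equalized = clahe.apply(gray)

    # Combine: equalized contrast plus a fraction of the high-pass detail.
    return cv2.addWeighted(equalized, 1.0, highpass, 0.5, 0)

if __name__ == "__main__":
    crop = cv2.imread("tooth_crop.png", cv2.IMREAD_GRAYSCALE)  # hypothetical file name
    if crop is not None:
        cv2.imwrite("tooth_crop_enhanced.png", enhance_tooth_crop(crop))
```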
Graphical abstract
Figure 1: Research flowchart.
Figure 2: Manual annotation using Roboflow's polygon tool.
Figure 3: The first tooth rotated to a horizontal 0-degree image.
Figure 4: Image expansion. (a) Original cropped image. (b) Cropped image expanded by 20 pixels horizontally and 40 pixels vertically.
Figure 5: Gaussian high-pass filter result. (a) Original image. (b) Result of the Gaussian high-pass filter. (c) Result of (a) minus (b).
Figure 6: Adaptive histogram equalization. (a) Original image and histogram. (b) Enhanced image and histogram after adaptive histogram equalization.
Figure 7: Flat-field correction result. (a) Original image. (b) Flat-field-corrected image.
Figure 8: Linear transform model.
Figure 9: Result of the linear transformation. (a) Original image. (b) Linearly transformed image.
Figure 10: Result of the negative film effect. (a) Original image. (b) Negative film effect image.
Figure 11: Disease prediction results with YOLOv8 OBB.
Figure 12: Single-tooth prediction results.
Figure 13: Validation accuracy during the training process of the Places365-GoogLeNet model.
Figure 14: Validation loss during the training process of the Places365-GoogLeNet model.
Figure 15: After adaptive histogram equalization, segmentation into multiple single-tooth images (numbered from left to right).
14 pages, 7195 KiB  
Article
RHYTHMI: A Deep Learning-Based Mobile ECG Device for Heart Disease Prediction
by Alaa Eleyan, Ebrahim AlBoghbaish, Abdulwahab AlShatti, Ahmad AlSultan and Darbi AlDarbi
Appl. Syst. Innov. 2024, 7(5), 77; https://doi.org/10.3390/asi7050077 - 29 Aug 2024
Viewed by 448
Abstract
Heart disease, a global killer with many variations like arrhythmia and heart failure, remains a major health concern. Traditional risk factors include age, cholesterol, diabetes, and blood pressure. Fortunately, artificial intelligence (AI) offers a promising solution. We have harnessed the power of AI, [...] Read more.
Heart disease, a global killer with many variations like arrhythmia and heart failure, remains a major health concern. Traditional risk factors include age, cholesterol, diabetes, and blood pressure. Fortunately, artificial intelligence (AI) offers a promising solution. We have harnessed the power of AI, specifically deep learning and convolutional neural networks (CNNs), to develop Rhythmi, an innovative mobile ECG diagnosis device for heart disease detection. Rhythmi leverages extensive medical data from databases like MIT-BIH and BIDMC. These data empower the training and testing of the developed deep learning model to analyze ECG signals with accuracy, precision, sensitivity, specificity, and F1-score in identifying arrhythmias and other heart conditions, with performances reaching 98.52%, 98.55%, 98.52%, 99.26%, and 98.52%, respectively. Moreover, we tested Rhythmi in real time using a mobile device with a single-lead ECG sensor. This user-friendly prototype captures the ECG signal, transmits it to Rhythmi’s dedicated website, and provides instant diagnosis and feedback on the patient’s heart health. The developed mobile ECG diagnosis device addresses the main problems of traditional ECG diagnostic devices such as accessibility, cost, mobility, complexity, and data integration. However, we believe that despite the promising results, our system will still need intensive clinical validation in the future. Full article
Figure 1: Number of available recordings per class (blue) vs. the number of used recordings per class (orange).
Figure 2: Original signal/recording (65,535 features) vs. one segmented signal (500 features).
Figure 3: Comparison between a noisy and a filtered signal.
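Figure 3 contrasts a noisy and a filtered ECG signal. A conventional way to denoise a single-lead recording is a zero-phase Butterworth band-pass filter; the SciPy sketch below assumes a 0.5–40 Hz pass band and a 250 Hz sampling rate, which are common choices but not necessarily the settings used in Rhythmi.

```python
# Sketch: zero-phase Butterworth band-pass filtering of a raw single-lead ECG.
# The sampling rate and the 0.5-40 Hz pass band are assumed values for illustration.
import numpy as np
from scipy.signal import butter, filtfilt

def bandpass_ecg(signal, fs=250.0, low=0.5, high=40.0, order=4):
    nyq = 0.5 * fs
    b, a = butter(order, [low / nyq, high / nyq], btype="band")
    return filtfilt(b, a, signal)          # filtfilt avoids phase distortion

if __name__ == "__main__":
    fs = 250.0
    t = np.arange(0, 10, 1 / fs)
    clean = np.sin(2 * np.pi * 1.2 * t)                     # crude stand-in for heartbeats
    noisy = clean + 0.3 * np.random.randn(t.size) + 0.5 * np.sin(2 * np.pi * 0.1 * t)
    filtered = bandpass_ecg(noisy, fs=fs)
    print(filtered.shape)
```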
Figure 4">
Figure 4: Database splitting ratio for the training and testing sets.
Figure 5: Block diagram of the stages of the proposed deep learning model.
Figure 6: The proposed deep learning model's structure.
Figure 7: ECG acquisition and electrode placements. The positive electrode (red) is placed on the left hand, while both the negative (black) and the reference (white) electrodes are placed on the right hand.
Figure 8: RHYTHMI's application website (https://www.rhythmi.org/).
Figure 9: A scene from a real-time testing trial of the developed mobile ECG device at one of the exhibitions the authors participated in.
Figure 10: Rhythmi real-time ECG test results for normal (left) and abnormal (right) cases.
Figure 11: Number of individuals tested at different exhibitions and competitions, and their diagnoses.
Figure 12: The proposed model's accuracy and loss values tracked over multiple epochs for the training (blue) and validation (orange) datasets.
Figure 13: Confusion matrix for the results of the proposed approach using the MIT-BIH and BIDMC databases.
Figure 14: ROC curves for the training dataset (top) and the test dataset (bottom) for the three classes of the MIT-BIH and BIDMC databases.
22 pages, 29298 KiB  
Article
Landslide Recognition Based on Machine Learning Considering Terrain Feature Fusion
by Jincan Wang, Zhiheng Wang, Liyao Peng and Chenzhihao Qian
ISPRS Int. J. Geo-Inf. 2024, 13(9), 306; https://doi.org/10.3390/ijgi13090306 - 28 Aug 2024
Viewed by 459
Abstract
Landslides are one of the major disasters that exist worldwide, posing a serious threat to human life and property safety. Rapid and accurate detection and mapping of landslides are crucial for risk assessment and humanitarian assistance in affected areas. To achieve this goal, [...] Read more.
Landslides are among the major disasters worldwide, posing a serious threat to human life and property safety. Rapid and accurate detection and mapping of landslides are crucial for risk assessment and humanitarian assistance in affected areas. To achieve this goal, this study proposes a landslide recognition method based on machine learning (ML) and terrain feature fusion. Taking the Dawan River Basin in Detuo Township and Tianwan Yi Ethnic Township as the research area, landslide-related data were first compiled, including a landslide inventory based on field surveys, satellite images, historical data, high-resolution remote sensing images, and terrain data. Then, different training datasets for landslide recognition were constructed, including full-feature datasets that fuse terrain features with remote sensing features and datasets that contain only remote sensing features. At the same time, different ratios of landslide to non-landslide (or positive/negative, P/N) samples were set in the training data. Subsequently, five ML algorithms, including Extreme Gradient Boost (XGBoost), Adaptive Boost (AdaBoost), Light Gradient Boost (LightGBM), Random Forest (RF), and Convolutional Neural Network (CNN), were used to train on each training dataset, and landslide recognition was performed on the validation area. Finally, accuracy (A), precision (P), recall (R), F1 score (F1), and intersection over union (IOU) were selected to evaluate the landslide recognition ability of the different models. The results indicate that selecting ML models suited to the study area and an appropriate P/N sample ratio improves the A, R, F1, and IOU of landslide identification, yielding more accurate and reasonable results; fusing terrain features enables the model to recognize landslides more comprehensively and to align better with actual conditions. The best-performing model in the study is LightGBM. When the input data include all features and the P/N sample ratio is optimal, the A, P, R, F1, and IOU of the landslide recognition results for this model are 97.47%, 85.40%, 76.95%, 80.95%, and 71.28%, respectively. Compared to the landslide recognition results using only remote sensing features, this model shows improvements of 4.51%, 35.66%, 5.41%, 22.27%, and 29.16% in A, P, R, F1, and IOU, respectively. This study serves as a valuable reference for the precise and comprehensive identification of landslide areas. Full article
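The abstract evaluates five classifiers with accuracy, precision, recall, F1, and IOU and finds LightGBM trained on fused terrain and remote sensing features to be the strongest. The sketch below illustrates that kind of workflow with scikit-learn and LightGBM on synthetic placeholder data; the feature layout, hyperparameters, and labels are assumptions, not the study's data or code.

```python
# Sketch: sample-level landslide classification with fused terrain and spectral
# features, evaluated with A, P, R, F1, and IOU. Feature names, labels, and
# hyperparameters are placeholders, not the study's configuration.
import numpy as np
from lightgbm import LGBMClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, jaccard_score)

rng = np.random.default_rng(0)
n = 5000
# Columns: e.g. red, green, blue, NIR (remote sensing) + slope, aspect, relief (terrain).
X = rng.normal(size=(n, 7))
y = (X[:, 4] + 0.5 * X[:, 3] + rng.normal(scale=0.5, size=n) > 1.0).astype(int)  # toy labels

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0, stratify=y)

model = LGBMClassifier(n_estimators=300, learning_rate=0.05)
model.fit(X_tr, y_tr)
pred = model.predict(X_te)

print("A  ", accuracy_score(y_te, pred))
print("P  ", precision_score(y_te, pred))
print("R  ", recall_score(y_te, pred))
print("F1 ", f1_score(y_te, pred))
print("IOU", jaccard_score(y_te, pred))   # Jaccard index equals intersection over union
```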
Figure 1: (a) Mogangling landslide; (b) Lantianwan landslide.
Figure 2: The study area's location.
Figure 3: Basic data for landslide identification: (a) optical image on 10 September 2022; (b) DEM; (c) landslide area in the Dadu River Basin, Tianwan Yi Ethnic Township; (d) landslide area in the Dadu River Basin, Detuo Township.
Figure 4: XGBoost conceptual model.
Figure 5: AdaBoost conceptual model.
Figure 6: LightGBM conceptual model.
Figure 7: Flow chart of the Random Forest model.
Figure 8: CNN structure diagram.
Figure 9: Landslide identification results based on XGBoost.
Figure 10: Landslide identification results based on AdaBoost.
Figure 11: Landslide identification results based on LightGBM.
Figure 12: Landslide identification results based on RF.
Figure 13: Landslide identification results based on CNN.
Figure 14: Performance comparison of different models on the test set: (a) accuracy; (b) precision; (c) recall; (d) F1 score; (e) IOU.
Figure 15: Landslide identification with different ratios of P/N samples.
Figure 16: Landslide recognition with different training data.
16 pages, 2588 KiB  
Article
Development of a Machine Learning Model for the Classification of Enterobius vermicularis Egg
by Natthanai Chaibutr, Pongphan Pongpanitanont, Sakhone Laymanivong, Tongjit Thanchomnang and Penchom Janwan
J. Imaging 2024, 10(9), 212; https://doi.org/10.3390/jimaging10090212 - 28 Aug 2024
Viewed by 436
Abstract
Enterobius vermicularis (pinworm) infections are a significant global health issue, affecting children predominantly in environments like schools and daycares. Traditional diagnosis using the scotch tape technique involves examining E. vermicularis eggs under a microscope. This method is time-consuming and depends heavily on the [...] Read more.
Enterobius vermicularis (pinworm) infections are a significant global health issue, affecting children predominantly in environments like schools and daycares. Traditional diagnosis using the scotch tape technique involves examining E. vermicularis eggs under a microscope. This method is time-consuming and depends heavily on the examiner’s expertise. To improve this, convolutional neural networks (CNNs) have been used to automate the detection of pinworm eggs from microscopic images. In our study, we enhanced E. vermicularis egg detection using a CNN benchmarked against leading models. We digitized and augmented 40,000 images of E. vermicularis eggs (class 1) and artifacts (class 0) for comprehensive training, using an 80:20 training–validation split and five-fold cross-validation. The proposed CNN model showed limited initial performance but achieved 90.0% accuracy, precision, recall, and F1-score after data augmentation. It also demonstrated improved stability, with the ROC-AUC metric increasing from 0.77 to 0.97. Despite its smaller file size, our CNN model performed comparably to larger models. Notably, the Xception model achieved 99.0% accuracy, precision, recall, and F1-score. These findings highlight the effectiveness of data augmentation and advanced CNN architectures in improving diagnostic accuracy and efficiency for E. vermicularis infections. Full article
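The abstract reports an 80:20 split plus five-fold cross-validation scored with accuracy, precision, recall, F1, and ROC-AUC. A generic scikit-learn sketch of such an evaluation loop is given below; the random placeholder features and the logistic-regression stand-in are assumptions used only to show the metric computation, not the paper's CNN.

```python
# Sketch: stratified five-fold cross-validation with the metrics named in the
# abstract (accuracy, precision, recall, F1, ROC-AUC). A logistic regression on
# placeholder features stands in for the paper's CNN purely for illustration.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, roc_auc_score)

rng = np.random.default_rng(0)
X = rng.normal(size=(400, 64))            # placeholder image features
y = rng.integers(0, 2, size=400)          # class 0 = artifact, class 1 = egg

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
for fold, (tr, te) in enumerate(skf.split(X, y), start=1):
    clf = LogisticRegression(max_iter=1000).fit(X[tr], y[tr])
    pred = clf.predict(X[te])
    prob = clf.predict_proba(X[te])[:, 1]
    print(f"fold {fold}: acc={accuracy_score(y[te], pred):.3f} "
          f"prec={precision_score(y[te], pred):.3f} "
          f"rec={recall_score(y[te], pred):.3f} "
          f"f1={f1_score(y[te], pred):.3f} "
          f"auc={roc_auc_score(y[te], prob):.3f}")
```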
(This article belongs to the Section Image and Video Processing)
Graphical abstract
Figure 1: Workflow of an object detection system incorporating data augmentation techniques. (A) The process begins with data acquisition, followed by preprocessing and image augmentation; the augmented images are used for model training, after which the model undergoes validation and testing. Once trained, the model is applied to new test images to detect objects, with performance evaluated using the Intersection-over-Union (IoU) metric. (B) Various augmentation techniques are applied to the original dataset, creating an enhanced and diversified dataset.
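Figure 1 evaluates detections with the Intersection-over-Union (IoU) metric. For axis-aligned boxes in (x1, y1, x2, y2) form, IoU can be computed as in this small sketch (a generic formulation, not code taken from the paper):

```python
# Sketch: Intersection-over-Union for two axis-aligned boxes (x1, y1, x2, y2).
def iou(box_a, box_b):
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Overlap rectangle (may be empty).
    ix1, iy1 = max(ax1, bx1), max(ay1, by1)
    ix2, iy2 = min(ax2, bx2), min(ay2, by2)
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

print(iou((10, 10, 50, 50), (30, 30, 70, 70)))   # 400 / 2800, roughly 0.143
```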
Figure 2">
Figure 2: Microscopic images illustrating the impact of various augmentation techniques on two classes (class 0 and class 1). The first column shows the original, unaltered images; each subsequent column shows the images after Gaussian blur, mean filtering, Gaussian noise, and kernel sharpening. These augmentations introduce a range of visual variations and distortions that enrich the dataset and are intended to bolster the robustness and generalization of the machine learning model trained on it. The upper row represents class 0 and the lower row class 1.
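Figure 2 names four augmentations: Gaussian blur, mean filtering, Gaussian noise, and kernel sharpening. A compact OpenCV/NumPy sketch of these operations follows; the kernel sizes, noise level, sharpening kernel, and file names are assumed for illustration and are not taken from the paper.

```python
# Sketch of the four augmentations named in Figure 2; kernel sizes and the
# noise standard deviation are assumed values for illustration.
import cv2
import numpy as np

def augment_variants(img: np.ndarray) -> dict:
    variants = {}
    variants["gaussian_blur"] = cv2.GaussianBlur(img, (5, 5), 0)
    variants["mean_filter"] = cv2.blur(img, (5, 5))

    # Additive Gaussian noise, clipped back into the valid 8-bit range.
    noise = np.random.normal(0, 15, img.shape)
    variants["gaussian_noise"] = np.clip(img.astype(float) + noise, 0, 255).astype(np.uint8)

    # Simple sharpening kernel applied with a 2D convolution.
    sharpen_kernel = np.array([[0, -1, 0],
                               [-1, 5, -1],
                               [0, -1, 0]], dtype=np.float32)
    variants["kernel_sharpen"] = cv2.filter2D(img, -1, sharpen_kernel)
    return variants

if __name__ == "__main__":
    image = cv2.imread("egg_crop.png")              # hypothetical file name
    if image is not None:
        for name, out in augment_variants(image).items():
            cv2.imwrite(f"aug_{name}.png", out)
```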
Figure 3">
Figure 3: Architectural design of the proposed convolutional neural network (CNN).
Figure 4: Training and validation outcomes. Five-fold cross-validation training loss for (A) the non-augmented and (B) the augmented image dataset; prediction accuracy versus cross-validation fold for (C) the non-augmented and (D) the augmented image dataset.
Figure 5: Receiver operating characteristic (ROC) curves for the binary classification model trained on (A) the non-augmented and (B) the augmented image dataset. The orange line reflects the model's ability to distinguish the classes; the blue dashed line marks an AUC of 0.5, indicating no discriminative power.
Figure 6: Comparison between the object detection results of the trained Xception model and annotations made by expert medical staff on the microscopic images. (A) Objects detected by the Xception model (green bounding boxes). (B) Annotations made by a parasitology expert (red bounding boxes). (C) Combined view showing both the expert annotations (red) and the model's predictions (green).