[go: up one dir, main page]

 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (14,754)

Search Parameters:
Keywords = machine learning algorithms

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
32 pages, 614 KiB  
Review
Automatic Generation of Medical Case-Based Multiple-Choice Questions (MCQs): A Review of Methodologies, Applications, Evaluation, and Future Directions
by Somaiya Al Shuraiqi, Abdulrahman Aal Abdulsalam, Ken Masters, Hamza Zidoum and Adhari AlZaabi
Big Data Cogn. Comput. 2024, 8(10), 139; https://doi.org/10.3390/bdcc8100139 (registering DOI) - 17 Oct 2024
Abstract
This paper offers an in-depth review of the latest advancements in the automatic generation of medical case-based multiple-choice questions (MCQs). The automatic creation of educational materials, particularly MCQs, is pivotal in enhancing teaching effectiveness and student engagement in medical education. In this review, [...] Read more.
This paper offers an in-depth review of the latest advancements in the automatic generation of medical case-based multiple-choice questions (MCQs). The automatic creation of educational materials, particularly MCQs, is pivotal in enhancing teaching effectiveness and student engagement in medical education. In this review, we explore various algorithms and techniques that have been developed for generating MCQs from medical case studies. Recent innovations in natural language processing (NLP) and machine learning (ML) for automatic language generation have garnered considerable attention. Our analysis evaluates and categorizes the leading approaches, highlighting their generation capabilities and practical applications. Additionally, this paper synthesizes the existing evidence, detailing the strengths, limitations, and gaps in current practices. By contributing to the broader conversation on how technology can support medical education, this review not only assesses the present state but also suggests future directions for improvement. We advocate for the development of more advanced and adaptable mechanisms to enhance the automatic generation of MCQs, thereby supporting more effective learning experiences in medical education. Full article
(This article belongs to the Special Issue Advances in Natural Language Processing and Text Mining)
Show Figures

Figure 1

Figure 1
<p>An example of medical ontology on COVID-19.</p>
Full article ">
23 pages, 846 KiB  
Article
The Influence Mechanism of Narrative Strategies Used by Virtual Influencers on Consumer Product Preferences
by Yuelong Zeng, Gefei Pu, Jingwen Liu and Wenting Feng
J. Theor. Appl. Electron. Commer. Res. 2024, 19(4), 2828-2850; https://doi.org/10.3390/jtaer19040137 (registering DOI) - 17 Oct 2024
Abstract
As social media has risen, virtual social media influencers have become a significant tool in modern marketing, utilizing computer-generated images (CGI), machine learning algorithms, and artificial intelligence technologies to connect with consumers via virtual online personas. In this study, the Uses and Gratifications [...] Read more.
As social media has risen, virtual social media influencers have become a significant tool in modern marketing, utilizing computer-generated images (CGI), machine learning algorithms, and artificial intelligence technologies to connect with consumers via virtual online personas. In this study, the Uses and Gratifications Theory (UGT) is employed as a theoretical framework to explore the effects of educational narrative strategies and evaluative narrative strategies on consumer product preferences, with an analysis of the mediating role of word-of-mouth effectiveness and the moderating role of perceived product usability. It was demonstrated in Experiment 1 that virtual influencers employing educational narrative strategies are more effective than those using evaluative narrative strategies in enhancing consumer product preferences. The boundaries of the study were clarified in Experiment 2, which found that the main effect of educational narrative strategies utilized by social media influencers to increase consumer product preferences is present only in the context of virtual influencers. In Experiment 3, the mediating role of word-of-mouth recommendation effectiveness in the relationship between narrative strategies and consumer product preferences was further verified. The moderating role of perceived product usability was examined in Experiment 4, and it was found that the main effect is more pronounced in contexts where perceived product usability is low. The results of this study provide theoretical and practical guidance on how companies can effectively leverage virtual influencers to promote their products. Full article
(This article belongs to the Topic Interactive Marketing in the Digital Era)
Show Figures

Figure 1

Figure 1
<p>The effectiveness of WOM’s mediation model.</p>
Full article ">Figure 2
<p>Moderated mediation model.</p>
Full article ">
33 pages, 629 KiB  
Article
Enhancing Smart City Connectivity: A Multi-Metric CNN-LSTM Beamforming Based Approach to Optimize Dynamic Source Routing in 6G Networks for MANETs and VANETs
by Vincenzo Inzillo, David Garompolo and Carlo Giglio
Smart Cities 2024, 7(5), 3022-3054; https://doi.org/10.3390/smartcities7050118 (registering DOI) - 17 Oct 2024
Abstract
The advent of Sixth Generation (6G) wireless technologies introduces challenges and opportunities for Mobile Ad Hoc Networks (MANETs) and Vehicular Ad Hoc Networks (VANETs), necessitating a reevaluation of traditional routing protocols. This paper introduces the Multi-Metric Scoring Dynamic Source Routing (MMS-DSR), a novel [...] Read more.
The advent of Sixth Generation (6G) wireless technologies introduces challenges and opportunities for Mobile Ad Hoc Networks (MANETs) and Vehicular Ad Hoc Networks (VANETs), necessitating a reevaluation of traditional routing protocols. This paper introduces the Multi-Metric Scoring Dynamic Source Routing (MMS-DSR), a novel enhancement of the Dynamic Source Routing (DSR) protocol, designed to meet the demands of 6G-enabled MANETs and the dynamic environments of VANETs. MMS-DSR integrates advanced technologies and methodologies to enhance routing performance in dynamic scenarios. Key among these is the use of a CNN-LSTM-based beamforming algorithm, which optimizes beamforming vectors dynamically, exploiting spatial-temporal variations characteristic of 6G channels. This enables MMS-DSR to adapt beam directions in real time based on evolving network conditions, improving link reliability and throughput. Furthermore, MMS-DSR incorporates a multi-metric scoring mechanism that evaluates routes based on multiple QoS parameters, including latency, bandwidth, and reliability, enhanced by the capabilities of Massive MIMO and the IEEE 802.11ax standard. This ensures route selection is context-aware and adaptive to changing dynamics, making it effective in urban settings where vehicular and mobile nodes coexist. Additionally, the protocol uses machine learning techniques to predict future route performance, enabling proactive adjustments in routing decisions. The integration of dynamic beamforming and machine learning allows MMS-DSR to effectively handle the high mobility and variability of 6G networks, offering a robust solution for future wireless communications, particularly in smart cities. Full article
Show Figures

Figure 1

Figure 1
<p>Flowchart for MMS-DSR Architecture.</p>
Full article ">Figure 2
<p>Network topology diagram illustrating the routes from Node A to Node J with respective metrics.</p>
Full article ">Figure 3
<p>Network topology diagram illustrating the routes from Node A to Node J with respective metrics.</p>
Full article ">Figure 4
<p>CNN-LSTM model architecture for MMS-DSR.</p>
Full article ">Figure 5
<p>Network topology diagram illustrating the routes from Node A to Node J with respective metrics.</p>
Full article ">Figure 6
<p>Network topology diagram illustrating the routes from Node A to Node J with respective metrics.</p>
Full article ">Figure 7
<p>Throughput comparison across varying numbers of vehicles.</p>
Full article ">Figure 8
<p>Throughput comparison across varying vehicle speeds.</p>
Full article ">Figure 9
<p>Latency comparison in function of vehicle density.</p>
Full article ">Figure 10
<p>Latency comparison in function on vehicle speed.</p>
Full article ">Figure 11
<p>Route discovery time vs. vehicle density.</p>
Full article ">Figure 12
<p>Route discovery time vs. vehicle speed.</p>
Full article ">Figure 13
<p>Routing overhead comparison in function of vehicle density.</p>
Full article ">Figure 14
<p>Routing overhead comparison in function of vehicle speed.</p>
Full article ">Figure 15
<p>Scalability comparison performance across increasing vehicle density.</p>
Full article ">
18 pages, 1593 KiB  
Article
Privacy-Preserving Handover Optimization Using Federated Learning and LSTM Networks
by Wei-Che Chien, Yu Huang, Bo-Yu Chang and Wu-Yuin Hwang
Sensors 2024, 24(20), 6685; https://doi.org/10.3390/s24206685 (registering DOI) - 17 Oct 2024
Abstract
The rapid evolution of wireless communication systems necessitates advanced handover mechanisms for seamless connectivity and optimal network performance. Traditional algorithms, like 3GPP Event A3, often struggle with fluctuating signal strengths and dynamic user mobility, leading to frequent handovers and suboptimal resource utilization. This [...] Read more.
The rapid evolution of wireless communication systems necessitates advanced handover mechanisms for seamless connectivity and optimal network performance. Traditional algorithms, like 3GPP Event A3, often struggle with fluctuating signal strengths and dynamic user mobility, leading to frequent handovers and suboptimal resource utilization. This study proposes a novel approach combining Federated Learning (FL) and Long Short-Term Memory (LSTM) networks to predict Reference Signal Received Power (RSRP) and the strongest nearby Reference Signal Received Power (RSRP) signals. Our method leverages FL to ensure data privacy and LSTM to capture temporal dependencies in signal data, enhancing prediction accuracy. We develop a dynamic handover algorithm that adapts to real-time conditions, adjusting thresholds based on predicted signal strengths and historical performance. Extensive experiments with real-world data show our dynamic algorithm significantly outperforms the 3GPP Event A3 algorithm, achieving higher prediction accuracy, reducing unnecessary handovers, and improving overall network performance. In conclusion, this study introduces a data-driven, privacy-preserving approach that leverages advanced machine learning techniques, providing a more efficient and reliable handover mechanism for future wireless networks. Full article
(This article belongs to the Topic Innovation, Communication and Engineering)
Show Figures

Figure 1

Figure 1
<p>Handover Decision Process with TTT and HOM.</p>
Full article ">Figure 2
<p>Proposed system structure.</p>
Full article ">Figure 3
<p>LSTM Model Structure.</p>
Full article ">Figure 4
<p>Explanation of the Process for Checking Ping-Pong Conditions.</p>
Full article ">Figure 5
<p>Our integral system flowchart.</p>
Full article ">Figure 6
<p>Clients’ Data Distribution, the circles in this figure are the outliers in each clients.</p>
Full article ">Figure 7
<p>Prediction result of heterogeneous federated learning.</p>
Full article ">Figure 8
<p>Prediction result of homogeneous federated learning.</p>
Full article ">Figure 9
<p>Prediction result of centralized LSTM model.</p>
Full article ">Figure 10
<p>95% Confidence Interval for RSRP Prediction.</p>
Full article ">Figure 11
<p>Comparison of Algorithm and HOPP Probability.</p>
Full article ">
31 pages, 6207 KiB  
Article
A Distributed VMD-BiLSTM Model for Taxi Demand Forecasting with GPS Sensor Data
by Hasan A. H. Naji, Qingji Xue and Tianfeng Li
Sensors 2024, 24(20), 6683; https://doi.org/10.3390/s24206683 (registering DOI) - 17 Oct 2024
Abstract
With the ubiquitous deployment of mobile and sensor technologies in modes of transportation, taxis have become a significant component of public transportation. However, vacant taxis represent an important waste of transportation resources. Forecasting taxi demand within a short time achieves a supply–demand balance [...] Read more.
With the ubiquitous deployment of mobile and sensor technologies in modes of transportation, taxis have become a significant component of public transportation. However, vacant taxis represent an important waste of transportation resources. Forecasting taxi demand within a short time achieves a supply–demand balance and reduces oil emissions. Although earlier studies have forwarded highly developed machine learning- and deep learning-based models to forecast taxicab demands, these models often face significant computational expenses and cannot effectively utilize large-scale trajectory sensor data. To address these challenges, in this paper, we propose a hybrid deep learning-based model for taxi demand prediction. In particular, the Variational Mode Decomposition (VMD) algorithm is integrated along with a Bidirectional Long Short-Term Memory (BiLSTM) model to perform the prediction process. The VMD algorithm is applied to decompose time series-aware traffic features into multiple sub-modes of different frequencies. After that, the BiLSTM method is utilized to predict time series data fed with the relevant demand features. To overcome the limitation of high computational expenses, the designed model is performed on the Spark distributed platform. The performance of the proposed model is tested using a real-world dataset, and it surpasses existing state-of-the-art predictive models in terms of accuracy, efficiency, and distributed performance. These findings provide insights for enhancing the efficiency of passenger search and increasing the profit of taxicabs. Full article
(This article belongs to the Section Sensor Networks)
Show Figures

Figure 1

Figure 1
<p>The structure of the distributed VMD-BiLSTM prediction model.</p>
Full article ">Figure 2
<p>Road map network of Wuhan City.</p>
Full article ">Figure 3
<p>Typical trajectory of taxi trips.</p>
Full article ">Figure 4
<p>Study area of Wuchang district.</p>
Full article ">Figure 5
<p>The distribution of taxi demands on the weekdays and weekends.</p>
Full article ">Figure 6
<p>Taxi demand distribution in the target area during holidays.</p>
Full article ">Figure 7
<p>The distribution of taxi demands in the target area over 24 h.</p>
Full article ">Figure 8
<p>Schematic diagram of the VMD-BiLSTM model.</p>
Full article ">Figure 9
<p>Flowchart of VMD algorithm.</p>
Full article ">Figure 10
<p>The transformation of the taxi demands time series into a two-dimensional array.</p>
Full article ">Figure 11
<p>Architecture of bidirectional LSTM network.</p>
Full article ">Figure 12
<p>Distributed implementation of VMD-BiLSTM model on Spark.</p>
Full article ">Figure 13
<p>Results of our prediction model on Wuhan’s dataset. (<b>a</b>) 1 day; (<b>b</b>) 1 week; (<b>c</b>) 2 weeks; (<b>d</b>) 1 month; (<b>e</b>) 2 months; and (<b>f</b>) whole dataset.</p>
Full article ">Figure 13 Cont.
<p>Results of our prediction model on Wuhan’s dataset. (<b>a</b>) 1 day; (<b>b</b>) 1 week; (<b>c</b>) 2 weeks; (<b>d</b>) 1 month; (<b>e</b>) 2 months; and (<b>f</b>) whole dataset.</p>
Full article ">Figure 14
<p>VMD renderings.</p>
Full article ">Figure 14 Cont.
<p>VMD renderings.</p>
Full article ">Figure 15
<p>Wavelet threshold denoising method for VMD renderings (IMF1, IMF2, and IMF3).</p>
Full article ">Figure 15 Cont.
<p>Wavelet threshold denoising method for VMD renderings (IMF1, IMF2, and IMF3).</p>
Full article ">Figure 16
<p>Box-plot of MOEs for Wuhan dataset.</p>
Full article ">Figure 17
<p>Comparison of loss function of distributed VMD-BiLSM.</p>
Full article ">Figure 18
<p>Running time (seconds) of VMD-BiLSTM based on Spark platform.</p>
Full article ">Figure 19
<p>Scaleup comparative analysis of distributed VMD-BiLSTM for different computing nodes.</p>
Full article ">Figure 20
<p>Speedup comparative analysis of the proposed model for different computing nodes.</p>
Full article ">
15 pages, 1495 KiB  
Article
Classification of Breathing Phase and Path with In-Ear Microphones
by Malahat H. K. Mehrban, Jérémie Voix and Rachel E. Bouserhal
Sensors 2024, 24(20), 6679; https://doi.org/10.3390/s24206679 (registering DOI) - 17 Oct 2024
Abstract
In recent years, the use of smart in-ear devices (hearables) for health monitoring has gained popularity. Previous research on in-ear breath monitoring with hearables uses signal processing techniques based on peak detection. Such techniques are greatly affected by movement artifacts and other challenging [...] Read more.
In recent years, the use of smart in-ear devices (hearables) for health monitoring has gained popularity. Previous research on in-ear breath monitoring with hearables uses signal processing techniques based on peak detection. Such techniques are greatly affected by movement artifacts and other challenging real-world conditions. In this study, we use an existing database of various breathing types captured using an in-ear microphone to classify breathing path and phase. Having a small dataset, we use XGBoost, a simple and fast classifier, to address three different classification challenges. We achieve an accuracy of 86.8% for a binary path classifier, 74.1% for a binary phase classifier, and 67.2% for a four-class path and phase classifier. Our path classifier outperforms existing algorithms in recall and F1, highlighting the reliability of our approach. This work demonstrates the feasibility of the use of hearables in continuous breath monitoring tasks with machine learning. Full article
(This article belongs to the Special Issue Sensors for Breathing Monitoring)
Show Figures

Figure 1

Figure 1
<p>Illustration of the device worn by participants including an in-ear microphone (IEM), an outer-ear microphone (OEM), and a speaker (SPK).</p>
Full article ">Figure 2
<p>Respiration signal during normal nasal breathing captured simultaneously using an in-ear microphone (<b>a</b>) and the BioHarness 3.0 wearable chest belt (<b>b</b>). The mel-spectrogram of the in-ear microphone signal is presented in (<b>c</b>).</p>
Full article ">Figure 3
<p>Mel-spectrogram obtained from the data captured by hearables. (<b>a</b>–<b>d</b>) show four randomly selected participants breathing normally through their noses after exercise. As depicted in the figures, each participant had a different breathing pattern, level and pace based on their physical fitness level and morphology. For example, in (<b>b</b>), the participant was breathing relatively fast and deeply while the participant in (<b>c</b>) had normal nasal breathing which was barely audible and distinguishable.</p>
Full article ">Figure 3 Cont.
<p>Mel-spectrogram obtained from the data captured by hearables. (<b>a</b>–<b>d</b>) show four randomly selected participants breathing normally through their noses after exercise. As depicted in the figures, each participant had a different breathing pattern, level and pace based on their physical fitness level and morphology. For example, in (<b>b</b>), the participant was breathing relatively fast and deeply while the participant in (<b>c</b>) had normal nasal breathing which was barely audible and distinguishable.</p>
Full article ">Figure 4
<p>Examples of mel-spectrograms created from the IEM recordings. (<b>a</b>–<b>d</b>) illustrate breathing cycles, inhaling and exhaling, for four randomly chosen participants who were breathing deeply through their mouths. Individual differences did not significantly obscure the data; the recordings remained distinct and discernible.</p>
Full article ">Figure 4 Cont.
<p>Examples of mel-spectrograms created from the IEM recordings. (<b>a</b>–<b>d</b>) illustrate breathing cycles, inhaling and exhaling, for four randomly chosen participants who were breathing deeply through their mouths. Individual differences did not significantly obscure the data; the recordings remained distinct and discernible.</p>
Full article ">Figure 5
<p>Proposed processing pipeline illustrating the three classifiers.</p>
Full article ">Figure 6
<p>This figure depicts the mean CM values across all CV sets. (<b>a</b>) shows the results of <span class="html-italic">Nose</span>/<span class="html-italic">Mouth</span> classifier applied on <span class="html-italic">Forced</span> with the segment length of 400 ms, and (<b>b</b>) on <span class="html-italic">All</span>. The results of <span class="html-italic">Nose</span>/<span class="html-italic">Mouth</span> classifier applied on <span class="html-italic">Forced</span> and <span class="html-italic">All</span> with the segment duration of 200 ms are shown in (<b>c</b>,<b>d</b>), respectively. Finally, (<b>e</b>,<b>f</b>) represent the results of <span class="html-italic">Inhale</span>/<span class="html-italic">Exhale</span> classifier trained on <span class="html-italic">Forced</span> and <span class="html-italic">All</span> with the segment length of 200 ms, respectively.</p>
Full article ">Figure 7
<p>Mean confusion matrices showing four-class classifier results from using XGBoost and 200 ms segments. (<b>a</b>) shows the confusion matrix of <span class="html-italic">Forced</span> and (<b>b</b>) the confusion matrix of <span class="html-italic">All</span>. In both matrices, the confusing class was “Exhalation” showing that regardless of respiration path distinguishing exhalation from inhalation is complicated. Comparing (<b>a</b>,<b>b</b>), this gets worse when the algorithm is tested in <span class="html-italic">All</span>.</p>
Full article ">Figure 8
<p>Comparison of ’BreathTrack’, ’Breeze’, and the proposed <span class="html-italic">Inhale</span>/<span class="html-italic">Exhale</span> classifier. Based on the figure, our proposed algorithm exhibits a higher recall and F1-score than the two other algorithms available in the literature.</p>
Full article ">
8 pages, 494 KiB  
Proceeding Paper
CO2 Emissions Projections of the North American Cement Industry
by Ángel Francisco Galaviz Román, Seyedmehdi Mirmohammadsadeghi and Golam Kabir
Eng. Proc. 2024, 76(1), 19; https://doi.org/10.3390/engproc2024076019 - 17 Oct 2024
Abstract
Forecasting carbon dioxide (CO2) emissions has become a relevant issue. International organizations have emphasized the necessity of generating a plan to gradually reduce the concentrations of this pollutant to combat climate change. Cement industries represent one of the key sectors expected [...] Read more.
Forecasting carbon dioxide (CO2) emissions has become a relevant issue. International organizations have emphasized the necessity of generating a plan to gradually reduce the concentrations of this pollutant to combat climate change. Cement industries represent one of the key sectors expected to solve this problematic. The objective of this study is to predict CO2 emissions for North American cement industries. To achieve this, a multi-objective mathematical model is developed, integrating various machine learning algorithms. The results demonstrate a considerable improvement in accuracy metrics, with a 48.13% reduction in Mean Absolute Error achieved using the Generalized Reduced Gradient method (GRG). The forecasts reveal an increment in emissions from about 0.58 MtCO2 every year between 2020 and 2050. The proposed framework can help decision makers and policy makers focus on the technical and logistics requirements to meet net-zero emissions targets. Full article
Show Figures

Figure 1

Figure 1
<p>Machine learning fit plot of CO<sub>2</sub> emissions forecasting.</p>
Full article ">Figure 2
<p>Integrated GRG model fit plot for CO<sub>2</sub> emissions forecasting.</p>
Full article ">
18 pages, 4387 KiB  
Article
Enhanced Image-Based Malware Classification Using Transformer-Based Convolutional Neural Networks (CNNs)
by Moses Ashawa, Nsikak Owoh, Salaheddin Hosseinzadeh and Jude Osamor
Electronics 2024, 13(20), 4081; https://doi.org/10.3390/electronics13204081 - 17 Oct 2024
Abstract
As malware samples grow in complexity and employ advanced evasion techniques, traditional detection methods are insufficient for accurately classifying large volumes of sophisticated malware variants. To address this issue, image-based malware classification techniques leveraging machine learning algorithms have been developed as a more [...] Read more.
As malware samples grow in complexity and employ advanced evasion techniques, traditional detection methods are insufficient for accurately classifying large volumes of sophisticated malware variants. To address this issue, image-based malware classification techniques leveraging machine learning algorithms have been developed as a more optimal solution to this challenge. However, accurately classifying content distribution-based features with unique pixel intensities from grayscale images remains a challenge. This paper proposes an enhanced image-based malware classification system using convolutional neural networks (CNNs) using ResNet-152 and vision transformer (ViT). The two architectures are then compared to determine their classification abilities. A total of 6137 benign files and 9861 malicious executables are converted from text files to unsigned integers and then to images. The ViT examined unsigned integers as pixel values, while ResNet-152 converted the pixel values into floating points for classification. The result of the experiments demonstrates a high-performance accuracy of 99.62% with effective hyperparameters of 10-fold cross-validation. The findings indicate that the proposed model is capable of being implemented in dynamic and complex malware environments, achieving a practical computational efficiency of 47.2 s for the identification and classification of new malware samples. Full article
Show Figures

Figure 1

Figure 1
<p>Showing how the virtual machines are configured to store the executable files.</p>
Full article ">Figure 2
<p>Feature extraction and conversion process.</p>
Full article ">Figure 3
<p>State machine malware representation.</p>
Full article ">Figure 4
<p>Rendering of malware sample image. (<b>a</b>) Pictures integrated in the malware sample and (<b>b</b>) Images of malware that share similarities across various malware categories.</p>
Full article ">Figure 5
<p>The summary of the architectures of the proposed enhanced image-based malware classification model.</p>
Full article ">Figure 6
<p>Training at zero iterations.</p>
Full article ">Figure 7
<p>Image classification at different iterations. (<b>a</b>) 2000th count, (<b>b</b>) 4000th count (<b>c</b>) 6000th count, (<b>d</b>) 8000th count, (<b>e</b>) 10,000th count, (<b>f</b>) 12,000th count, (<b>g</b>) 14,000th count, (<b>h</b>) 16,000th count and (<b>i</b>) 18000th count.</p>
Full article ">Figure 7 Cont.
<p>Image classification at different iterations. (<b>a</b>) 2000th count, (<b>b</b>) 4000th count (<b>c</b>) 6000th count, (<b>d</b>) 8000th count, (<b>e</b>) 10,000th count, (<b>f</b>) 12,000th count, (<b>g</b>) 14,000th count, (<b>h</b>) 16,000th count and (<b>i</b>) 18000th count.</p>
Full article ">Figure 8
<p>Image SOM visualization. The gray color shows that there is no specific categorization of the image pixel intensity.</p>
Full article ">Figure 9
<p>Image sieve diagram visualization for the sample space showing benign and malicious classes.</p>
Full article ">Figure 10
<p>PCA dimensionality reduction. (<b>a</b>) First component, (<b>b</b>) second component, (<b>c</b>) third component, and (<b>d</b>) fourth component.</p>
Full article ">Figure 10 Cont.
<p>PCA dimensionality reduction. (<b>a</b>) First component, (<b>b</b>) second component, (<b>c</b>) third component, and (<b>d</b>) fourth component.</p>
Full article ">Figure 11
<p>Heatmap showing regions of interest as classified by the model. (<b>a</b>) Image cluster showing malware names and their textual cluster classifications. (<b>b</b>) Image cluster showing malware activities and their score clusters.</p>
Full article ">Figure 12
<p>Visualized malware images based on their malware families. (<b>a</b>) QakBot, (<b>b</b>) Gamarue, (<b>c</b>) Sodinokibi, and (<b>d</b>) Ryuk.</p>
Full article ">
20 pages, 1645 KiB  
Article
Classification of Acoustic Tones and Cardiac Murmurs Based on Digital Signal Analysis Leveraging Machine Learning Methods
by Nataliya Shakhovska and Ivan Zagorodniy
Computation 2024, 12(10), 208; https://doi.org/10.3390/computation12100208 - 17 Oct 2024
Abstract
Heart murmurs are abnormal heart sounds that can indicate various heart diseases. Although traditional auscultation methods are effective, they depend more on specialists’ knowledge, making it difficult to make an accurate diagnosis. This paper presents a machine learning-based framework for the classification of [...] Read more.
Heart murmurs are abnormal heart sounds that can indicate various heart diseases. Although traditional auscultation methods are effective, they depend more on specialists’ knowledge, making it difficult to make an accurate diagnosis. This paper presents a machine learning-based framework for the classification of acoustic sounds and heart murmurs using digital signal analysis. Using advanced machine learning algorithms, we aim to improve the accuracy, speed, and accessibility of heart murmur detection. The proposed method includes feature extraction from digital auscultatory recordings, preprocessing using signal processing techniques, and classification using state-of-the-art machine learning models. We evaluated the performance of different machine learning algorithms, such as convolutional neural networks (CNNs), random forests (RFs) and support vector machines (SVMs), on a selected heart noise dataset. The results show that our framework achieves high accuracy in differentiating normal heart sounds from different types of heart murmurs and provides a robust tool for clinical decision-making. Full article
(This article belongs to the Special Issue Artificial Intelligence Applications in Public Health)
Show Figures

Figure 1

Figure 1
<p>CNN architecture.</p>
Full article ">Figure 2
<p>Heart sound signal visualization.</p>
Full article ">Figure 3
<p>The performance of the convolutional neural network (CNN), random forest (RF), and support vector machine (SVM) models.</p>
Full article ">Figure 4
<p>Feature importance.</p>
Full article ">
22 pages, 4837 KiB  
Article
A Machine Learning Approach to Forecasting Hydropower Generation
by Sarah Di Grande, Mariaelena Berlotti, Salvatore Cavalieri and Roberto Gueli
Energies 2024, 17(20), 5163; https://doi.org/10.3390/en17205163 - 17 Oct 2024
Viewed by 99
Abstract
In light of challenges like climate change, pollution, and depletion of fossil fuel reserves, governments and businesses prioritize renewable energy sources such as solar, wind, and hydroelectric power. Renewable energy forecasting models play a crucial role for energy market operators and prosumers, aiding [...] Read more.
In light of challenges like climate change, pollution, and depletion of fossil fuel reserves, governments and businesses prioritize renewable energy sources such as solar, wind, and hydroelectric power. Renewable energy forecasting models play a crucial role for energy market operators and prosumers, aiding in planning, decision-making, optimization of energy sales, and evaluation of investments. This study aimed to develop machine learning models for hydropower forecasting in plants integrated into Water Distribution Systems, where energy is generated from water flow used for municipal water supply. The study involved developing and comparing monthly and two-week forecasting models, utilizing both one-step-ahead and two-step-ahead forecasting methodologies, along with different missing data imputation techniques. The tested algorithms—Seasonal Autoregressive Integrated Moving Average, Random Forest, Temporal Convolutional Network, and Neural Basis Expansion Analysis for Time Series—produced varying levels of performance. The Random Forest model proved to be the most effective for monthly forecasting, while the Temporal Convolutional Network delivered the best results for two-week forecasting. Across all scenarios, the seasonal–trend decomposition using the LOESS technique emerged as the most successful for missing data imputation. The accurate predictions obtained demonstrate the effectiveness of using these models for energy planning and decision-making. Full article
Show Figures

Figure 1

Figure 1
<p>Comparison between time series with outliers (in gray) and time series after outliers detection through single boxplot (in blue).</p>
Full article ">Figure 2
<p>Comparison between time series with outliers (in gray) and time series after outliers detection through multiple boxplots (in green).</p>
Full article ">Figure 3
<p>Comparison between the two time series aggregated monthly after reconstruction with the two different methods. The gray dashed line in the plot defines the starting point of the non-reconstructed data, which is used as the test set.</p>
Full article ">Figure 4
<p>Comparison between the two time series aggregated every two weeks after reconstruction with the two different methods. The gray dashed line in the plot defines the starting point of the non-reconstructed data, which is used as the test set.</p>
Full article ">Figure 5
<p>Comparison between actual and forecasted values for the four monthly models using datasets with STL reconstruction.</p>
Full article ">Figure 6
<p>SMAPE results of the four best-performing models for monthly hydropower forecasts.</p>
Full article ">Figure 7
<p>Comparison between actual and forecasted values from the four two-week models using datasets with STL reconstruction.</p>
Full article ">Figure 8
<p>SMAPE results of the four best-performing models for two-week hydropower forecasts.</p>
Full article ">
29 pages, 7541 KiB  
Article
Machine Learning-Based Water Quality Classification Assessment
by Wenliang Chen, Duo Xu, Bowen Pan, Yuan Zhao and Yan Song
Water 2024, 16(20), 2951; https://doi.org/10.3390/w16202951 - 17 Oct 2024
Viewed by 131
Abstract
Water is a vital resource, and its quality has a direct impact on human health. Groundwater, as one of the primary water sources, requires careful monitoring to ensure its safety. Although manual methods for testing water quality are accurate, they are often time-consuming, [...] Read more.
Water is a vital resource, and its quality has a direct impact on human health. Groundwater, as one of the primary water sources, requires careful monitoring to ensure its safety. Although manual methods for testing water quality are accurate, they are often time-consuming, costly, and inefficient when dealing with large and complex data sets. In recent years, machine learning has become an effective alternative for water quality assessment. However, current approaches still face challenges, such as the limited performance of individual models, minimal improvements from optimization algorithms, lack of dynamic feature weighting mechanisms, and potential information loss when simplifying model inputs. To address these challenges, this paper proposes a hybrid model, BS-MLP, which combines GBDT (gradient-boosted decision tree) and MLP (multilayer perceptron). The model leverages GBDT’s strength in feature selection and MLP’s capability to manage nonlinear relationships, enabling it to capture complex interactions between water quality parameters. We employ Bayesian optimization to fine-tune the model’s parameters and introduce a feature-weighting attention mechanism to develop the BS-FAMLP model, which dynamically adjusts feature weights, enhancing generalization and classification accuracy. In addition, a comprehensive parameter selection strategy is employed to maintain data integrity. These innovations significantly improve the model’s classification performance and efficiency in handling complex water quality environments and imbalanced datasets. This model was evaluated using a publicly available groundwater quality dataset consisting of 188,623 samples, each with 15 water quality parameters and corresponding labels. The BS-FAMLP model shows strong classification performance, with optimized hyperparameters and an adjusted feature-weighting attention mechanism. Specifically, it achieved an accuracy of 0.9616, precision of 0.9524, recall of 0.9655, F1 Score of 0.9589, and an AUC score of 0.9834 on the test set. Compared to single models, classification accuracy improved by approximately 10%, and when compared to other hybrid models with additional attention mechanisms, BS-FAMLP achieved an optimal balance between classification performance and computational efficiency. The core objective of this study is to utilize the acquired water quality parameter data for efficient classification and assessment of water samples, with the aim of streamlining traditional laboratory-based water quality analysis processes. By developing a reliable water quality classification model, this research provides robust technical support for water safety management. Full article
(This article belongs to the Section Water Quality and Contamination)
Show Figures

Figure 1

Figure 1
<p>Timeline of research work [<a href="#B12-water-16-02951" class="html-bibr">12</a>,<a href="#B13-water-16-02951" class="html-bibr">13</a>,<a href="#B14-water-16-02951" class="html-bibr">14</a>,<a href="#B15-water-16-02951" class="html-bibr">15</a>,<a href="#B16-water-16-02951" class="html-bibr">16</a>,<a href="#B17-water-16-02951" class="html-bibr">17</a>,<a href="#B18-water-16-02951" class="html-bibr">18</a>].</p>
Full article ">Figure 2
<p>The methodological process followed in this article.</p>
Full article ">Figure 3
<p>Structure of the BS-FAMLP model.</p>
Full article ">Figure 4
<p>Use a box plot to detect outliers in the water quality parameters of a dataset.</p>
Full article ">Figure 5
<p>RMSE, MAE, and R<sup>2</sup> after filling with different values of K.</p>
Full article ">Figure 6
<p>Histogram of the data distribution to observe the distribution of water quality parameters in the dataset.</p>
Full article ">Figure 7
<p>Heatmap of the correlation between water quality features and labels.</p>
Full article ">Figure 8
<p>ROC curve of a single model on the test set.</p>
Full article ">Figure 9
<p>Optimization process of GBDT model parameters in the Bagging layer.</p>
Full article ">Figure 10
<p>Optimization process of GBDT model parameters in the Stacking layer.</p>
Full article ">Figure 11
<p>Loss function curve of each hybrid model on the test set.</p>
Full article ">Figure 12
<p>ROC curve of each hybrid model on the test set.</p>
Full article ">
20 pages, 962 KiB  
Article
Fast Global and Local Semi-Supervised Learning via Matrix Factorization
by Yuanhua Du, Wenjun Luo, Zezhong Wu and Nan Zhou
Mathematics 2024, 12(20), 3242; https://doi.org/10.3390/math12203242 - 16 Oct 2024
Viewed by 296
Abstract
Matrix factorization has demonstrated outstanding performance in machine learning. Recently, graph-based matrix factorization has gained widespread attention. However, graph-based methods are only suitable for handling small amounts of data. This paper proposes a fast semi-supervised learning method using only matrix factorization, which considers [...] Read more.
Matrix factorization has demonstrated outstanding performance in machine learning. Recently, graph-based matrix factorization has gained widespread attention. However, graph-based methods are only suitable for handling small amounts of data. This paper proposes a fast semi-supervised learning method using only matrix factorization, which considers both global and local information. By introducing bipartite graphs into symmetric matrix factorization, the technique can handle large datasets effectively. It is worth noting that by utilizing tag information, the proposed symmetric matrix factorization becomes convex and unconstrained, i.e., the non-convex problem minx(1x2)2 is transformed into a convex problem. This allows it to be optimized quickly using state-of-the-art unconstrained optimization algorithms. The computational complexity of the proposed method is O(nmd), which is much lower than that of the original symmetric matrix factorization, which is O(n2d), and even lower than that of other anchor-based methods, which is O(nmd+m2n+m3), where n represents the number of samples, d represents the number of features, and mn represents the number of anchors. The experimental results on multiple public datasets indicate that the proposed method achieves higher performance in less time. Full article
Show Figures

Figure 1

Figure 1
<p>Links between samples and anchors, where <math display="inline"><semantics> <msub> <mi>x</mi> <mn>1</mn> </msub> </semantics></math>–<math display="inline"><semantics> <msub> <mi>x</mi> <mn>7</mn> </msub> </semantics></math> represent the seven samples and <math display="inline"><semantics> <msub> <mi>u</mi> <mn>1</mn> </msub> </semantics></math>–<math display="inline"><semantics> <msub> <mi>u</mi> <mn>4</mn> </msub> </semantics></math> represent the four anchors.</p>
Full article ">Figure 2
<p>Clustering performance with different labeled samples on (<b>a</b>) COIL20 dataset, (<b>b</b>) YaleB dataset, (<b>c</b>) COIL100 dataset, (<b>d</b>) USPS dataset, (<b>e</b>) MNIST dataset, and (<b>f</b>) Letters dataset.</p>
Full article ">Figure 3
<p>Sensitivity of FGLMF on (<b>a</b>) COIL20 dataset, (<b>b</b>) YaleB dataset, (<b>c</b>) COIL100 dataset, (<b>d</b>) USPS dataset, (<b>e</b>) MNIST dataset, and (<b>f</b>) Letters dataset.</p>
Full article ">Figure 3 Cont.
<p>Sensitivity of FGLMF on (<b>a</b>) COIL20 dataset, (<b>b</b>) YaleB dataset, (<b>c</b>) COIL100 dataset, (<b>d</b>) USPS dataset, (<b>e</b>) MNIST dataset, and (<b>f</b>) Letters dataset.</p>
Full article ">Figure 4
<p>Accuracy vs. time of FGLMF with different anchors on (<b>a</b>) COIL20 dataset, (<b>b</b>) YaleB dataset, (<b>c</b>) COIL100 dataset, (<b>d</b>) USPS dataset, (<b>e</b>) MNIST dataset, and (<b>f</b>) Letters dataset.</p>
Full article ">Figure 4 Cont.
<p>Accuracy vs. time of FGLMF with different anchors on (<b>a</b>) COIL20 dataset, (<b>b</b>) YaleB dataset, (<b>c</b>) COIL100 dataset, (<b>d</b>) USPS dataset, (<b>e</b>) MNIST dataset, and (<b>f</b>) Letters dataset.</p>
Full article ">Figure 5
<p>Bases generated by FGLMF. The first row is COIL20, the second is YaleB, the third is COIL100, the fourth is USPS, the fifth is MNIST, and the sixth is Letters.</p>
Full article ">Figure 6
<p>Adjacency matrix on COIL20 dataset: (<b>a</b>) bipartite graph generated by Equation (3); (<b>b</b>) full adjacency matrix generated by bipartite using Equation (6); (<b>c</b>) normalized full adjacency matrix generated by Gaussian kernel.</p>
Full article ">Figure 7
<p>Convergence curve on (<b>a</b>) COIL20 dataset, (<b>b</b>) USPS dataset, (<b>c</b>) COIL100 dataset, (<b>d</b>) USPS dataset, (<b>e</b>) MNIST dataset, and (<b>f</b>) Letters dataset.</p>
Full article ">
21 pages, 6930 KiB  
Article
Fault Diagnosis Method for Hydropower Units Based on Dynamic Mode Decomposition and the Hiking Optimization Algorithm–Extreme Learning Machine
by Dan Lin, Yan Wang, Hua Xin, Xiaoyan Li, Shaofei Xu, Wei Zhou and Hui Li
Energies 2024, 17(20), 5159; https://doi.org/10.3390/en17205159 (registering DOI) - 16 Oct 2024
Viewed by 275
Abstract
The diagnosis of vibration faults in hydropower units is essential for ensuring the safe and stable operation of these systems. This paper proposes a fault diagnosis method for hydropower units that combines Dynamic Mode Decomposition (DMD) with an optimized Extreme Learning Machine (ELM) [...] Read more.
The diagnosis of vibration faults in hydropower units is essential for ensuring the safe and stable operation of these systems. This paper proposes a fault diagnosis method for hydropower units that combines Dynamic Mode Decomposition (DMD) with an optimized Extreme Learning Machine (ELM) utilizing the Hiking Optimization Algorithm (HOA). To address the issue of noise interference in the vibration signals of hydropower units, this study employs DMD technology alongside a thresholding technique for noise reduction, demonstrating its effectiveness through comparative trials. Furthermore, to facilitate a thorough analysis of the operational status of hydropower units, this paper extracts multidimensional features from denoised signals. To improve the efficiency of model training, Principal Component Analysis (PCA) is applied to streamline the data. Given that the weights and biases of the ELM are generated randomly, which may impact the model’s stability and generalization capabilities, the HOA is introduced for optimization. The HOA-ELM model achieved a classification accuracy of 95.83%. A comparative analysis with alternative models substantiates the superior performance of the HOA-ELM model in the fault diagnosis of hydropower units. Full article
(This article belongs to the Section F3: Power Electronics)
Show Figures

Figure 1

Figure 1
<p>Multidimensional feature vector.</p>
Full article ">Figure 2
<p>Cumulative contribution of the major components.</p>
Full article ">Figure 3
<p>ELM structural schematic.</p>
Full article ">Figure 4
<p>HOA-ELM model flowchart.</p>
Full article ">Figure 5
<p>Fault diagnosis flowchart based on DMD and HOA-ELM.</p>
Full article ">Figure 6
<p>Sensor mounting positions: (<b>a</b>) upper guide swing in the x-direction; (<b>b</b>) lower guide swing in the x-direction; (<b>c</b>) upper machine frame vibration in the x-direction; and (<b>d</b>) lower machine frame vibration in the x-direction.</p>
Full article ">Figure 6 Cont.
<p>Sensor mounting positions: (<b>a</b>) upper guide swing in the x-direction; (<b>b</b>) lower guide swing in the x-direction; (<b>c</b>) upper machine frame vibration in the x-direction; and (<b>d</b>) lower machine frame vibration in the x-direction.</p>
Full article ">Figure 7
<p>Normal conditions. (<b>a</b>)Time-domain graph. (<b>b</b>) Frequency-domain graph.</p>
Full article ">Figure 8
<p>Stator–rotor rub. (<b>a</b>) Time-domain graph. (<b>b</b>) Frequency-domain graph.</p>
Full article ">Figure 9
<p>Thrust head looseness. (<b>a</b>) Time-domain graph. (<b>b</b>) Frequency-domain graph.</p>
Full article ">Figure 10
<p>Rotor misalignment. (<b>a</b>) Time-domain graph. (<b>b</b>) Frequency-domain graph.</p>
Full article ">Figure 11
<p>DMD results. (<b>a</b>) Time-domain graph. (<b>b</b>) Frequency-domain graph.</p>
Full article ">Figure 12
<p>DMD denoising results. (<b>a</b>) Time-domain graph. (<b>b</b>) Frequency-domain graph.</p>
Full article ">Figure 13
<p>Comparison chart of denoising methods. (<b>a</b>) EMD; (<b>b</b>) EEMD; and (<b>c</b>)EWT.</p>
Full article ">Figure 14
<p>Convergence curve of the fitness function.</p>
Full article ">Figure 15
<p>Training set diagnostic results. (<b>a</b>) Diagnosis accuracy. (<b>b</b>) Confusion matrix.</p>
Full article ">Figure 16
<p>Outcomes from various classification models. (<b>a</b>) HOA-ELM; (<b>b</b>) HOA-BP; (<b>c</b>) HOA-SVM; (<b>d</b>) SSA-ELM; (<b>e</b>) GWO-ELM; and (<b>f</b>)ELM.</p>
Full article ">Figure 16 Cont.
<p>Outcomes from various classification models. (<b>a</b>) HOA-ELM; (<b>b</b>) HOA-BP; (<b>c</b>) HOA-SVM; (<b>d</b>) SSA-ELM; (<b>e</b>) GWO-ELM; and (<b>f</b>)ELM.</p>
Full article ">Figure 17
<p>Diagnostic model accuracy.</p>
Full article ">
16 pages, 1049 KiB  
Article
Border Gateway Protocol Route Leak Detection Technique Based on Graph Features and Machine Learning
by Chen Shen, Ruixin Wang, Xiang Li, Peiying Zhang, Kai Liu and Lizhuang Tan
Electronics 2024, 13(20), 4072; https://doi.org/10.3390/electronics13204072 - 16 Oct 2024
Viewed by 233
Abstract
In the Internet, ASs are interconnected using BGP. However, due to a lack of security considerations in the design of BGP, a series of security issues arise during the propagation of routing information, such as prefix hijacking, route leakage, and AS path tampering. [...] Read more.
In the Internet, ASs are interconnected using BGP. However, due to a lack of security considerations in the design of BGP, a series of security issues arise during the propagation of routing information, such as prefix hijacking, route leakage, and AS path tampering. Therefore, this paper conducts research on the detection of route leakage. By analyzing BGP routing information, we abstract the routing propagation relationship between ASs into a network topology graph, and extract graph features from the graph abstracted from routing data at certain time intervals. Based on the structural robustness features and centrality measurement features of the graph, we determine whether a route leakage has occurred during the current time period. To this end, we use machine learning methods and propose a weighted voting model. This model trains multiple single models and assigns weights to them, and through the weighted analysis of the results of multiple models, it can determine whether a route leakage has occurred. In addition, to determine the corresponding weights, we use genetic algorithms for identifying route leaks. The experimental results show that the method used in this paper has a high accuracy rate, and compared with a single model, it performs better on multiple datasets. Full article
Show Figures

Figure 1

Figure 1
<p>Data collection and processing process.</p>
Full article ">Figure 2
<p>Schematic diagram of weighted voting model.</p>
Full article ">Figure 3
<p>Genetic algorithms.</p>
Full article ">Figure 4
<p>TTNet feature line chart 1.</p>
Full article ">Figure 5
<p>TTNet feature line chart 2.</p>
Full article ">Figure 6
<p>TTNet feature line chart 3.</p>
Full article ">Figure 7
<p>TTNet feature line chart 4.</p>
Full article ">Figure 8
<p>Simulation experiment results.</p>
Full article ">
15 pages, 3155 KiB  
Article
Machine Learning Ensemble Methodologies for the Prediction of the Failure Mode of Reinforced Concrete Beam–Column Joints
by Martha Karabini, Ioannis Karampinis, Theodoros Rousakis, Lazaros Iliadis and Athanasios Karabinis
Information 2024, 15(10), 647; https://doi.org/10.3390/info15100647 - 16 Oct 2024
Viewed by 181
Abstract
One of the most critical aspects in the seismic behavior or reinforced concrete (RC) structures pertains to beam–column joints. Modern seismic design codes dictate that, if failure is to occur, then this should be the ductile yielding of the beam and not brittle [...] Read more.
One of the most critical aspects in the seismic behavior or reinforced concrete (RC) structures pertains to beam–column joints. Modern seismic design codes dictate that, if failure is to occur, then this should be the ductile yielding of the beam and not brittle shear failure of the joint, which can lead to sudden collapse and loss of human lives. To this end, it is imperative to be able to predict the failure mode of RC joints for a large number of structures in a building stock. In this research effort, various ensemble machine learning algorithms were employed to develop novel, robust classification models. A dataset comprising 486 measurements from real experiments was utilized. The performance of the employed classifiers was assessed using Precision, Recall, F1-Score, and overall Accuracy indices. N-fold cross-validation was employed to enhance generalization. Moreover, the obtained models were compared to the available engineering ones currently adopted by many international organizations and researchers. The novel ensemble models introduced in this research were proven to perform much better by improving the obtained accuracy by 12–18%. The obtained metrics also presented small variability among the examined failure modes, indicating unbiased models. Overall, the results indicate that the proposed methodologies can be confidently employed for the prediction of the failure mode of RC joints. Full article
(This article belongs to the Special Issue Real-World Applications of Machine Learning Techniques)
Show Figures

Figure 1

Figure 1
<p>Distributions and descriptive statistics of the features and target variable. In the above, <math display="inline"><semantics> <mi>μ</mi> </semantics></math>, <math display="inline"><semantics> <mi>σ</mi> </semantics></math>, and <math display="inline"><semantics> <msub> <mi>q</mi> <mn>50</mn> </msub> </semantics></math> correspond to the mean, standard deviation, and median, respectively.</p>
Full article ">Figure 2
<p>Pairplot of the independent variables in the dataset. In each subplot, <math display="inline"><semantics> <mi>ρ</mi> </semantics></math> corresponds to the Pearson correlation coefficient defined in Equation (<a href="#FD1-information-15-00647" class="html-disp-formula">1</a>).</p>
Full article ">Figure 3
<p>Illustration of ensemble methodologies. (<b>a</b>) Bagging, (<b>b</b>) boosting, and (<b>c</b>) stacking [<a href="#B43-information-15-00647" class="html-bibr">43</a>]. In the above, <math display="inline"><semantics> <mrow> <msubsup> <mi>X</mi> <mi>t</mi> <mo>*</mo> </msubsup> <mo>,</mo> <mi>t</mi> <mo>=</mo> <mn>1</mn> <mo>,</mo> <mn>2</mn> <mo>,</mo> <mo>.</mo> <mo>.</mo> <mo>.</mo> <mo>,</mo> <mi>T</mi> </mrow> </semantics></math>, are the transformed training datasets examined in the previous paragraphs.</p>
Full article ">Figure 4
<p>Comparison of the cross-validated performance metrics of the ensembles.</p>
Full article ">Figure 5
<p>ROC curve for the best-performing classifier, XGBoost.</p>
Full article ">Figure 6
<p>Indicative moment–rotation (<math display="inline"><semantics> <mrow> <mi>M</mi> <mo>−</mo> <mi>ϕ</mi> </mrow> </semantics></math>) curve for an idealized elastic–plastic beam with hardening. The area under the curve corresponds to the amount of seismic energy that the structural system absorbs.</p>
Full article ">
Back to TopTop