MDPI - Publisher of Open Access Journals

26 pages, 3237 KiB

Open Access Article

QoS-Aware Power-Optimized Path Selection for Data Center Networks (Q-PoPS)

by Mohammed Nsaif, Gergely Kovásznai, Ali Malik and Ruairí de Fréin

Electronics 2024, 13(15), 2976; https://doi.org/10.3390/electronics13152976 - 28 Jul 2024

Viewed by 656

Data centers consume significant amounts of energy, contributing indirectly to environmental pollution through greenhouse gas emissions during electricity generation. According to the Natural Resources Defense Council, information and communication technologies and networks account for roughly 10% of global energy consumption. Reducing power consumption in Data Center Networks (DCNs) is crucial, especially given that many data center components operate at full capacity even under low traffic conditions, resulting in high costs for both service providers and consumers. Current solutions often prioritize power optimization without considering Quality of Service (QoS). Services such as video streaming and Voice over IP (VoIP) are particularly sensitive to loss or delay and require these metrics to be kept below certain thresholds. This paper introduces a novel framework called QoS-Aware Power-Optimized Path Selection (Q-PoPS) for software-defined DCNs. The objective of Q-PoPS is to minimize DCN power consumption while ensuring that an acceptable QoS is provided, meeting the requirements of DCN services. This paper describes the implementation of a prototype for the Q-PoPS framework that leverages the POX Software-Defined Networking (SDN) controller. The performance of the prototype is evaluated using the Mininet emulator. Our findings demonstrate the performance of the proposed Q-PoPS algorithm in three scenarios. Best-case: Enhancing real-time traffic protocol quality without increasing power consumption. Midrange-case: Replacing bottleneck links while preserving real-time traffic quality. Worst-case: Identifying new paths that may increase power consumption but maintain real-time traffic quality. This paper underscores the need for a holistic approach to DCN management, optimizing both power consumption and QoS for critical real-time applications. We present the Q-PoPS framework as evidence that such an approach is achievable.
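
The trade-off described in this abstract (lowest power subject to QoS limits) can be illustrated with a minimal sketch. This is not the Q-PoPS algorithm itself: the `Path` record, the `select_path` helper, and all power, delay, and loss figures are hypothetical values chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Path:
    """Hypothetical candidate path with its estimated cost and QoS figures."""
    name: str
    power_w: float    # estimated power draw of the links/switches on the path
    delay_ms: float   # measured or estimated end-to-end delay
    loss_pct: float   # measured or estimated packet loss


def select_path(paths: List[Path], max_delay_ms: float,
                max_loss_pct: float) -> Optional[Path]:
    """Return the lowest-power path that satisfies both QoS thresholds."""
    feasible = [p for p in paths
                if p.delay_ms <= max_delay_ms and p.loss_pct <= max_loss_pct]
    if not feasible:
        return None  # no path meets the QoS requirement
    return min(feasible, key=lambda p: p.power_w)


# Illustrative data only: path B is the cheapest but violates the delay limit.
candidates = [
    Path("A", power_w=120.0, delay_ms=45.0, loss_pct=0.2),
    Path("B", power_w=90.0, delay_ms=210.0, loss_pct=0.1),
    Path("C", power_w=150.0, delay_ms=20.0, loss_pct=0.05),
]
best = select_path(candidates, max_delay_ms=150.0, max_loss_pct=1.0)
```

Filtering on the QoS thresholds first and minimizing power second mirrors the abstract's worst-case scenario, where a path with higher power consumption may be accepted to keep real-time traffic quality.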

(This article belongs to the Section Networks)

37 pages, 18482 KiB

Open Access Article

Active Queue Management in L4S with Asynchronous Advantage Actor-Critic: A FreeBSD Networking Stack Perspective

by Deol Satish, Jonathan Kua and Shiva Raj Pokhrel

Future Internet 2024, 16(8), 265; https://doi.org/10.3390/fi16080265 - 25 Jul 2024

Cited by 1 | Viewed by 759

Abstract

Bufferbloat is one of the leading causes of high data transmission latency and jitter on the Internet, which severely impacts the performance of low-latency interactive applications such as online streaming, cloud-based gaming/applications, Internet of Things (IoT) applications, voice over IP (VoIP), real-time video conferencing, and so forth. There is currently a pressing need for developing Transmission Control Protocol (TCP) congestion control algorithms and bottleneck queue management schemes that can collaboratively control/reduce end-to-end latency, thus ensuring optimal quality of service (QoS) and quality of experience (QoE) for users. This paper introduces a novel solution by experimentally integrating the low latency, low loss, and scalable throughput (L4S) architecture (specified by the IETF in RFC 9330) into the FreeBSD framework together with the asynchronous advantage actor-critic (A3C) reinforcement learning algorithm. The first phase involves incorporating a modified dual-queue coupled active queue management (AQM) system for L4S into the FreeBSD networking stack, enhancing queue management and mitigating latency and packet loss. The second phase employs A3C to adjust and fine-tune the system performance dynamically. Finally, we evaluate the proposed solution’s effectiveness through comprehensive experiments, comparing it with traditional AQM-based systems. This paper contributes to the advancement of machine learning (ML) for transport protocol research. The experimental implementation and results presented in this paper are made available through our GitHub repositories.

(This article belongs to the Special Issue Scalable and Distributed Cloud Continuum Orchestration for Next-Generation IoT Applications: Latest Advances and Prospects)

32 pages, 31472 KiB

Open Access Article

Studying the Impact of Different TCP DoS Attacks on the Parameters of VoIP Streams

by Ivan Nedyalkov

Telecom 2024, 5(3), 556-587; https://doi.org/10.3390/telecom5030029 - 8 Jul 2024

Viewed by 710

Abstract

In today’s digital world, no one and nothing is safe from potential cyberattacks. There is also no 100% protection from such attacks. Therefore, it is advisable to carry out various studies related to the effects of the different cyberattacks on the performance of the specific devices under attack. In this work, a study was carried out to determine how individual TCP DoS attacks affect the parameters of VoIP (Voice over IP) voice and video streams. For the purpose of this work, a model of a simple IP network has been created using the GNS3 IP network-modeling platform. The VoIP platform used was Asterisk Free PBX. Tools from Kali Linux were used to implement the individual TCP DoS attacks; IP-network-monitoring tools and round-trip-delay-measurement tools were also used. The proposed study is applicable to multiple VoIP platforms wherein voice and video traffic are passed/processed by the VoIP server. From the obtained results, it was found that Asterisk Free PBX is very well secured against TCP DoS attacks, which do not affect the platform performance or the parameters of the voice and video streams. The values of the observed parameters, such as jitter, packet loss, and round-trip delay, are well below the maximum allowable values. We also observed a low load on the CPU and RAM of the system during the whole study.

24 pages, 943 KiB

Open Access Article

Navigating Legal and Regulatory Frameworks to Achieve the Resilience and Sustainability of Indigenous Socioecological Systems

by Stephen Chitengi Sakapaji, Jorge García Molinos, Varvara Parilova, Tuyara Gavrilyeva and Natalia Yakovleva

Resources 2024, 13(4), 56; https://doi.org/10.3390/resources13040056 - 8 Apr 2024

Cited by 1 | Viewed by 1869

Abstract

The sustainability of Indigenous Socioecological Systems (ISES) largely depends on well-crafted policy regulations. In particular, Indigenous traditional food systems (ITFS) are an essential component of ISES that provide a variety of culturally accepted, healthy foods while also holding important cultural, spiritual, and economic value for the Indigenous people (IP). Thus, sustainably managing these traditional natural resources must be a priority. As custodians of much of the world’s ecological system, IP have, for generations, exhibited sustainable lifestyles in governing these systems. However, Indigenous perspectives and voices have not been properly reflected in the ISES sustainability discourse, and few comparative case studies have addressed this issue. This study helps fill this research gap using a desktop research method based on the Political Ecological Theoretical Framework (PETF) to examine how existing regulatory policies may affect the resilience and sustainability of ISES-ITFS, especially in relation to growing environmental and climatic pressures. Two Indigenous communities, the Karen in Thailand and different Indigenous groups in the Republic of Sakha (Yakutia) in Russia, are examined as case studies. Our study provides crucial insight that should help the development of robust policy interventions that integrate Indigenous concerns into policies and regulations, emphasizing self-determination, cultural preservation, and land rights. The findings emphasize the necessity for comprehensive legal frameworks prioritizing Indigenous involvement and concerns in climate and sustainability policy implementations. The ultimate goal is to foster meaningful dialogues between policymakers and IP in navigating the climate and sustainability challenges of our time.

Figure 1
Location of our case studies in (a) Thailand and (b) the Sakha Republic. The maps provide approximate distributions of (a) the Karen People in Thailand and (b) the main Indigenous minority Peoples of the North, Siberia, and the Far East in the Sakha Republic. Panel (a) also depicts the location of the Thung Yai Naresuan Wildlife Sanctuary where the Sanephong and Koh Sadueng Karen communities discussed in the text are located. White areas of the Sakha Republic in (b) are ethnically dominated by the Yakuts (Sakha People), a large Turkic ethnic group [33].

Figure 2
Conceptual diagram showing the Political Ecological Theoretical Framework model in the context of our research paper. Social, political, economic, and environmental factors defining the dimensions of the interaction between IP and other actors (national and regional regulatory bodies, industries, research institutions…) shape the formation and implementation of legal and regulatory frameworks that can impact (positively or negatively) the resilience and sustainability of the ISES and ITFS. In the context of this study, over and above the direct effects of these dimensions on ISES-ITFS (grey arrows), we focus on the legal and regulatory framework as an instrument of power channeling and articulating the effects of the different dimensions on the ISES-ITFS (blue arrows). The double head of the grey arrows symbolizes the possibility for IP to exert power on the legal and regulatory system through their actions and agency (e.g., litigation, political representation, public awareness) on all or some of these dimensions.

23 pages, 6802 KiB

Open Access Article

Non-Face-to-Face P2P (Peer-to-Peer) Real-Time Token Payment Blockchain System

by Hyug-Jun Ko, Seong-Soo Han and Chang-Sung Jeong

Appl. Sci. 2023, 13(13), 7364; https://doi.org/10.3390/app13137364 - 21 Jun 2023

Cited by 1 | Viewed by 1838

Abstract

With the increase in intelligent voice phishing and the increasing reliance on open banking systems, there has been a rise in cases where individuals’ personal information has been exposed, resulting in significant financial losses for the victims. Non-face-to-face transactions in the financial sector face challenges such as customer identification, ensuring transaction integrity, and preventing transaction rejection. Blockchain-based distributed ledgers have been proposed as a solution, but their adoption is limited by the difficulty of managing private keys and the burden of managing gas fees. This paper proposes a non-face-to-face P2P real-time token payment system that minimizes the risk of key loss by storing private keys in a keystore file and database through a server-based key management module. The proposed system simplifies token creation and management through a server-based token management module and implements an automatic gas-charging function for smooth token transactions. Transaction integrity and non-repudiation are ensured through a transaction confirmation module that uses transaction IDs without exposing personal information. Furthermore, advanced security measures such as blocking foreign IP access and DDoS defense are implemented to securely protect user data. The proposed system aims to provide a convenient, secure, and accessible online payment solution to the public by implementing a self-authentication function using a web application that is not limited to smartphones or application platforms.

(This article belongs to the Special Issue Blockchain and Intelligent Networking for Smart Applications)

14 pages, 1848 KiB

Open Access Article

NISQE: Non-Intrusive Speech Quality Evaluator Based on Natural Statistics of Mean Subtracted Contrast Normalized Coefficients of Spectrogram

by Shakeel Zafar, Imran Fareed Nizami, Mobeen Ur Rehman, Muhammad Majid and Jihyoung Ryu

Sensors 2023, 23(12), 5652; https://doi.org/10.3390/s23125652 - 16 Jun 2023

Viewed by 1258

Abstract

With the evolution in technology, communication based on the voice has gained importance in applications such as online conferencing, online meetings, voice-over internet protocol (VoIP), etc. Limiting factors such as environmental noise, encoding and decoding of the speech signal, and limitations of technology may degrade the quality of the speech signal. Therefore, there is a requirement for continuous quality assessment of the speech signal. Speech quality assessment (SQA) enables the system to automatically tune network parameters to improve speech quality. Furthermore, there are many speech transmitters and receivers that are used for voice processing, including mobile devices and high-performance computers, that can benefit from SQA. SQA plays a significant role in the evaluation of speech-processing systems. Non-intrusive speech quality assessment (NI-SQA) is a challenging task due to the unavailability of pristine speech signals in real-world scenarios. The success of NI-SQA techniques relies heavily on the features used to assess speech quality. Various NI-SQA methods are available that extract features from speech signals in different domains, but they do not take into account the natural structure of the speech signals for assessment of speech quality. This work proposes a method for NI-SQA based on the natural structure of the speech signals that are approximated using the natural spectrogram statistical (NSS) properties derived from the speech signal spectrogram. The pristine version of the speech signal follows a structured natural pattern that is disrupted when distortion is introduced in the speech signal. The deviation of NSS properties between the pristine and distorted speech signals is utilized to predict speech quality. The proposed methodology shows better performance in comparison to state-of-the-art NI-SQA methods on the Centre for Speech Technology Voice Cloning Toolkit corpus (VCTK-Corpus) with a Spearman’s rank-order correlation coefficient (SRC) of 0.902, Pearson correlation coefficient (PCC) of 0.960, and root mean squared error (RMSE) of 0.206. On the NOIZEUS-960 database, the proposed methodology shows an SRC of 0.958, PCC of 0.960, and RMSE of 0.114.
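
For readers unfamiliar with the three figures of merit quoted above, the following is a minimal sketch of how SRC, PCC, and RMSE are commonly computed between predicted and subjective quality scores. It uses only the Python standard library; the function names are illustrative and nothing here reproduces the paper's data or evaluation code.

```python
import math
from typing import List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient (PCC) between two equal-length series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman rank correlation (SRC): Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def rmse(x: Sequence[float], y: Sequence[float]) -> float:
    """Root mean squared error between predictions and reference scores."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))
```

SRC measures how well the predicted ordering matches the subjective ordering, PCC measures linear agreement, and RMSE measures the absolute prediction error on the score scale.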

(This article belongs to the Section Intelligent Sensors)

16 pages, 1658 KiB

Open Access Article

Detecting SPIT Attacks in VoIP Networks Using Convolutional Autoencoders: A Deep Learning Approach

by Waleed Nazih, Khaled Alnowaiser, Esraa Eldesouky and Osama Youssef Atallah

Appl. Sci. 2023, 13(12), 6974; https://doi.org/10.3390/app13126974 - 9 Jun 2023

Viewed by 1874

Abstract

Voice over Internet Protocol (VoIP) is a technology that enables voice communication to be transmitted over the Internet, transforming communication in both personal and business contexts by offering several benefits such as cost savings and integration with other communication systems. However, VoIP attacks are a growing concern for organizations that rely on this technology for communication. Spam over Internet Telephony (SPIT) is a type of VoIP attack that involves unwanted calls or messages, which can be both annoying and pose security risks to users. Detecting SPIT can be challenging since it is often delivered from anonymous VoIP accounts or spoofed phone numbers. This paper suggests an anomaly detection model that utilizes a deep convolutional autoencoder to identify SPIT attacks. The model is trained on a dataset of normal traffic and then encodes new traffic into a lower-dimensional latent representation. If the network traffic varies significantly from the encoded normal traffic, the model flags it as anomalous. Additionally, the model was tested on two datasets and achieved F1 scores of 99.32% and 99.56%. Furthermore, the proposed model was compared to several traditional anomaly detection approaches and it outperformed them on both datasets.
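
The flagging step described above can be sketched as a simple threshold on reconstruction error. This sketch assumes the per-sample errors have already been produced by some autoencoder; the mean-plus-k-standard-deviations rule, the `fit_threshold` and `flag_anomalies` names, and the error values are illustrative assumptions, not the paper's method or data.

```python
import statistics
from typing import List, Sequence


def fit_threshold(normal_errors: Sequence[float], k: float = 3.0) -> float:
    """Derive a cut-off from errors observed on normal traffic (mean + k * std)."""
    return statistics.mean(normal_errors) + k * statistics.pstdev(normal_errors)


def flag_anomalies(errors: Sequence[float], threshold: float) -> List[bool]:
    """Mark each sample whose reconstruction error exceeds the threshold."""
    return [e > threshold for e in errors]


# Hypothetical reconstruction errors measured on normal traffic.
normal_errors = [0.010, 0.012, 0.011, 0.009, 0.013]
threshold = fit_threshold(normal_errors)

# Hypothetical errors for new traffic; the middle sample reconstructs poorly.
flags = flag_anomalies([0.011, 0.250, 0.012], threshold)
```

Because the model is trained only on normal traffic, traffic it cannot reconstruct well is treated as suspicious; the choice of `k` trades false alarms against missed detections.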

25 pages, 10128 KiB

Open Access Article

Integration of Virtual Reality in the Control System of an Innovative Medical Robot for Single-Incision Laparoscopic Surgery

by Florin Covaciu, Nicolae Crisan, Calin Vaida, Iulia Andras, Alexandru Pusca, Bogdan Gherman, Corina Radu, Paul Tucan, Nadim Al Hajjar and Doina Pisla

Sensors 2023, 23(12), 5400; https://doi.org/10.3390/s23125400 - 7 Jun 2023

Cited by 6 | Viewed by 2231

Abstract

In recent years, there has been an expansion in the development of simulators that use virtual reality (VR) as a learning tool. In surgery where robots are used, VR serves as a revolutionary technology to help medical doctors train in using these robotic systems and accumulate knowledge without risk. This article presents a study in which VR is used to create a simulator designed for robotically assisted single-uniport surgery. The control of the surgical robotic system is achieved using voice commands for laparoscopic camera positioning and via a user interface developed using the Visual Studio program that connects a wristband equipped with sensors attached to the user’s hand for the manipulation of the active instruments. The software consists of the user interface and the VR application, which communicate via the TCP/IP protocol. To study the evolution of the performance of this virtual system, 15 people were involved in the experimental evaluation of the VR simulator built for the robotic surgical system, having to complete a medically relevant task. The experimental data validated the initial solution, which will be further developed.

(This article belongs to the Topic Simulations and Applications of Augmented and Virtual Reality)

11 pages, 2779 KiB

Open Access Article

Enhanced Multiple Speakers’ Separation and Identification for VOIP Applications Using Deep Learning

by Amira A. Mohamed, Amira Eltokhy and Abdelhalim A. Zekry

Appl. Sci. 2023, 13(7), 4261; https://doi.org/10.3390/app13074261 - 28 Mar 2023

Cited by 1 | Viewed by 1951

Abstract

Institutions have been adopting work/study-from-home programs since the pandemic began. They primarily utilise Voice over Internet Protocol (VoIP) software to perform online meetings. This research introduces a new method to enhance the VoIP call experience using deep learning. In this paper, integration between two existing techniques, Speaker Separation and Speaker Identification (SSI), is performed using deep learning methods with effective results as introduced by state-of-the-art research. This integration is applied to a VoIP system. The voice signal is introduced to the speaker separation and identification system to be separated; then, the “main speaker voice” is identified and verified rather than any other human or non-human voices around the main speaker. Then, only this main speaker voice is sent over IP to continue the call process. Currently, the online call system depends on noise cancellation and call quality enhancement. However, this does not address multiple human voices over the call. Filters used in the call process only remove the noise and the interference (de-noising speech) from the speech signal. The presented system is tested with up to four mixed human voices. This system separates only the main speaker voice and processes it prior to transmission over the VoIP call. This paper illustrates the integration of the algorithms using DNN, and the advantages and challenges of voice signal processing, in addition to the importance of computing power for real-time applications.

(This article belongs to the Special Issue Audio and Acoustic Signal Processing)

34 pages, 8701 KiB

Open Access Article

Towards a Smart Environment: Optimization of WLAN Technologies to Enable Concurrent Smart Services

by Ali Mohd Ali, Mohammad R. Hassan, Ahmad al-Qerem, Ala Hamarsheh, Khalid Al-Qawasmi, Mohammad Aljaidi, Ahmed Abu-Khadrah, Omprakash Kaiwartya and Jaime Lloret

Sensors 2023, 23(5), 2432; https://doi.org/10.3390/s23052432 - 22 Feb 2023

Cited by 5 | Viewed by 2340

Abstract

In this research paper, the spatial distributions of five different services—Voice over Internet Protocol (VoIP), Video Conferencing (VC), Hypertext Transfer Protocol (HTTP), File Transfer Protocol (FTP), and Electronic Mail—are investigated using three different approaches: circular, random, and uniform approaches. The amount of each service varies from one to another. In certain distinct settings, which are collectively referred to as mixed applications, a variety of services are activated and configured at predetermined percentages. These services run simultaneously. Furthermore, this paper has established a new algorithm to assess both the real-time and best-effort services of the various IEEE 802.11 technologies, describing the best networking architecture as either a Basic Service Set (BSS), an Extended Service Set (ESS), or an Independent Basic Service Set (IBSS). Accordingly, the purpose of our research is to provide the user or client with an analysis that suggests a suitable technology and network configuration without wasting resources on unnecessary technologies or requiring a complete re-setup. In this context, this paper presents a network prioritization framework for enabling smart environments to determine an appropriate WLAN standard or a combination of standards that best supports a specific set of smart network applications in a specified environment. A network QoS modeling technique for smart services has been derived for assessing best-effort HTTP and FTP, and the real-time performance of VoIP and VC services enabled via IEEE 802.11 protocols in order to discover a more optimal network architecture. A number of IEEE 802.11 technologies have been ranked by using the proposed network optimization technique with separate case studies for the circular, random, and uniform geographical distributions of smart services. The performance of the proposed framework is validated using a realistic smart environment simulation setting, considering both real-time and best-effort services as case studies with a range of metrics related to smart environments.

(This article belongs to the Special Issue AI for Smart Home Automation)

18 pages, 864 KiB

Open Access Article

A Novel Approach for Efficient Mitigation against the SIP-Based DRDoS Attack

by Ismail Melih Tas and Selcuk Baktir

Appl. Sci. 2023, 13(3), 1864; https://doi.org/10.3390/app13031864 - 31 Jan 2023

Cited by 6 | Viewed by 2051

Abstract

Voice over Internet Protocol (VoIP) and its underlying Session Initiation Protocol (SIP) are widely deployed technologies since they provide an efficient and fast means of both voice and data communication over a single network. However, in spite of their advantages, they also have their security threats due to the inherent vulnerabilities in the underlying Internet Protocol (IP) that can potentially be exploited by hackers. This study introduces a novel defense mechanism to effectively combat advanced attacks that exploit vulnerabilities identified in some less-known features of SIP. The SIP-DRDoS (SIP-based distributed reflection denial of service) attack, which can survive the existing security systems, is an advanced attack that can be performed on an SIP network through the multiplication of legitimate traffic. In this study, we propose a novel defense mechanism that consists of statistics, inspection, and action modules to mitigate the SIP-DRDoS attack. We implement the SIP-DRDoS attack by utilizing our SIP-based audit and attack software in our VoIP/SIP security lab environment that simulates an enterprise-grade SIP network. We then utilize our SIP-based defense tool to realize our novel defense mechanism against the SIP-DRDoS attack. Our experimental results prove that our defense approach can do a deep packet analysis for SIP traffic, detect SIP flood attacks, and mitigate them by dropping attack packets. While the SIP-DRDoS attack with around 1 Gbps of traffic dramatically escalates the CPU (central processing unit) usage of the SIP server by up to 74%, our defense mechanism effectively reduces it down to 17% within 6 min after the attack is initiated. Our approach represents a significant advancement over the existing defense mechanisms and demonstrates the potential to effectively protect VoIP systems against SIP-based DRDoS attacks.

23 pages, 2429 KiB

Open Access Article

Call Me Maybe: Using Dynamic Protocol Switching to Mitigate Denial-of-Service Attacks on VoIP Systems

by John Kafke and Thiago Viana

Network 2022, 2(4), 545-567; https://doi.org/10.3390/network2040032 - 18 Oct 2022

Cited by 2 | Viewed by 2187

Abstract

Voice over IP is quickly becoming the industry standard voice communication service. While using an IP-based method of communication has many advantages, it also comes with a new set of challenges; voice networks are now accessible to a multitude of internet-based attackers from anywhere in the world. Among the most prevalent threats to a VoIP network are Denial-of-Service attacks, which consume network bandwidth to congest or disable the communication service. This paper looks at the current state of research into the mitigation of these attacks against VoIP networks, to see if the mechanisms in place are enough. A new framework, titled the “Call Me Maybe” framework, is proposed, combining elements of latency monitoring with dynamic protocol switching to mitigate DoS attacks against VoIP systems. Research conducted around routing VoIP over TCP rather than UDP is integrated into the proposed design, along with a latency monitoring mechanism to detect when the service is under attack. Data gathered from a Cisco Packet Tracer simulation were used to evaluate the effectiveness of the solution. The gathered results have shown that there is a statistically significant improvement in the response times of voice traffic when using the “Call Me Maybe” framework in a network experiencing a DoS attack. The research and findings therefore aim to provide a contribution to the enhancement of the security of VoIP and future IP-based voice communication systems.
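
The combination of latency monitoring and protocol switching described above can be illustrated with a small state machine. This is only a sketch under stated assumptions: the `ProtocolSwitcher` class, the moving-average rule, the 100 ms threshold, and the window size are hypothetical and are not taken from the "Call Me Maybe" framework.

```python
from collections import deque


class ProtocolSwitcher:
    """Switch voice transport from UDP to TCP while average latency is high."""

    def __init__(self, threshold_ms: float = 150.0, window: int = 5) -> None:
        self.threshold_ms = threshold_ms
        self.samples = deque(maxlen=window)  # most recent latency samples
        self.protocol = "UDP"                # assumed default transport

    def observe(self, latency_ms: float) -> str:
        """Record one latency sample and return the transport to use next."""
        self.samples.append(latency_ms)
        # Decide only once the window is full, to avoid reacting to one spike.
        if len(self.samples) == self.samples.maxlen:
            average = sum(self.samples) / len(self.samples)
            self.protocol = "TCP" if average > self.threshold_ms else "UDP"
        return self.protocol


# Illustrative trace: latency jumps when a hypothetical attack begins.
switcher = ProtocolSwitcher(threshold_ms=100.0, window=3)
history = [switcher.observe(ms) for ms in (20, 30, 25, 400, 500, 450)]
```

Averaging over a window rather than reacting to single samples is one simple way to avoid switching protocols on transient jitter; a real deployment would also need to consider how quickly to switch back.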

26 pages, 2813 KiB

Open Access Article

Adaptive QoS-Aware Multi-Metrics Gateway Selection Scheme for Heterogenous Vehicular Network

by Mahmoud Alawi, Raed Alsaqour, Maha Abdelhaq, Reem Alkanhel, Baraa Sharef, Elankovan Sundararajan and Mahamod Ismail

Systems 2022, 10(5), 142; https://doi.org/10.3390/systems10050142 - 7 Sep 2022

Cited by 2 | Viewed by 1877

Abstract

A heterogeneous vehicular network (HetVNET) is a promising network architecture that combines multiple network technologies such as IEEE 802.11p, dedicated short-range communication (DSRC), and third/fourth generation cellular networks (3G/4G). In this network area, vehicle users can use wireless fidelity access points (Wi-Fi APs) to offload 4G long-term evolution (4G-LTE) networks. However, when using Wi-Fi APs, the vehicles must organize themselves and select an appropriate mobile gateway (MGW) to communicate with the cellular infrastructure. Researchers are facing the problem of selecting the best MGW vehicle to aggregate vehicle traffic and reduce LTE load in HetVNETs when the Wi-Fi APs are unavailable for offloading. The selection process incurs extra network overhead and complexity due to the frequent formation of clusters in this highly dynamic environment. In this study, we propose a non-cluster adaptive QoS-aware gateway selection (AQAGS) scheme that autonomously picks a limited number of vehicles to act as LTE gateways based on the LTE network’s load status and the vehicular ad hoc network (VANET) application’s QoS requirements. The present AQAGS scheme focuses on highway scenarios. The proposed scheme was evaluated using the Simulation of Urban MObility (SUMO) and network simulator version 2 (NS2) simulators and benchmarked against clustered and non-clustered schemes. A comparison was made based on the end-to-end delay, throughput, control packet overhead (CPO), and packet delivery ratio (PDR) performance metrics over Voice over Internet Protocol (VoIP) and File Transfer Protocol (FTP) applications. Using VoIP, the AQAGS scheme achieved a 26.7% higher PDR compared with the other schemes.

(This article belongs to the Section Systems Engineering)

21 pages, 624 KiB

Open AccessArticle

A Reinforcement Learning Approach to Speech Coding

by Jerry Gibson and Hoontaek Oh

Information 2022, 13(7), 331; https://doi.org/10.3390/info13070331 - 11 Jul 2022

Cited by 2 | Viewed by 1822

Abstract

Speech coding is an essential technology for digital cellular communications, voice over IP, and video conferencing systems. For more than 25 years, the main approach to speech coding for these applications has been block-based analysis-by-synthesis linear predictive coding. An alternative approach that has been less successful is sample-by-sample tree coding of speech. We reformulate this latter approach as a multistage reinforcement learning problem with L-step lookahead that incorporates exploration and exploitation to adapt model parameters and to control the speech analysis/synthesis process on a sample-by-sample basis. The minimization of the spectrally shaped reconstruction error to finite depth manages complexity and serves as an effective stand-in for the overall subjective evaluation of reconstructed speech quality and intelligibility. Different control policies that attempt to persistently excite the system states and that encourage exploration are studied and evaluated. The resulting methods produce reconstructed speech quality competitive with the most popular speech codec utilized today. This new reinforcement learning formulation provides new insights and opens up new directions for system design and performance improvement.

24 pages, 5210 KiB

Open AccessArticle

Phonation Variation as a Function of Checked Syllables and Prosodic Boundaries

by Xin Gao and Jianjing Kuang

Languages 2022, 7(3), 171; https://doi.org/10.3390/languages7030171 - 5 Jul 2022

Cited by 2 | Viewed by 1981

Abstract

The phonation variation in Shanghainese is influenced by both phonemic phonation contrast and global prosodic context. This study investigated the phonetic realization of checked and unchecked syllables at four different prosodic positions (sandhi-medial, sandhi-final, phrase-final, and IP-final). By analyzing both acoustic and articulatory voice measures, we achieved a better understanding of the nature of the checkedness contrast and prosodic boundaries: (1) Different phonetic correlates are associated with the two laryngeal functions: the checkedness contrast is mostly distinguished by the relative degree of glottal constriction, whereas the prosodic boundaries are mostly associated with periodicity and noise measures. (2) The checkedness contrast is well maintained in all prosodic contexts, suggesting that the controls for the local checkedness contrast are rather independent of global prosody.

(This article belongs to the Special Issue Exploring the Interaction between Phonation and Prosody)

Figure 1. Three types of creakiness: (A) Coda glottal stop: short silence followed by a strong glottal pulse at the end of the syllable. (B) Coda creak: irregular voicing towards the end of the syllable. (C) Broader creak: irregular voicing occurring earlier than the last third of the vowel portion.

Figure 2. Principal Component Analysis of the acoustic space. (a) Color-coded for targets’ phonemic type. (b) Color-coded for targets’ prosodic position. Concentration ellipse level = 0.95.

Figure 3. The loadings for PC1 and PC2 of all acoustic features. The most correlated cues for PC1 are A2*, H1*–A2*, H1*–A1*, A3*, and H1*–A3*; the most correlated cues for PC2 are HNR15, HNR25, HNR35, HNR05, and CPP.

Figure 4. The variation of PC1 influenced by phonemic type and prosodic position. Greater PC1 indicates a more constricted glottis. Significant p-values (p ≤ 0.05) are marked in red, which indicates that the PC1 difference between checked and unchecked syllables is significant in that prosodic position.

Figure 5. The variation of PC2 influenced by phonemic type and prosodic position. Greater PC2 indicates higher periodicity during the vowel portion. The p-values at all prosodic positions are insignificant (p > 0.05, shown in blue); this indicates that the PC2 differences between checked and unchecked syllables are insignificant at all prosodic positions.

Figure 6. The variation of CQ influenced by phonemic type and prosodic position. Significant p-values (p ≤ 0.05) are marked in red, which indicates that the CQ difference between checked and unchecked syllables is significant in that prosodic position.

Figure 7. The variation of PIC influenced by phonemic type and prosodic position. Significant p-values (p ≤ 0.05) are marked in red, which indicates that the PIC difference between checked and unchecked syllables is significant in that prosodic position.

Figure 8. The variation of f0 influenced by phonemic type and prosodic position. Significant p-values (p ≤ 0.05) are marked in red, which indicates that the f0 difference between checked and unchecked syllables is significant in that prosodic position.

Figure 9. The variation of duration influenced by phonemic type and prosodic position. Significant p-values (p ≤ 0.05) are marked in red, which indicates that the duration difference between checked and unchecked syllables is significant in that prosodic position.

Figure 10. The distribution of tokens with three different types of creak (coded in non-gray colors) and tokens without visible creak (coded in gray) among checked and unchecked tones at various prosodic positions.

Search Results (36)
