Asmaa Saad

University of Sadat City, Information System, Faculty Member

Dr. Asmaa Saad is an assistant professor in the Information Systems Department at the Faculty of Computers and Artificial Intelligence, University of Sadat City, Egypt. She works as the deputy director of the Electronic Tests Unit, University of Sadat City. Dr. Asmaa received her B.Sc. degree in 2011, M.Sc. degree in 2016, and Ph.D. in 2022 from the Information Systems Department at the Faculty of Computers and Information, Menoufia University. She is a senior member of the Scientific Research School of Egypt (SRSEG). She has also been invited to serve as a reviewer for many international conferences and journals. Dr. Asmaa's research interests include Data Mining, Information Systems, Data Quality, and Artificial Intelligence.

Papers

Enhanced Compressed Maximal Frequent Patterns from COVID-19 Streaming Data

Studies in Informatics and Control, 2022

The Coronavirus disease (COVID-19) pandemic has led to a huge loss of human life. It has also severely
affected the economic, social, and health systems around the world. Frequent pattern mining is one of the main research
topics in data stream mining. It is significant in many critical applications, especially in the medical field. This paper
proposes a Compressed Maximal Frequent Pattern based on a Damped Window model over a data stream (CMFP-DW).
Its main contribution is to integrate the concept of correlation with the purpose of finding valuable patterns that are highly
correlated. As such, a new type of pattern is defined, namely the correlated compressed maximal frequent pattern. The
CMFP-DW approach is employed for mining accurate correlated maximal frequent patterns from streaming data, and it
has been validated against a real-world COVID-19 dataset from the healthcare domain. Frequent patterns generated from
this dataset are exploited with the purpose of detecting the COVID-19 cases in different countries of the world. This helps
decision makers take the appropriate precautions to prevent the further spread of the COVID-19 pandemic across the world.
The six experiments carried out show that the proposed approach outperforms two other existing approaches, namely the
estDec and the CP-Tree algorithms regarding accuracy in extracting correlated maximal frequent patterns, memory usage,
and the required response time.
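
The damped window model named in the abstract gives recent transactions more weight than older ones. The following is a minimal sketch of that idea only, not the CMFP-DW algorithm itself: the function name `damped_support`, the `decay` factor, the `max_len` limit, and the sample symptoms are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def damped_support(stream, decay=0.9, max_len=2):
    """Return decayed counts of itemsets (up to max_len items) over a stream.

    Hypothetical helper: each time a new transaction arrives, every existing
    count is multiplied by `decay`, so older occurrences contribute less.
    """
    counts = defaultdict(float)
    for transaction in stream:
        # Age every itemset count seen so far.
        for itemset in counts:
            counts[itemset] *= decay
        # Add the itemsets contained in the new transaction with weight 1.
        items = sorted(set(transaction))
        for size in range(1, max_len + 1):
            for itemset in combinations(items, size):
                counts[itemset] += 1.0
    return dict(counts)


# Illustrative data only; these records are not from the paper's dataset.
stream = [["fever", "cough"], ["fever"], ["cough"]]
print(damped_support(stream, decay=0.5))
```

With a decay factor below 1, an itemset that stops appearing fades out of the frequent set over time, which is the property that distinguishes a damped window from a plain running count.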

Frequent Pattern Mining over Streaming Data: From models to research challenges

IJCI. International Journal of Computers and Information, 2021

Research in frequent pattern mining from
streaming data has become a leading topic in the field of information
systems. The data stream is a continuous flow of data generated
from different sources. Extracting frequent patterns from
streaming data raises new challenges for the data mining
community. We present an overview of the growing field of data
streams. Many applications handle streaming data such as
sensor networks, traffic management, log data, telephone call
records, and social networks. These applications generate high
volumes of streaming data at high velocity, which is difficult to
handle with traditional data mining techniques. This paper
mainly reviews the research algorithms, scientific
practices, and methods that have been developed for mining
frequent patterns from streaming data. In addition, it discusses
well-known open-source software and tools that are being
developed for data stream mining. Finally,
it summarizes the open issues and challenges facing existing
approaches when handling and processing data streams in real-world applications.

Enhancement of Data Quality in Health Care Industry: A Promising Data Quality Approach

IGI Global, 2017

Ensuring data quality is a growing challenge, particularly in emerging big data applications. This chapter highlights data quality concepts, terminologies, and techniques, as well as research issues. Recent studies have shown that databases often suffer from inconsistent data, which ought to be resolved in the cleaning process. Data mining techniques can play a key role in ensuring data quality and can be reused efficiently in the data cleaning process. In this chapter, we introduce an approach for dependably and autonomously generating rules from the databases themselves, in order to detect data inconsistency problems in large databases. The proposed approach employs confidence and lift measures with integrity constraints to guarantee that the generated rules are minimal, non-redundant, and precise. Since healthcare applications are critical, and managing healthcare environments efficiently improves patient care, the proposed approach is validated against several datasets from the healthcare environment. It provides clinicians with an automated approach for enhancing the quality of electronic medical records. We experimentally demonstrate that the proposed approach achieves significant enhancement over existing approaches.
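
The confidence and lift measures mentioned in the abstract have standard definitions for a rule A → C: confidence is support(A ∪ C) / support(A), and lift is confidence divided by the relative support of C. A minimal sketch of both follows; the function name `confidence_and_lift` and the attribute=value records are assumptions for illustration and do not reproduce the chapter's rule generation procedure.

```python
def confidence_and_lift(transactions, antecedent, consequent):
    """Compute confidence and lift of the rule antecedent -> consequent.

    Hypothetical helper: each transaction is an iterable of items, here
    attribute=value strings taken from one database record.
    """
    antecedent, consequent = set(antecedent), set(consequent)
    n = len(transactions)
    count_a = sum(1 for t in transactions if antecedent <= set(t))
    count_c = sum(1 for t in transactions if consequent <= set(t))
    count_both = sum(
        1 for t in transactions if (antecedent | consequent) <= set(t)
    )
    if n == 0 or count_a == 0 or count_c == 0:
        raise ValueError("antecedent and consequent must each occur at least once")
    confidence = count_both / count_a
    # Lift above 1 means the antecedent makes the consequent more likely.
    lift = confidence / (count_c / n)
    return confidence, lift


# Illustrative records only.
records = [
    {"city=Cairo", "code=02"},
    {"city=Cairo", "code=02"},
    {"city=Cairo", "code=03"},
    {"city=Giza", "code=02"},
]
print(confidence_and_lift(records, {"city=Cairo"}, {"code=02"}))
```

A rule with high confidence but lift close to 1 mostly reflects how common the consequent already is, which is one reason to use both measures together when selecting rules.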

Efficient Dependable Rules Generation Approach for Data Quality Enhancement

2015 25th International Conference on Computer Theory and Applications (ICCTA), 2015

In the area of data quality research, enhancing data quality is still a big challenge, especially in large databases. Data mining techniques can be efficiently utilized in the data cleaning process. Databases often suffer from data inconsistency, for which no definitive solution exists so far. In this paper, we tackle the problem of detecting data inconsistency in large databases. We propose an approach for discovering dependable rules from the databases themselves. The generated rules are minimal and non-redundant, yet they cover all rules among the patterns in the database. The proposed approach focuses mainly on generating precise, dependable data quality rules by extracting maximal frequent patterns, which serve as an effective pruning mechanism to reduce the search space. The proposed approach is validated against several datasets from different application domains. Experimental results demonstrate that our approach outperforms other approaches in terms of efficiency, accuracy, and scalability on both real-life and synthetic datasets.
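
A frequent pattern is maximal when none of its proper supersets is frequent, so the maximal patterns summarize the whole frequent set compactly. The sketch below shows that definition with a brute-force search; it is not the paper's mining algorithm, and the names `frequent_itemsets`, `maximal_itemsets`, and `min_count` are assumptions for illustration.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_count):
    """Return {itemset: support count} for itemsets seen at least min_count times."""
    items = sorted({item for t in transactions for item in t})
    tx = [frozenset(t) for t in transactions]
    frequent = {}
    for size in range(1, len(items) + 1):
        found = False
        for candidate in combinations(items, size):
            itemset = frozenset(candidate)
            support = sum(1 for t in tx if itemset <= t)
            if support >= min_count:
                frequent[itemset] = support
                found = True
        if not found:
            # No frequent itemset of this size, so none can exist at larger sizes.
            break
    return frequent


def maximal_itemsets(frequent):
    """Keep only the itemsets that have no frequent proper superset."""
    return {s for s in frequent if not any(s < other for other in frequent)}


# Illustrative data only.
data = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b"}]
print(maximal_itemsets(frequent_itemsets(data, min_count=2)))
```

Every frequent itemset is a subset of some maximal one, which is why restricting rule generation to maximal patterns can shrink the search space.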

Fixing rules for data cleaning based on conditional functional dependency

Future Computing and Informatics Journal

Most existing databases suffer from data inconsistencies. Efforts to enhance data quality are necessary to resolve this issue. In this paper, two
techniques are proposed for mining accurate conditional functional dependency rules from such databases, to be employed for data cleaning.
The idea of the proposed techniques is to first mine maximal closed frequent patterns and then mine the dependable conditional functional dependency
rules with the help of the lift measure. Moreover, a data repairing algorithm is proposed for fixing inconsistent tuples found in the database
by exploiting the generated rules. An extensive experimental study is conducted to confirm the effectiveness of the proposed techniques compared
with an existing technique on both real-life and synthetic medical data sets.
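
To make the idea of rule-based repair concrete, the sketch below applies one constant conditional functional dependency: every tuple that matches the condition must carry the stated consequence. It illustrates the general notion only, not the paper's repairing algorithm; the function names, attribute names, and sample rows are all hypothetical.

```python
def find_violations(rows, condition, consequence):
    """Return indices of rows that match `condition` but contradict `consequence`."""
    violations = []
    for i, row in enumerate(rows):
        matches = all(row.get(attr) == val for attr, val in condition.items())
        agrees = all(row.get(attr) == val for attr, val in consequence.items())
        if matches and not agrees:
            violations.append(i)
    return violations


def repair(rows, condition, consequence):
    """Return a repaired copy of `rows`; the input list is left untouched."""
    bad = set(find_violations(rows, condition, consequence))
    return [
        {**row, **consequence} if i in bad else dict(row)
        for i, row in enumerate(rows)
    ]


# Hypothetical rule: (country = EG, area_code = 02) -> city = Cairo.
rows = [
    {"country": "EG", "area_code": "02", "city": "Cairo"},
    {"country": "EG", "area_code": "02", "city": "Giza"},
    {"country": "EG", "area_code": "03", "city": "Alexandria"},
]
rule_if = {"country": "EG", "area_code": "02"}
rule_then = {"city": "Cairo"}
print(find_violations(rows, rule_if, rule_then))
print(repair(rows, rule_if, rule_then))
```

Overwriting the consequence attribute is the simplest possible repair policy; it assumes the rule is more trustworthy than the stored value, which is why the dependability of the mined rules matters.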

Automatic Rules Generation Approach for Data Cleaning in Medical Applications

Advances in Intelligent Systems and Computing, 2015

Data quality is considered a crucial challenge in emerging big data scenarios.
Data mining techniques can be reused efficiently in the data cleaning process.
Recent studies have shown that databases often suffer from inconsistent data
issues, which ought to be resolved in the cleaning process. In this paper, we
introduce an automated approach for dependably generating rules from the databases
themselves, in order to detect data inconsistency problems in large databases.
The proposed approach employs confidence and lift measures with integrity constraints,
in order to guarantee that the generated rules are minimal, non-redundant, and
precise. The proposed approach is validated against several datasets from the healthcare
domain. We experimentally demonstrate that our approach achieves significant
enhancement over existing approaches.
