The International Arab Journal of Information Technology
Privacy-preserving data publishing has been studied widely on static data. However, many recent applications generate data streams that are real-time, unbounded, rapidly changing, and distributed in nature. Recently, few works have addressed k-anonymity and l-diversity for data streams. Their model implies that if the stream is distributed, it is collected at a central site for anonymization. In this paper, we propose a novel distributed model where distributed streams are first anonymized by distributed (collecting) sites before merging and releasing. Our approach extends Continuously Anonymizing STreaming data via adaptive cLustEring (CASTLE) [4], a cluster-based approach that provides both k-anonymity and l-diversity for centralized data streams. The main idea is for each site to construct its local clustering model and exchange this local view with other sites to globally construct approximately the same clustering view. The approach is heuristic in the sense that not every update to t...
This paper focuses on three axes. The first axis gives a survey of the importance of corpora in language studies, e.g. lexicography, grammar, semantics, Natural Language Processing and other areas. The second axis demonstrates how the Arabic language lacks textual resources, such as corpora and tools for corpus analysis, and the effect of this lack on the quality of Arabic language applications. There have rarely been successful trials in compiling Arabic corpora; therefore, the third axis presents the technical design of the International Corpus of Arabic (ICA), a newly established representative corpus of Arabic that is intended to cover the Arabic language as it is used all over the Arab world. The corpus is planned to support various Arabic studies that depend on authentic data, in addition to building Arabic Natural Language Processing applications.
In this paper we introduce a prototype of a Library Information System that uses the Universal Networking Language (UNL) as a means for translating the metadata of books. This prototype is capable of handling the bibliographic information of 1000 books selected from the catalogs of Bibliotheca Alexandrina (B.A.). The paper sheds light, firstly, on the idea of sharing bibliographic information across languages; secondly, on the linguistic and computational challenges faced when trying to execute such an idea using the UNL interlingua; and thirdly, on the implementation of this innovative system.
Despite the focused attention and improvements achieved by the NLP community on various language-related issues of knowledge representation, knowledge representation has not been satisfactorily achieved. This paper focuses on five axes. The first axis deals with presenting a structured representation of linguistic knowledge of natural language sentences through the Universal Networking Language (UNL) framework. The second axis deals with applying the UNL knowledge representation system to Arabic examples, illustrating how effective the UNL system is in representing Arabic morphology, syntax, semantics and pragmatics. The third axis highlights the available corpora of the UNL. The fourth axis highlights the potential applications of the current knowledge representation system in the fields of information extraction, summarization and other applications that seek an understanding of natural language input. The fifth and final axis deals with the evaluation of the UNL system output.
Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, 2014
ABSTRACT The problem of processing continuous spatio-temporal queries has been addressed by many papers. Some papers introduced solutions using a single-server architecture, while others used a distributed-server one. In this paper, we introduce MobiPLACE*, an extension to the PLACE* [13] system, a distributed framework for spatio-temporal data stream processing, exploiting mobile clients’ processing power. We extend the Query-Track-Participate (QTP) query processing model, introduced as a system architecture in PLACE*, by moving the Query server role to mobile clients. This reduces memory and processing load on the regional servers in exchange for a little additional communication and memory load on mobile devices. This makes the system more scalable and improves average query response time. Improvements in the capabilities of mobile devices and communication links encouraged us to introduce this extension. In this paper, we focus on range and k-NN continuous queries and their evaluation on MobiPLACE*. An experimental study compares MobiPLACE* and PLACE* in terms of server response time and memory.
2013 1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA), 2013
ABSTRACT This paper introduces the UNL framework as a collaborative framework that encourages and promotes the participation of linguists and non-linguists in the development of an integral natural language processing workbench. The UNL workbench includes a multitude of user-friendly back-end and front-end applications that facilitate the process of learning the UNL basics and participating in the development of resources within the UNL framework, as well as applications that perform several NLP tasks such as machine translation and editing. This workbench claims the ability to automatically analyze natural languages into their abstract semantic meanings, with the aim of finding the common denominator between all languages.
20th International Conference on Advanced Information Networking and Applications - Volume 1 (AINA'06), 2006
Abstract Association rules discovery is an important data mining technique which usually produces a large number of rules. Subset and superset queries are common queries for association rules. We introduce a new index structure (SSST) for querying association rules, based ...
This paper presents an interlingua approach to the machine translation of lengthy documents. This approach is based on encoding the source text in the form of universal semantic networks, using the Universal Networking Language (UNL) interlingua, which can then be decoded back into any natural language. This UNL technology has been applied to 1000 pages from the Encyclopedia
1. Abstract. This paper describes the decoding part in an interlingual system for man-machine communication in natural language. It is based on the Universal Networking Language (UNL) framework. Given a semantic network that represents a relation between a number of concepts, this network can be decoded (or 'DeConverted' in UNL technical terms) back to any natural language. This
Papers by Magdy Nagi