[go: up one dir, main page]

Computer Science and Information Systems 2013 Volume 10, Issue 4, Pages: 1703-1726
https://doi.org/10.2298/CSIS121202065H
Full text ( 187 KB)


An evaluation of keyword, string similarity and very shallow syntactic matching for a university admissions processing infobot

Hancox Peter (School of Computer Science, University of Birmingham Edgbaston, Birmingham, United Kingdom)
Polatidis Nikolaos (Department of Applied Informatics, University of Macedonia, Thessaloniki, Greece)

“Infobots” are small-scale natural language question answering systems drawing inspiration from ELIZA-type systems. Their key distinguishing feature is the extraction of meaning from users’ queries without the use of syntactic or semantic representations. Three approaches to identifying the users’ intended meanings were investigated: keyword based systems, Jaro-based string similarity algorithms and matching based on very shallow syntactic analysis. These were measured against a corpus of queries contributed by users of aWWW-hosted infobot for responding to questions about applications to MSc courses. The most effective system was Jaro with stemmed input (78.57%). It also was able to process ungrammatical input and offer scalability.

Keywords: chatbot, infobot, question-answering, Jaro string similarity, Jaro-Winkler string similarity, shallow syntactic processing