Authors: | Mastora, Anna Monopoli, Maria Kapidakis, Sarantos |
Issue Date: | 1-Jan-2011 |
Conference: | 1st Workshop on Digital Information Management, Corfu, Greece, 30-31 March, 2011 1st Workshop on Digital Information Management, Corfu, Greece, 30-31 March, 2011 |
Keywords: | Failed queries, Morpho-syntactic analysis, PoS tagging, Typing errors |
Abstract: | The aim of the study is to elaborate on the procedure needed in order to analyze morpho-syntactically the typing-error queries submitted in Greek during the search process. In the context of our analysis a failed query is a query which returned no hits. The analysis showed that failed queries represent 36% of the submitted queries. More specifically, 19.6% of failed queries occurred due to typing errors. We discovered that for analyzing morpho-syntactically a Greek text corpus the PoS tools need to be rich in tags in order to work adequately. Open Xerox tokenizer performed well but with significant pre-processing of the queries and the analyzer seems to require additional tools to improve its performance. MS Word which was used for spelling corrections seems to perform satisfactorily. All tools were challenged in terms of named entities recognition. |
URL: | https://core.ac.uk/download/pdf/290483622.pdf |
URI: | https://uniwacris.uniwa.gr/handle/3000/842 |
Type: | Conference Paper |
Department: | Department of Archival, Library and Information Studies |
School: | School of Administrative, Economics and Social Sciences |
Affiliation: | University of West Attica (UNIWA) |
Appears in Collections: | Conference Papers or Poster or Presentation / Δημοσιεύσεις σε Συνέδρια |
CORE Recommender
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.