Authors: | Triantafyllou, Ioannis Koulouris, Alexandros Vorgia, Froso |
Publisher: | Springer International Publishing Switzerland |
Issue Date: | 1-Jan-2017 |
Book: | Strategic Innovative Marketing |
Series: | Springer Proceedings in Business and Economics |
Keywords: | Digital libraries, Text classification, WEKA, Word stemming |
Abstract: | The purpose of this paper is to investigate the application of text classification in Hypatia, the digital library of Technological Educational Institute of Athens, in order to provide an automated classification tool as an alternative to manual assignments. The crucial point in text classification is the selection of the most important term-words for document representation. Classic weighting method TF.IDF was investigated. Our document collection consists of 718 abstracts in Medicine, Tourism and Food Technology. Classification was conducted utilizing 14 classifiers available on WEKA. Classification process yielded an excellent ~97 % precision score. |
ISBN: | 9783319338637 |
ISSN: | 21987254 21987246 |
DOI: | 10.1007/978-3-319-33865-1_89 |
URI: | https://uniwacris.uniwa.gr/handle/3000/325 |
Type: | Conference Paper |
Department: | Department of Archival, Library and Information Studies |
School: | School of Administrative, Economics and Social Sciences |
Affiliation: | University of West Attica (UNIWA) |
Appears in Collections: | Book Chapter / Κεφάλαιο Βιβλίου |
CORE Recommender
SCOPUSTM
Citations
50
4
checked on Nov 3, 2024
Page view(s)
43
checked on Nov 5, 2024
Google ScholarTM
Check
Altmetric
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.