Hypatia Digital Library: A novel text classification approach for small text fragments

Koulouris, Alexandros; Triantafyllou, Ioannis; Vorgia, Froso

Hypatia Digital Library: A novel text classification approach for small text fragments
Authors:	Koulouris, Alexandros Triantafyllou, Ioannis Vorgia, Froso
Issue Date:	1-Dec-2019
Journal:	Journal of Integrated Information Management
Volume:	4
Issue:	2
Keywords:	Digital libraries, Statistical natural language processing, Text classification, WEKA, Word stemming
Abstract:	Purpose-The purpose of this paper is to further investigate prior work of the authors in text classification in Hypatia, the digital library of University of Western Attica. The main objective is to provide an accurate automated classification tool as an alternative to manual assignments.Design/methodology/approach-The crucial point in text classification is the selection of the most important term-words for document representation. The specific document collection consists of 718 abstracts in Medicine, Tourism and Food Technology. Two weighting methods were investigated: classic TF. IDF and DEVMAX. DF. The last one was proposed by the authors as a more accurate term-word selection tool for smaller text fragments. Classification was conducted by applying 14 classifiers available on WEKA.Findings-Classification process yielded an excellent~ 97% precision score and DEVMAX. DF proved to perform better than classic TF. IDF.
DOI:	10.26265/jiim.v4i2.4420
URL:	http://users.uniwa.gr/akoul/pubs/jiim2019_hypatia.pdf
URI:	https://uniwacris.uniwa.gr/handle/3000/345
Type:	Article
Department:	Department of Archival, Library and Information Studies
School:	School of Administrative, Economics and Social Sciences
Affiliation:	University of West Attica (UNIWA)
Appears in Collections:	Articles / Άρθρα

CORE Recommender

Show full item record

Please use this identifier to cite or link to this item: https://uniwacris.uniwa.gr/handle/3000/345

Page view(s)

46

checked on Nov 5, 2024

Google Scholar^TM

Check

Page view(s)

Google Scholar^TM

Altmetric

Altmetric

CAMPUSES

EGALEO PARK

ANCIENT OLIVE GROVE

ATHENS

SCHOOLS

Sitemap

FOLLOW US

Page view(s)

Google ScholarTM

Altmetric

Altmetric

EGALEO PARK

ANCIENT OLIVE GROVE

ATHENS

Google Scholar^TM