Authors: | Kapellas, Nikos Kapidakis, Sarantos |
Publisher: | Scitepress |
Issue Date: | 1-Jan-2023 |
Conference: | 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management,13-15 November 2023, Rome, Italy |
Is Part of: | Proceedings of the 15th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2023) |
Volume: | 2 |
Issue: | KEOD |
Keywords: | Event Detection, News Articles, Topic Modeling, Natural Language Processing, Unsupervised Clustering, Named Entity Recognition |
Abstract: | This research presents a comprehensive analysis of news articles with the primary objectives of exploring the underlying structure of the data and detecting events contained within news articles. The study collects articles from Greek online newspapers and focuses on analyzing a subset of this data, related to a predefined news topic. To achieve this, a hybrid approach that combines topic modeling, feature extraction, clustering, and named entity recognition, is employed. The obtained results prove to be satisfactory, as they demonstrate the effectiveness of the proposed methodology in news event detection and extracting relevant contextual information. This research provides valuable insights for multiple parties, including news organizations, researchers, news readers, and decision-making systems, as it contributes to the fields of event detection and clustering. Moreover, it deepens the understanding of applying solutions that do not require explicit human intervention, to real-world language challenges. |
ISBN: | 978-989-758-671-2 2184-3228 |
DOI: | 10.5220/0012234300003598 |
URI: | https://uniwacris.uniwa.gr/handle/3000/2801 |
Type: | Conference Paper |
Department: | Department of Archival, Library and Information Studies |
School: | School of Administrative, Economics and Social Sciences |
Affiliation: | University of West Attica (UNIWA) |
Appears in Collections: | Books / Βιβλία |
CORE Recommender
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.