DC FieldValueLanguage
dc.contributor.authorPapadopoulos, Marinos-
dc.contributor.authorZampakolas, Christos-
dc.contributor.authorKanellopoulou - Botti, Maria-
dc.contributor.authorGanatsiou, Paraskevi-
dc.date.accessioned2023-10-17T10:16:23Z-
dc.date.available2023-10-17T10:16:23Z-
dc.date.issued2020-01-
dc.identifier.otherT1wsRZ4AAAAJ:u5HHmVD_uO8C-
dc.identifier.urihttps://uniwacris.uniwa.gr/handle/3000/483-
dc.description.abstractAlmost two decades of experience on web harvesting and archiving are counted; the subject of web harvesting and web archiving have been top in the interest of researchers, technologists and librarians-information scientists. Web harvesting projects and pilot programs on archiving content traced on the Web are becoming priorities for national libraries and cultural heritage organizations in the EU. This paper pertains to web harvesting as a process for data mining from web and only through web (“pull” function); this paper elaborates upon research implemented in the framework of the funded research project titled “Web Archiving in Public Libraries and IP Law” that focused on the processes of web-harvesting and archiving as well as Text and Data Mining (TDM) operations in the national libraries of EU Member States. Web archiving as an official operation in national libraries of EU Member States creates web collections and preserves them for the purpose of being accessible and usable in perpetuity. This paper pertains to research on various components of web harvesting and archiving through an online survey (qualitative research) which targeted the national libraries of EU Member States. The research team of authors posed seventeen questions to EU national libraries. The survey output comes from answers delivered by 22 national libraries of EU Member States. The questionnaire was created through the use of Google forms. The researchers reached the EU national libraries via email and follow up telephone calls seeking libraries’ participation in the research. The aim of the research was to delve on participant libraries’ Text and Data Mining operation leveraging on Web harvesting and Web archiving technologies and operations. Results analysis reveals that web harvesting is considered among national libraries’ top priorities; the relevant projects increase in number, the web collections become more and more and the technological infrastructures and tools for web harvesting improve. Yet, there are many issues that remain unresolved. A significant number of surveyed libraries consider that legal and technical issues remain the most important to resolve. Access to harvested material is still under legal restrictions. The Directive 2019/790/EU on Copyright in the Digital Single Market (DSM) creates a favorable legal foundation for the deployment of web harvesting operations in national libraries of the EU Member States. TDM technologies make possible new areas of research. Web harvesting that was initially aimed for preservation purposes now expands to unprecedented research of national heritage through state-of-the-art automated TDM processes.en_US
dc.language.isoenen_US
dc.relation.ispartofOpen Journal of Philosophyen_US
dc.subjectTDMen_US
dc.subjectWeb Harvestingen_US
dc.subjectWeb Archivingen_US
dc.subjectNational Librariesen_US
dc.subjectSurveyen_US
dc.titleEmpirical research on web harvesting in the process of text and data mining in national libraries of EU member statesen_US
dc.typeArticleen_US
dc.relation.deptDepartment of Archival, Library and Information Studiesen_US
dc.relation.facultySchool of Administrative, Economics and Social Sciencesen_US
dc.relation.volume10en_US
dc.relation.issue1en_US
dc.identifier.spage88en_US
dc.identifier.epage112en_US
dc.linkhttps://www.researchgate.net/publication/339074670_Empirical_Research_on_Web_Harvesting_in_the_Process_of_Text_and_Data_Mining_in_National_Libraries_of_EU_Member_Statesen_US
dc.collaborationUniversity of West Attica (UNIWA)en_US
dc.subject.fieldSocial Sciencesen_US
dc.journalsOpen Accessen_US
dc.countryGreeceen_US
item.openairetypeArticle-
item.grantfulltextnone-
item.fulltextNo Fulltext-
item.cerifentitytypePublications-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.languageiso639-1en-
crisitem.author.deptDepartment of Archival, Library and Information Studies-
crisitem.author.facultySchool of Administrative, Economics and Social Sciences-
crisitem.author.orcid0009-0004-3941-0725-
crisitem.author.parentorgSchool of Administrative, Economics and Social Sciences-
Appears in Collections:Articles / Άρθρα
CORE Recommender
Show simple item record

Page view(s)

30
checked on Sep 11, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.