Text and Data Mining in the EU Acquis Communautaire Tinkering with TDM & Digital Legal Deposit
Authors: Kanellopoulou - Botti, Maria 
Zampakolas, Christos 
Papadopoulos, Marinos 
Ganatsiou, Paraskevi 
Issue Date: 1-Jan-2019
Journal: Erasmus Law Review 
Volume: 12
Keywords: Web harvesting, Data analysis, Text & data mining, TDM, Computational text
Abstract: 
Text and Data Mining (hereinafter, TDM) issue for the purpose of scientific research or for any other purpose which is included in the provisions of the new EU Directive on Copyright in the Digital Single Market (hereinafter, DSM). TDM is a term that includes Web harvesting and Web Archiving activities. Web harvesting and archiving pertains to the processes of collecting from the web and archiving of works that reside on the Web. In the following analysis we will elaborate briefly upon provisions in EU Copyright law which were discussed during the proposal for a new Directive on Copyright in the DSM as well as provisions which are included in the text of art. 3 and art. 4 of the new Directive 2019/790/EU per TDM. In addition, the following analysis presents legislation in very few EU Member States which pertains to TDM and preceded the rulings of Directive 190 2019/790/EU. The most notable examples are the ones from the United Kingdom and Ireland, Germany, the Netherlands and France. The example of Australia is also presented later because it is one of the oldest and most successful worldwide. The National Library of Australia’s digital legal deposit is state of the art.
The execution of TDM for Web harvesting and archiving of works found on the Internet is based on algorithmic applications and information technology. It is thus an automated computational process, the precision of which depends on the evolution of algorithms and the software used for their implementation. Later in this article we make a reference to the most commonly used algorithms used in libraries for crawling and harvesting of works found online as well as for analysing their content with the aim of discovering new scientific knowledge from their analysis.
Finally, in the text of this article we will address General Data Protection Regulation (GDPR) issues that pertain to TDM and the data subjects whose works have been harvested and/or archived by a library deploying TDM.
URI: https://uniwacris.uniwa.gr/handle/3000/544
Type: Article
Department: Department of Archival, Library and Information Studies 
School: School of Administrative, Economics and Social Sciences 
Affiliation: University of West Attica (UNIWA) 
Appears in Collections:Articles / Άρθρα

CORE Recommender
Show full item record

Page view(s)

34
checked on Nov 25, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.