In Search of Oblivion? How the 'Right to be Forgotten' Could Undermine Web-based Corpora

Sharon Creese

Research output: Contribution to journalArticle

7 Downloads (Pure)


Corpus linguists are now facing a new challenge to collecting accurate data for web-based corpora: the 'Right to be Forgotten'. This element of data protection legislation allows individuals to request that links to webpages be removed if the information contained there can now be considered inaccurate, irrelevant or excessive. The potential difficulties this poses for researchers are illustrated by my experience collecting data for a corpus of neologisms appearing in online versions of UK national newspapers.
Original languageEnglish
Pages (from-to)95–102
JournalProcedia - Social and Behavioral Sciences
Publication statusPublished - 24 Jul 2015


Bibliographical note

This article is distributed under a creative commons attribution - non-commercial - no derivatives licence .


  • corpus linguistics
  • corpora
  • neologism
  • Google
  • newspaper
  • Right to be Forgotten

Cite this