MoFo’s Lokke Moerel and Marijn Storm published an article with IAPP in August 2024 on the “lawfulness” of web scraping for purposes of training a Large Language Model. In April 2023, the European Data Protection Board (EDPB) established its ChatGPT Taskforce to coordinate the various national enforcement actions taken by EU data protection authorities against OpenAI, the provider of ChatGPT. The article discusses the interim report issued by the ChatGPT Taskforce in May 2024, stating that LLMs may not be trained on special categories of data scraped from public websites. Lokke Moerel and Marijn Storm argue that this opinion is not only prohibitive for functional LLMs, but also untenable in light of the case law of the EU Court of Justice.
The article is part of a series on key data protection issues posed by large language models.
Read the full article.