Dataconomy

Adobe is sued for using pirated books to train AI

The lawsuit claims Adobe's training data was contaminated with over one hundred thousand stolen works

by Emre Çıtak
December 18, 2025
in Industry

A proposed class-action lawsuit filed by Oregon author Elizabeth Lyon accuses Adobe of training its SlimLM AI model on pirated books, including her guidebooks. According to the complaint, the books reached the model through the SlimPajama-627B dataset, which was derived from the RedPajama collection, which in turn contains Books3.

Adobe has pursued extensive development in artificial intelligence over recent years. The company launched multiple AI services starting in 2023, with Firefly serving as its AI-powered media-generation suite designed for creating images, videos, and other media content from text prompts and inputs.

SlimLM represents a series of small language models that Adobe has optimized specifically for document assistance tasks on mobile devices. These models enable functions such as summarizing documents, extracting key information, and providing contextual help directly within mobile applications.

Adobe states that it pre-trained SlimLM using the SlimPajama-627B dataset. Cerebras released this dataset in June 2023 as a deduplicated, multi-corpora, open-source resource intended for training large language models. The dataset aggregates various text sources after removing duplicates to improve training efficiency and model performance.

Elizabeth Lyon, who specializes in guidebooks for non-fiction writing, initiated the lawsuit claiming that Adobe incorporated pirated versions of numerous books, including her own works, into the training process for SlimLM. The legal action seeks class-action status to represent other affected authors.

The lawsuit, first reported by Reuters, details how the SlimPajama dataset originated from the RedPajama dataset, which includes the Books3 collection of 191,000 books. The complaint states: “The SlimPajama dataset was created by copying and manipulating the RedPajama dataset (including copying Books3).” It continues: “Thus, because it is a derivative copy of the RedPajama dataset, SlimPajama contains the Books3 dataset, including the copyrighted works of Plaintiff and the Class members.” Lyon argues that her copyrighted materials appeared in this pre-training data without her consent or compensation.

Books3 has surfaced repeatedly in legal disputes within the AI sector, as developers have used it to train generative AI systems. The collection contains digitized texts from various genres and authors, making it a comprehensive but contentious training corpus. RedPajama, which incorporates Books3, has also been cited in multiple court cases.


Tags: Adobe, SlimLM, Books3

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.
