Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

OpenAI reportedly deleted evidence in NY Times copyright lawsuit

The lawsuit alleges that OpenAI has scraped articles from The New York Times and Daily News without obtaining permission to train its models

byKerem Gülen
November 21, 2024
in Artificial Intelligence, News

Lawyers for The New York Times and Daily News claim that OpenAI inadvertently deleted crucial data related to their copyright lawsuit against the company regarding unauthorized use of their content, according to a TechCrunch report. The incident occurred after OpenAI agreed to provide access to its training datasets to aid the plaintiffs in verifying the usage of their copyrighted materials.

The lawsuit alleges that OpenAI has scraped articles from The New York Times and Daily News without obtaining permission to train its models. In response to the suit, OpenAI provided two virtual machines for the publishers’ attorneys to search its training data for their copyrighted content. Since November 1, the legal teams have dedicated more than 150 hours to this search. However, on November 14, OpenAI engineers mistakenly erased all search data stored on one of the virtual machines, as noted in a filing made in the U.S. District Court for the Southern District of New York.

OpenAI’s attempts to recover the deleted data were mostly successful, but the loss of the folder structure and file names rendered the recovered data unusable in tracking where the plaintiffs’ articles were included in the AI’s training. The letter filed by the plaintiffs’ counsel emphasized that they had to reconstruct their work, consuming extensive resources and time.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Despite the deletion of data, the counsel clarified that there is no indication the incident was intentional. They expressed concern that OpenAI is ideally positioned to search its own datasets, indicating an obligation to assist in the investigation of potential copyright infringement.


OpenAI just made macOS smarter with ChatGPT app support


OpenAI contends that using publicly available data for training its models falls under “fair use.” The company maintains that it does not need to license or compensate for these contents, even as it profits from its AI products. Nonetheless, OpenAI has entered into licensing agreements with several publishers, including prominent names like the Associated Press and Financial Times. While the specific terms of these deals remain undisclosed, it is reported that Dotdash, one of the partners, receives at least $16 million annually.

OpenAI has yet to issue a statement addressing the incident or its implications for its relationship with the plaintiffs.


Featured image credit: Jonathan Kemper/Unsplash

Tags: FeaturedopenAI

Related Posts

Could CTEM have prevented the Oracle Cloud breach?

Could CTEM have prevented the Oracle Cloud breach?

October 5, 2025
ChatGPT reportedly reduces reliance on Reddit as a data source

ChatGPT reportedly reduces reliance on Reddit as a data source

October 3, 2025
Perplexity makes Comet AI browser free, launches background assistant and Chess.com partnership

Perplexity makes Comet AI browser free, launches background assistant and Chess.com partnership

October 3, 2025
Light-powered chip makes AI computation 100 times more efficient

Light-powered chip makes AI computation 100 times more efficient

October 3, 2025
Free and effective anti-robocall tools are now available

Free and effective anti-robocall tools are now available

October 3, 2025
Choosing the right Web3 server: OVHcloud options for startups to enterprises

Choosing the right Web3 server: OVHcloud options for startups to enterprises

October 3, 2025

LATEST NEWS

Could CTEM have prevented the Oracle Cloud breach?

ChatGPT reportedly reduces reliance on Reddit as a data source

Perplexity makes Comet AI browser free, launches background assistant and Chess.com partnership

Light-powered chip makes AI computation 100 times more efficient

Free and effective anti-robocall tools are now available

Choosing the right Web3 server: OVHcloud options for startups to enterprises

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy Policy.