A federal magistrate judge in the Southern District of New York has rejected OpenAI’s request to reconsider an earlier order requiring it to hand over 20 million de-identified ChatGPT output logs to news publishers suing the company for copyright infringement.
In a detailed opinion issued by Magistrate Judge Ona T. Wang in the sprawling In re OpenAI Copyright Litigation, the court ruled that OpenAI failed to show any change in law, new evidence, or clear error that would justify revisiting the November 7 order. As a result, OpenAI must produce the log dataset, which the plaintiffs say is necessary to test the company’s models for potential misuse of their copyrighted material.
Background of the Dispute on ChatGPT logs
News organisations have been seeking access to ChatGPT output logs since May 2024 to determine whether the conversational AI system reproduced their copyrighted works and to test OpenAI’s fair-use and substantial non-infringing use defences.
OpenAI retains “tens of billions” of consumer ChatGPT logs. After the plaintiffs learnt earlier this year that the company had been deleting certain API, enterprise, and user-deleted consumer logs, Judge Wang ordered OpenAI to preserve all logs going forward.
Over the past year, both sides negotiated over what size of a “statistically valid” monthly sample OpenAI should produce for the merits phase. While publishers sought roughly 120 million logs, OpenAI repeatedly argued that a smaller 20-million-log sample was sufficient, citing the time and cost needed to de-identify the data. Plaintiffs eventually agreed to accept the 20-million-log sample.
But in October 2025, OpenAI told the plaintiffs it would not produce the entire sample and instead proposed using additional keyword filters to narrow the dataset. The publishers moved to compel, and Judge Wang ordered production of all 20 million ChatGPT logs on November 7.
OpenAI then sought reconsideration, claiming the sample was not proportional under Rule 26 and would expose user privacy.
The lawsuit, originally filed by The New York Times in 2023, is among the most closely watched copyright fights involving AI. It is part of a broader wave of cases targeting OpenAI, Microsoft and Meta for allegedly using copyrighted journalism to train AI systems without permission. Newspapers owned by Alden Global Capital’s MediaNews Group are also part of the suit, arguing that OpenAI’s business model unfairly exploits decades of reporting.
Court Rejects OpenAI’s Reconsideration Bid in ChatGPT Logs Case
Judge Wang denied the motion, finding:
- No new law or overlooked facts were presented.
- The logs are unquestionably relevant, not just to reproduction of copyrighted works, but also to OpenAI’s fair-use defence and its argument that ChatGPT has substantial non-infringing uses.
- Production is proportional, as the sample represents less than 0.05% of all retained logs, and OpenAI has already run…
Source link
Disclaimer
We strive to uphold the highest ethical standards in all of our reporting and coverage. We blogs.grocliq.com want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.
Website Upgradation is going on for any glitch kindly connect at [email protected]