New York Times sues OpenAI, Microsoft in copyright clash
Sign up now: Get ST's newsletters delivered to your inbox
The Times is seeking damages, as well as an order that the companies stop using its content – and destroy data already harvested.
PHOTO: REUTERS
Follow topic:
NEW YORK – The New York Times sued ChatGPT-maker OpenAI and Microsoft in a United States court on Dec 27, alleging that the companies’ powerful artificial intelligence (AI) models used millions of articles for training without permission.
Through their AI chatbots, the companies “seek to free-ride on The Times’ massive investment in its journalism by using it to build substitutive products without permission or payment”, the lawsuit said.
Copyright is becoming a major battleground for the much-hyped generative AI sector, with publishers, musicians and artists increasingly lawyering up to get paid for technology that is being built with their content.
With the suit, the New York Times also chose a more confrontational approach to the sudden rise of AI chatbots, in contrast to other media groups such as Germany’s Axel Springer and the Associated Press that have entered content deals with OpenAI.
“If The Times and other news organisations cannot produce and protect their independent journalism, there will be a vacuum that no computer or artificial intelligence can fill,” said the Times’ complaint.
“Less journalism will be produced, and the cost to society will be enormous.”
The Times, one of the most respected news organisations in the US, is seeking damages and an order that the companies stop using its content for the training of AI models – and destroy data already harvested.
While no sum is specifically requested, the Times alleges that the infringement could have cost “billions of dollars in statutory and actual damages.”
“We respect the rights of content creators and owners,” OpenAI said in an emailed statement. “Our ongoing conversations with the New York Times have been productive and moving forward constructively, so we are surprised and disappointed with this development.”
Microsoft did not respond to requests for comment.
Microsoft, the world’s second biggest company by market capitalisation, is a major investor in OpenAI, and swiftly implemented the powers of AI to its own products after the release of ChatGPT in 2022.
The AI models that power ChatGPT and Microsoft’s Copilot (formerly Bing) were trained for years on content available on the Internet, under the assumption that it was fair to be used without need for compensation.
But the lawsuit, filed in a federal court in New York, argued that the use of the Times’ work was unlawful, notably because the products being built created a potential rival and threatened the news company’s ability to provide quality journalism.
The Times said it attempted to seal a content agreement with OpenAI and Microsoft, but that the companies maintained that their technology was “transformative” and therefore did not require a commercial arrangement.
“There is nothing ‘transformative’ about using The Times’ content without payment to create products that substitute for The Times and steal audiences away from it,” the lawsuit alleged.
The lawsuit also said that content generated by ChatGPT and Copilot closely mimicked New York Times style and that its output was given a privileged status in perfecting the chatbot technology.
It also said that bot-generated content that was false was sourced incorrectly to The New York Times.
“In the ocean of repetitive junk data that models are trained over, decades of news archives are very useful,” wrote competition expert Cristina Caffarra on X.
“This is about division of rents, expect a lot more of it,” she added.
The emerging AI giants are facing a wave of lawsuits over their use of Internet content to build their AI systems that create content on simple prompts.
In 2022, Game Of Thrones author George R.R. Martin and other best-selling fiction writers filed a class-action lawsuit against OpenAI, accusing the start-up of violating their copyrights to fuel ChatGPT.
Universal and other music publishers have sued AI company Anthropic in a US court for using copyrighted lyrics to train its AI systems and in generating answers to user queries.
US photo distributor Getty Images has accused Stability AI of profiting from its pictures and those of its partners in order to make visual AI that creates original images on simple demand.
With lawsuits fast piling up, Microsoft and AI player Google have announced that they would provide legal protection for customers sued for copyright infringement over content generated by their AI models. AFP, REUTERS

