NY Times Sues OpenAI, Microsoft Over AI Content Theft

The New York Times Takes on OpenAI and Microsoft

The New York Times has filed a lawsuit against OpenAI and Microsoft, alleging that the two companies built their AI models by copying and using millions of the publication’s articles without permission. The lawsuit claims that OpenAI’s ChatGPT and Microsoft’s Copilot can generate output that recites Times content verbatim, closely summarizes it, and mimics its expressive style.

The Times alleges that this use of its content undermines and damages its relationship with readers, while also depriving it of subscription, licensing, advertising, and affiliate revenue. The lawsuit argues that these AI models threaten high-quality journalism by hurting the ability of news outlets to protect and monetize content.

A History of Tension Between News Outlets and AI Companies

The New York Times is one of many news outlets that have blocked OpenAI’s web crawler in recent months, preventing the AI company from continuing to scrape content from its website and using the data to train AI models. The BBC, CNN, and Reuters have moved to block OpenAI’s web crawler as well. Other publications, however, are embracing AI — or, at least, the payments that come with it.

Axel Springer, which owns Politico and Business Insider, struck a deal with OpenAI earlier this month that allows ChatGPT to pull information directly from both sources, while the Associated Press is allowing OpenAI to train its models on its news stories for the next two years.

The Stakes for the Future of Journalism

The lawsuit filed by The New York Times is significant because it directly challenges the business model of AI companies like OpenAI and Microsoft. The Times is seeking billions of dollars in statutory and actual damages, as well as an order preventing OpenAI and Microsoft from training their AI models using its content.

The case also raises important questions about the role of AI in journalism and the need for publishers to be compensated for their work. As the use of AI models becomes more widespread, news outlets are increasingly concerned about the potential impact on their business.

What’s Next

The lawsuit is likely to be closely watched by the tech industry and the media. OpenAI and Microsoft have until now been able to use content from news outlets to train their AI models without facing significant pushback. But The New York Times is a major publication with a strong reputation for its journalism.

The case could set a precedent for how AI companies use content from publishers in the future. It could also lead to changes in the way that AI models are trained and the way that publishers are compensated for their work.

Industry Context

The dispute between The New York Times and OpenAI and Microsoft is part of a larger debate about the role of AI in the media. As AI models become more sophisticated, they are increasingly able to generate content that is similar to human-written content.

This has raised concerns among publishers about the potential impact on their business. If AI models can generate content that is similar to human-written content, then why do publishers need to invest in human journalists?

The lawsuit filed by The New York Times is a significant development in this debate. It highlights the need for publishers to be compensated for their work and for AI companies to be transparent about how they use content from publishers.

Technical Mechanics

The AI models at issue in the lawsuit are large language models (LLMs) that are trained on vast amounts of text data. These models are able to generate human-like text by predicting the next word in a sequence of text.

The LLMs used by OpenAI and Microsoft are trained on a massive dataset of text that includes millions of articles from The New York Times. The Times alleges that this use of its content without permission is a copyright infringement.

The case raises important questions about the intersection of copyright law and AI technology. It could lead to changes in the way that AI models are trained and the way that publishers are compensated for their work.

Conclusion

The lawsuit filed by The New York Times against OpenAI and Microsoft is a significant development in the debate about the role of AI in the media. It highlights the need for publishers to be compensated for their work and for AI companies to be transparent about how they use content from publishers.

What to Watch

The next step in the lawsuit will be for OpenAI and Microsoft to respond to the allegations made by The New York Times. The case is likely to be closely watched by the tech industry and the media.

The outcome of the case could have significant implications for the future of journalism and the use of AI in the media. It could lead to changes in the way that AI models are trained and the way that publishers are compensated for their work.

The reader should track the decision on this case, as well as any potential appeals. The case could set a precedent for how AI companies use content from publishers in the future.

NY Times Sues OpenAI, Microsoft Over AI Content Theft

The New York Times Takes on OpenAI and Microsoft

A History of Tension Between News Outlets and AI Companies

The Stakes for the Future of Journalism

What’s Next

Industry Context

Technical Mechanics

Conclusion

What to Watch

Stay in the loop

Related Articles

AI Copyright Infringement Lawsuits Mount

Microsoft's Data Centers Threaten Clean Energy Goals

OpenAI Trial Unveils Founding Tensions