New York Times-ChatGPT lawsuit poses new legal threats to artificial intelligence

After a year of explosive growth, generative artificial intelligence (AI) may be facing its most significant legal threat yet from The New York Times.

The Times sued Microsoft and OpenAI, the company behind the popular ChatGPT tool, for copyright infringement shortly before the new year, alleging the companies impermissibly used millions of its articles to train their AI models.

The newspaper joins scores of writers and artists who have sued major technology companies in recent months for training AI on their copyrighted work without permission. Many of these lawsuits have hit roadblocks in court.

However, experts believe The Times’s complaint is sharper than earlier AI-related copyright suits.

“I think they have learned from some of the previous losses,” Robert Brauneis, a professor of intellectual property law at the George Washington University Law School, told The Hill.

The Times lawsuit is “a little bit less scattershot in their causes of action,” Brauneis said.

“The attorneys here for the New York Times are careful to avoid just kind of throwing up everything against the wall and seeing what sticks there,” he added. “They’re really concentrated on what they think will stick.”

Transformation vs. reproduction

Generative AI models require massive amounts of material for training. Large language models, like OpenAI’s ChatGPT and Microsoft’s Copilot, use the material they are trained on to predict which words are likely to follow a given string of text, producing human-like responses.

Typically, these AI models are transformative in nature, said Shabbi Khan, co-chair of the Artificial Intelligence, Automation, and Robotics group at the law firm Foley & Lardner.

“If you asked it a general query … it doesn’t do a search and find the right passage and just reproduce the passage,” Khan explained. “It will try to probabilistically create its own version of what needs to be said based on a pattern that it picks up through parsing billions of words of content.”
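To make the pattern-based generation Khan describes concrete, here is a toy sketch in Python: a tiny bigram model that learns which word tends to follow which in its training text and then samples a continuation probabilistically. This is a classroom-scale stand-in, not how ChatGPT or Copilot actually work; those systems use neural networks trained on billions of words, but the core idea of predicting the next token from what came before is the same.

```python
# Toy illustration of next-token prediction: a bigram model over a tiny corpus.
import random
from collections import defaultdict

corpus = (
    "the court ruled that the use was fair "
    "the court ruled that the claim was dismissed"
).split()

# Count which words follow each word in the training text.
next_counts = defaultdict(lambda: defaultdict(int))
for prev, nxt in zip(corpus, corpus[1:]):
    next_counts[prev][nxt] += 1

def generate(start: str, length: int = 8) -> str:
    """Sample a continuation by repeatedly picking a likely next word."""
    words = [start]
    for _ in range(length):
        candidates = next_counts.get(words[-1])
        if not candidates:
            break
        choices, weights = zip(*candidates.items())
        words.append(random.choices(choices, weights=weights)[0])
    return " ".join(words)

print(generate("the"))
```

Because the model samples from learned probabilities rather than looking passages up, its output usually recombines the training text rather than quoting it, which is the behavior at issue when it does reproduce long passages verbatim.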

However, in its suit against OpenAI and Microsoft, the Times alleges the AI models developed by the companies have “memorized” and can sometimes reproduce chunks of the newspaper’s articles.

“If individuals can access The Times’s highly valuable content through Defendants’ own products without having to pay for it and without having to navigate through The Times’s paywall, many will likely do so,” the lawsuit reads.

“Defendants’ unlawful conduct threatens to divert readers, including current and potential subscribers, away from The Times, thereby reducing the subscription, advertising, licensing, and affiliate revenues that fund The Times’s ability to continue producing its current level of groundbreaking journalism,” it adds.

In response to the lawsuit, an OpenAI spokesperson said in a statement that the company respects “the rights of content creators and owners” and is “committed to working with them to ensure they benefit from AI technology and new revenue models.”

While a Times spokesperson said the newspaper “recognizes the power and potential of GenAI for the public and for journalism,” they also emphasized that the AI models were built on “independent journalism and content that is only available because we and our peers reported, edited, and fact-checked it at high cost and with considerable expertise.”

“Settled copyright law protects our journalism and content,” the spokesperson added. “If Microsoft and OpenAI want to use our work for commercial purposes, the law requires that they first obtain our permission. They have not done so.”

Brauneis said some of the “most impressive” portions of the Times’s case are its repeated examples of the AI models simply regurgitating the newspaper’s material nearly verbatim.

Plaintiffs in earlier copyright lawsuits have not been able to show such direct reproductions of their material by the models, Khan noted.

In recent months, courts have dismissed claims from plaintiffs in similar lawsuits who argued that the outputs of particular AI models infringed their copyrights, because those plaintiffs failed to show outputs that were substantially similar to their copyrighted work.

“I think [the Times] did a good job relative to what maybe other complaints have been put out in the past,” Khan told The Hill. “They provided multiple examples of basically snippets and quite frankly more than snippets, passages of the New York Times as reproductions.”

Khan suggested the court could decide that particular use cases of generative AI are not transformative enough and require companies to limit certain prompts or outputs to prevent AI models from reproducing copyrighted content.

While Brauneis similarly noted the issue could result in an injunction against the tech companies or damages for the Times, he also emphasized it is not an unsolvable issue for generative AI.

“I think that the companies will respond to that and develop filters that dramatically reduce the incidence of that kind of output,” he said. “So, I don’t think that’s a long-term, huge problem for these companies.”

In an October response to an inquiry from the U.S. Copyright Office, OpenAI said it had developed measures to reduce the likelihood of “memorization” or verbatim repetition by its AI models, including removing duplications from its training data and teaching its models to decline prompts aimed at reproducing copyrighted works.
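For a concrete sense of what removing duplications from training data can involve, here is a minimal, hypothetical sketch: exact-duplicate removal by hashing normalized text. OpenAI has not published the details of its pipeline, and production systems typically also catch near-duplicates (for example, with MinHash-style fingerprinting), so this illustrates only the general idea that repeated copies of a passage, which make memorization more likely, are filtered out before training.

```python
# Minimal sketch of exact-duplicate removal from a training corpus
# by hashing normalized text. Illustrative only.
import hashlib

def normalize(text: str) -> str:
    # Lowercase and collapse whitespace so trivially different copies match.
    return " ".join(text.lower().split())

def deduplicate(documents: list[str]) -> list[str]:
    seen: set[str] = set()
    unique: list[str] = []
    for doc in documents:
        digest = hashlib.sha256(normalize(doc).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(doc)
    return unique

docs = ["Breaking news: ...", "breaking   news: ...", "A different article."]
print(len(deduplicate(docs)))  # -> 2
```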

The company noted, however, “Because of the multitude of ways a user may ask questions, ChatGPT may not be perfect at understanding and declining every request aimed at getting outputs that may include some part of content the model was trained on.”

The AI model is also equipped with output filters that can block potentially violative content that is generated despite other safeguards, OpenAI said.
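OpenAI has not described how those output filters work, but one common approach is to flag a candidate response that shares a long verbatim word sequence with protected text. The sketch below is a hypothetical illustration of that idea, not the company’s actual system; the 12-word threshold and the function names are assumptions made for the example.

```python
# Hypothetical output filter: block a generated response if it shares a long
# verbatim word sequence with protected source text.
def ngrams(words: list[str], n: int) -> set[tuple[str, ...]]:
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def looks_like_verbatim_copy(output: str, source: str, n: int = 12) -> bool:
    """Flag the output if any n consecutive words also appear in the source."""
    return bool(ngrams(output.lower().split(), n) & ngrams(source.lower().split(), n))

article = "..."    # protected text the filter checks against
candidate = "..."  # model output awaiting release
if looks_like_verbatim_copy(candidate, article):
    candidate = "I can't share that text verbatim."
```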

OpenAI also emphasized in a statement on Monday that memorization is a “rare bug” and alleged that the Times “intentionally manipulated prompts” in order to get ChatGPT to regurgitate its articles.

“Even when using such prompts, our models don’t typically behave the way The New York Times insinuates, which suggests they either instructed the model to regurgitate or cherry-picked their examples from many attempts,” the company said.

“Despite their claims, this misuse is not typical or allowed user activity, and is not a substitute for The New York Times,” it added. “Regardless, we are continually making our systems more resistant to adversarial attacks to regurgitate training data, and have already made much progress in our recent models.”

How the media, AI can shape each other

Carl Szabo, the vice president and general counsel of the tech industry group NetChoice, warned that lawsuits like the Times’s could stifle the industry.

“You’re gonna see a bunch of these efforts to kind of shakedown AI developers for money in a way that harms the public, that harms public access to information and kind of undermines the purpose of the Copyright Act, which is to promote human knowledge at the end of the day,” Szabo told The Hill.

Khan said he thinks there will eventually be a mechanism through which tech companies can obtain licenses to content, such as articles from the Times, for training their AI models.

OpenAI has already struck deals with The Associated Press and Axel Springer — a German media company that owns Politico, Business Insider and other publications — to use their content.

The Times also noted in its lawsuit that it reached out to Microsoft and OpenAI in April to raise intellectual property concerns and the possibility of an agreement, which OpenAI acknowledged in its statement about the case.

“Our ongoing conversations with the New York Times have been productive and moving forward constructively, so we are surprised and disappointed with this development,” a spokesperson said.

The OpenAI spokesperson added that the company is “hopeful that we will find a mutually beneficial way to work together.”

“I think most publishers will adopt that model because it provides for additional revenue to the company,” Khan told The Hill. “And we can see that because New York Times tried to enter into [an agreement]. So, there is a price that they’re willing to accept.”
